Time |
Nickname |
Message |
00:03
🔗
|
|
antomatic has joined #archiveteam-bs |
00:05
🔗
|
|
antomati_ has quit IRC (Read error: Operation timed out) |
00:38
🔗
|
|
GE has quit IRC (Remote host closed the connection) |
00:52
🔗
|
|
BlueMaxim has quit IRC (Read error: Operation timed out) |
02:26
🔗
|
godane |
so i found a youtube user who saved a ton of WABC and WCBS 6pm news broadcasts from march 1993 |
02:52
🔗
|
* |
Somebody2 is grumpy about UNESCO's Open Access policy... |
02:53
🔗
|
Somebody2 |
They license all the *text* they write under CC BY-SA -- but they don't *distribute* files consisting of only that text. |
02:54
🔗
|
Somebody2 |
Instead, they distribute files *ALSO* including various graphics and images from others, *NOT* licensed under any open license. |
02:54
🔗
|
Somebody2 |
Thereby making it impossible for others to legally mirror what they distribute! |
02:59
🔗
|
|
Silvan has joined #archiveteam-bs |
03:00
🔗
|
|
SilSte has quit IRC (Read error: Operation timed out) |
03:18
🔗
|
|
ndiddy has quit IRC () |
03:29
🔗
|
pikhq |
Or at least, highly impractical. |
04:01
🔗
|
Somebody2 |
yes, impossible to use the license they automatically grant |
04:10
🔗
|
|
bwn has quit IRC (Read error: Operation timed out) |
04:11
🔗
|
|
pizzaiolo has quit IRC (Remote host closed the connection) |
04:13
🔗
|
|
bwn has joined #archiveteam-bs |
04:35
🔗
|
|
Sk1d has quit IRC (Ping timeout: 194 seconds) |
04:41
🔗
|
|
Sk1d has joined #archiveteam-bs |
05:43
🔗
|
|
Aranje has quit IRC (Quit: Three sheets to the wind) |
06:09
🔗
|
hook54321 |
Yik Yak is probably going to shut down soon! We need to find a way to archive stuff from it soon. |
06:10
🔗
|
nyany |
that's that silly app for posting anonymous notes about your local area right? |
06:11
🔗
|
hook54321 |
Yeah. They also have a Regional section and some other stuff. |
06:23
🔗
|
|
Nyx has quit IRC (Ping timeout: 260 seconds) |
06:38
🔗
|
|
BlueMaxim has joined #archiveteam-bs |
06:56
🔗
|
|
BlueMaxim has quit IRC (Read error: Operation timed out) |
06:56
🔗
|
|
BlueMaxim has joined #archiveteam-bs |
07:14
🔗
|
|
Nyx has joined #archiveteam-bs |
07:26
🔗
|
|
odemg has quit IRC (Remote host closed the connection) |
07:28
🔗
|
|
schbirid has joined #archiveteam-bs |
07:42
🔗
|
|
odemg has joined #archiveteam-bs |
07:43
🔗
|
|
JAA has joined #archiveteam-bs |
07:47
🔗
|
|
GE has joined #archiveteam-bs |
08:03
🔗
|
|
Jonison has joined #archiveteam-bs |
08:17
🔗
|
|
Honno has joined #archiveteam-bs |
08:18
🔗
|
|
schbirid2 has joined #archiveteam-bs |
08:22
🔗
|
|
schbirid has quit IRC (Read error: Operation timed out) |
08:57
🔗
|
|
username1 has joined #archiveteam-bs |
09:02
🔗
|
|
schbirid2 has quit IRC (Read error: Operation timed out) |
09:16
🔗
|
|
schbirid2 has joined #archiveteam-bs |
09:19
🔗
|
|
username1 has quit IRC (Read error: Operation timed out) |
09:21
🔗
|
godane |
so i'm manually grabbing the RedEye Magazine |
09:22
🔗
|
godane |
from chicago tribune |
09:37
🔗
|
|
username1 has joined #archiveteam-bs |
09:43
🔗
|
|
schbirid2 has quit IRC (Read error: Operation timed out) |
09:56
🔗
|
|
schbirid2 has joined #archiveteam-bs |
10:00
🔗
|
|
username1 has quit IRC (Read error: Operation timed out) |
10:00
🔗
|
|
edsu has joined #archiveteam-bs |
10:01
🔗
|
|
pnJay has quit IRC (Leaving) |
10:02
🔗
|
|
GE has quit IRC (Remote host closed the connection) |
10:24
🔗
|
|
username1 has joined #archiveteam-bs |
10:27
🔗
|
|
schbirid2 has quit IRC (Read error: Operation timed out) |
10:51
🔗
|
|
JAA has quit IRC (Quit: Page closed) |
10:53
🔗
|
|
schbirid2 has joined #archiveteam-bs |
10:57
🔗
|
|
username1 has quit IRC (Read error: Operation timed out) |
11:15
🔗
|
godane |
so i found a way to grab the RedEye Magazine |
11:16
🔗
|
godane |
i just had to login using facebook then run httpfox when downloading a pdf |
11:16
🔗
|
godane |
you can see some api.readoz.com urls |
11:17
🔗
|
godane |
the first /download/ url has the authorization data in it |
11:28
🔗
|
|
icedice has joined #archiveteam-bs |
12:11
🔗
|
godane |
so i found a way to grab all the readoz.com ids for a channel |
12:14
🔗
|
|
username1 has joined #archiveteam-bs |
12:18
🔗
|
|
schbirid2 has quit IRC (Read error: Operation timed out) |
12:37
🔗
|
|
odemg has quit IRC (Remote host closed the connection) |
12:40
🔗
|
|
schbirid2 has joined #archiveteam-bs |
12:45
🔗
|
|
username1 has quit IRC (Read error: Operation timed out) |
12:53
🔗
|
|
pizzaiolo has joined #archiveteam-bs |
13:08
🔗
|
|
username1 has joined #archiveteam-bs |
13:11
🔗
|
|
schbirid2 has quit IRC (Read error: Operation timed out) |
13:14
🔗
|
|
odemg has joined #archiveteam-bs |
13:16
🔗
|
|
odemg has quit IRC (Remote host closed the connection) |
13:27
🔗
|
|
schbirid2 has joined #archiveteam-bs |
13:30
🔗
|
|
username1 has quit IRC (Read error: Operation timed out) |
13:31
🔗
|
|
icedice has quit IRC (Quit: Leaving) |
13:45
🔗
|
|
kniffy has quit IRC (Ping timeout: 240 seconds) |
13:48
🔗
|
|
username1 has joined #archiveteam-bs |
13:51
🔗
|
|
schbirid2 has quit IRC (Read error: Operation timed out) |
13:56
🔗
|
|
pnJay has joined #archiveteam-bs |
14:00
🔗
|
|
kniffy has joined #archiveteam-bs |
14:07
🔗
|
|
BlueMaxim has quit IRC (Read error: Operation timed out) |
14:10
🔗
|
|
odemg has joined #archiveteam-bs |
14:24
🔗
|
|
schbirid2 has joined #archiveteam-bs |
14:27
🔗
|
|
username1 has quit IRC (Read error: Operation timed out) |
14:29
🔗
|
|
tuluut has joined #archiveteam-bs |
14:40
🔗
|
|
Nume has joined #archiveteam-bs |
14:40
🔗
|
Nume |
hello~ |
14:41
🔗
|
Nume |
[17:35] <arkiver> I guess give webarchiveplayer some time [17:35] <arkiver> 140 GB is big [17:36] <arkiver> you can also browse it in the wayback machine |
14:41
🔗
|
arkiver |
hi |
14:41
🔗
|
Nume |
so about this |
14:41
🔗
|
arkiver |
#archiveteam is more for announcements |
14:41
🔗
|
Nume |
oh I see |
14:41
🔗
|
Aoede |
Nume: I might be able to get the stories for you |
14:41
🔗
|
hook54321 |
arkiver: this is #archiveteam-bs |
14:41
🔗
|
arkiver |
yes |
14:42
🔗
|
arkiver |
(see #archiveteam) |
14:42
🔗
|
Nume |
Aoede, really? |
14:42
🔗
|
Nume |
I would be so grateful |
14:42
🔗
|
hook54321 |
oh, nvm |
14:48
🔗
|
|
username1 has joined #archiveteam-bs |
14:52
🔗
|
|
schbirid2 has quit IRC (Read error: Operation timed out) |
14:57
🔗
|
rocode |
Nume, where did you get the WARC file? |
14:59
🔗
|
Nume |
archive.org |
15:01
🔗
|
rocode |
I was more checking to see if I was the uploader, in which case I could help you out. :) |
15:01
🔗
|
rocode |
Also note that any WARC file properly uploaded on archive.org will show up in the Wayback Machine. |
15:02
🔗
|
nightpool |
(unless robots.txt or otherwise excluded, right?) |
15:03
🔗
|
nightpool |
(or am I mistaken about that?) |
15:03
🔗
|
rocode |
Correct. robots.txt will exclude. |
15:04
🔗
|
Aoede |
I got this |
15:04
🔗
|
Nume |
Aoede already agreed to help me find the stories I needed, but thank you a lot for your help as well! |
15:05
🔗
|
nightpool |
I'm more asking for my own benefit, but yeah |
15:05
🔗
|
Nume |
I might be wrong but I think robots.txt blocked fanfiction |
15:05
🔗
|
Nume |
if that's the correct term even, I am a bit confused by this whole web archive thing ^^ |
15:10
🔗
|
|
schbirid2 has joined #archiveteam-bs |
15:13
🔗
|
|
username1 has quit IRC (Read error: Operation timed out) |
15:14
🔗
|
rocode |
r/PLACE snapshots are now added: https://archive.org/details/PLACE-SNAPSHOTS |
15:14
🔗
|
nightpool |
yooooo. that's cool |
15:15
🔗
|
rocode |
I am also working on a datafile that tracks diffs, since a lot of devs/artists are using the snapshots for cool things. |
15:19
🔗
|
Aoede |
very cool. Reminded me of Drawball |
15:19
🔗
|
Kaz |
diffs would be awesome |
15:19
🔗
|
Kaz |
Any idea if reddit plans to release the raw data? |
15:20
🔗
|
rocode |
No, but I have 10-second snapshots being uploaded next, which the diff data was drawn from. |
15:20
🔗
|
rocode |
So that is probably the best we are going to get. |
15:20
🔗
|
Kaz |
ah i see, awesome |
15:21
🔗
|
K4k |
hook54321: I asked around a little and unfortunately Yik Yak does not have a public API we could use for archiving them. Going to have to find another way. |
15:32
🔗
|
nightpool |
Woah, wait, what's going on with yik yak? |
15:33
🔗
|
nightpool |
Also, don't messages on Yik Yak disappear quickly anyway? It's good to grab a snapshot but I'm not sure how much there is there to archive. |
15:38
🔗
|
|
username1 has joined #archiveteam-bs |
15:41
🔗
|
|
schbirid2 has quit IRC (Read error: Operation timed out) |
15:48
🔗
|
|
GE has joined #archiveteam-bs |
16:03
🔗
|
|
schbirid2 has joined #archiveteam-bs |
16:07
🔗
|
|
username1 has quit IRC (Read error: Operation timed out) |
16:20
🔗
|
xmc |
they disappear after a few days, and they're only visible a mile or so from where they were posted |
16:21
🔗
|
rocode |
Plus they stripped the anonymity. |
16:21
🔗
|
rocode |
Not touching that with a 100 ft pole. |
16:21
🔗
|
|
xmc sets mode: +oooo midas HCross2 Lord_Nigh Sanqui |
16:21
🔗
|
|
xmc sets mode: +oooo yipdw balrog arkiver swebb |
16:21
🔗
|
|
swebb sets mode: +o DFJustin |
16:21
🔗
|
|
swebb sets mode: +o SadDM |
16:21
🔗
|
|
swebb sets mode: +o antomatic |
16:21
🔗
|
|
swebb sets mode: +o brayden |
16:21
🔗
|
|
swebb sets mode: +o edsu |
16:21
🔗
|
|
xmc sets mode: +oooo chfoo chazchaz godane DFJustin |
16:21
🔗
|
|
xmc sets mode: +o schbirid2 |
16:22
🔗
|
rocode |
(I hope your automatic op scripts check host and not just username. :P) |
16:22
🔗
|
xmc |
that wasn't an auto op, that was me scrolling thru the userlist |
16:22
🔗
|
xmc |
so ... not really |
16:22
🔗
|
rocode |
I was referring to swebb. :) |
16:23
🔗
|
xmc |
oh, yeah, swebb's do |
16:31
🔗
|
|
username1 has joined #archiveteam-bs |
16:35
🔗
|
|
schbirid2 has quit IRC (Read error: Operation timed out) |
16:39
🔗
|
rocode |
Snapshot diffs uploaded. https://archive.org/details/PLACE-SNAPSHOT-DIFFS |
16:40
🔗
|
xmc |
kool |
16:41
🔗
|
rocode |
Once the archivebot jobs finish I think we are 100% on archival for r/PLACE. |
16:45
🔗
|
HCross2 |
joepie91: my NorthHosts hardware is finally on the way back |
17:00
🔗
|
|
schbirid2 has joined #archiveteam-bs |
17:02
🔗
|
|
username1 has quit IRC (Read error: Operation timed out) |
17:13
🔗
|
joepie91 |
HCross2: could have gone worse |
17:13
🔗
|
joepie91 |
:p |
17:14
🔗
|
rocode |
Microsoft is shutting down literally all of their research division projects. First it was so.cl, now it is their open source repos. |
17:14
🔗
|
rocode |
Jesus. |
17:15
🔗
|
HCross2 |
joepie91: had a bit of a go at Jon.. and he's packaged and sent it all for free |
17:32
🔗
|
Smiley |
rocode: awesome :) |
17:35
🔗
|
|
hook54321 has quit IRC (Ping timeout: 244 seconds) |
17:36
🔗
|
|
tammy_ has quit IRC (Ping timeout: 244 seconds) |
17:37
🔗
|
|
tammy_ has joined #archiveteam-bs |
17:46
🔗
|
|
hook54321 has joined #archiveteam-bs |
17:54
🔗
|
K4k |
nightpool: Yik Yak is shutting down |
17:54
🔗
|
K4k |
supposedly |
17:57
🔗
|
|
odemg has quit IRC (Remote host closed the connection) |
17:58
🔗
|
hook54321 |
K4k: As far as i know, they haven't officially said that it is. |
17:59
🔗
|
K4k |
It's been in the rumor mill for ~6 months at least. |
18:02
🔗
|
|
hook54321 has quit IRC (Ping timeout: 244 seconds) |
18:04
🔗
|
|
tuluut has quit IRC (Ping timeout: 244 seconds) |
18:04
🔗
|
|
tuluut has joined #archiveteam-bs |
18:04
🔗
|
|
JAA has joined #archiveteam-bs |
18:07
🔗
|
|
hook54321 has joined #archiveteam-bs |
18:08
🔗
|
hook54321 |
K4k: There's a web interface |
18:12
🔗
|
hook54321 |
https://www.yikyak.com/yak/R/581b8281266910ccd6282f43cc10f |
18:13
🔗
|
hook54321 |
the wayback machine either doesn't save or doesn't show the replies. archive.is doesn't either. |
18:20
🔗
|
rocode |
SFF.net has updated their robots.txt to allow our archives to be browsed on the wayback machine. We should be good now. |
18:43
🔗
|
|
pizzaiolo has quit IRC (Ping timeout: 245 seconds) |
18:47
🔗
|
|
pizzaiolo has joined #archiveteam-bs |
18:53
🔗
|
|
odemg has joined #archiveteam-bs |
18:54
🔗
|
|
Nume has left |
19:00
🔗
|
|
ndiddy has joined #archiveteam-bs |
19:03
🔗
|
|
JensRex has quit IRC (Remote host closed the connection) |
19:04
🔗
|
|
JensRex has joined #archiveteam-bs |
19:13
🔗
|
|
username1 has joined #archiveteam-bs |
19:18
🔗
|
|
schbirid2 has quit IRC (Read error: Operation timed out) |
19:20
🔗
|
Kaz |
rocode: also just caused sublime to balloon to 8.5GB of RAM use before it crashed |
19:21
🔗
|
rocode |
wtf is researchgate |
19:27
🔗
|
|
wowaname has joined #archiveteam-bs |
19:30
🔗
|
hook54321 |
rocode: ResearchGate is a social networking site for scientists and researchers to share papers, ask and answer questions, and find collaborators |
19:31
🔗
|
username1 |
scientific data silo :( https://www.researchgate.net/ |
19:31
🔗
|
hook54321 |
All you need to sign up is a .edu email address or an invite |
19:31
🔗
|
hook54321 |
silo? |
19:31
🔗
|
xmc |
i have a .edu email address :) |
19:31
🔗
|
xmc |
thanks to my alma mater never retiring them |
19:31
🔗
|
rocode |
hook54321, looks like different context according to https://en.wikipedia.org/wiki/Sci-Hub |
19:32
🔗
|
username1 |
nah, rg hosts papers as well |
19:32
🔗
|
username1 |
but you cannot scrape much |
19:32
🔗
|
|
schbirid2 has joined #archiveteam-bs |
19:32
🔗
|
hook54321 |
xmc: what's an alma mater? |
19:32
🔗
|
schbirid2 |
fuck this isp |
19:32
🔗
|
xmc |
the school i went to, hook54321 |
19:32
🔗
|
xmc |
might be just an american term |
19:33
🔗
|
hook54321 |
I'm in the US... |
19:33
🔗
|
rocode |
https://en.wikipedia.org/wiki/Alma_mater |
19:33
🔗
|
hook54321 |
username1: iirc you can access PDFs even if you aren't logged in. |
19:33
🔗
|
rocode |
it's a common term. |
19:33
🔗
|
hook54321 |
However you need to have the link |
19:34
🔗
|
hook54321 |
to the pdf |
19:34
🔗
|
schbirid2 |
yes maybe |
19:34
🔗
|
schbirid2 |
not wanna discuss, sorry |
19:35
🔗
|
|
username1 has quit IRC (Read error: Operation timed out) |
19:35
🔗
|
hook54321 |
rocode: I don't see anything in the scihub article about ResearchGate |
19:36
🔗
|
rocode |
I am so confused at this point I am just going to stop. |
19:38
🔗
|
schbirid2 |
sci-hub = site to pirate papers |
19:38
🔗
|
schbirid2 |
rg = "social network" for researchers, including huge paper collection |
19:38
🔗
|
schbirid2 |
sci-hub was showing they had many more papers than rg |
19:38
🔗
|
schbirid2 |
EOS |
19:38
🔗
|
hook54321 |
for authors to share their papers |
19:38
🔗
|
hook54321 |
oh |
19:39
🔗
|
schbirid2 |
people upload lots of not-their-own papers |
19:39
🔗
|
|
GE has quit IRC (Remote host closed the connection) |
19:39
🔗
|
hook54321 |
on rg or sci-hub? |
19:39
🔗
|
schbirid2 |
rg |
19:40
🔗
|
hook54321 |
I thought that it doesn't let people do that |
19:40
🔗
|
|
dzl has joined #archiveteam-bs |
19:40
🔗
|
hook54321 |
heh. someone responded to my request to their article with this: "Sorry, I am unable to share my full-text because I don't know if I have permission to." |
19:43
🔗
|
rocode |
Science! |
19:45
🔗
|
|
PyrEx has joined #archiveteam-bs |
19:47
🔗
|
|
GE has joined #archiveteam-bs |
20:10
🔗
|
JAA |
Luckily, open access journals are becoming more and more common. |
20:19
🔗
|
|
schbirid2 has quit IRC (Quit: Leaving) |
20:27
🔗
|
|
Honno has quit IRC (Quit: Leaving) |
20:33
🔗
|
|
pnJay has quit IRC (Quit: Leaving) |
20:43
🔗
|
|
sep332 has joined #archiveteam-bs |
20:45
🔗
|
|
Jonison has quit IRC (Read error: Connection reset by peer) |
20:45
🔗
|
|
sep332_ has quit IRC (Read error: Operation timed out) |
20:55
🔗
|
|
icedice has joined #archiveteam-bs |
21:00
🔗
|
|
pnJay has joined #archiveteam-bs |
21:11
🔗
|
|
odemg has quit IRC (Remote host closed the connection) |
21:12
🔗
|
|
odemg has joined #archiveteam-bs |
21:30
🔗
|
rocode |
MLKSHK is live. http://archiveteam.org/index.php?title=MLKSHK |
21:30
🔗
|
rocode |
Warriors needed etc. |
21:31
🔗
|
pnJay |
JensRex, I will race you to a TB on the mlkshk project! :) |
21:31
🔗
|
pnJay |
although i have a headstart <_< |
21:32
🔗
|
JAA |
Any idea yet how many threads are safe? |
21:32
🔗
|
rocode |
They don't seem to be throttling. |
21:33
🔗
|
pnJay |
im running 10 on all my warriors so far. No problems so far |
21:36
🔗
|
* |
JAA queues Mr. Burns's "egg salad". |
22:05
🔗
|
jtn2 |
I was moaning about gmane.org the other day, so FWIW: it looks like their archives are alive and well if you access news.gmane.org via NNTP, it's just the web front end that's not usable. |
22:05
🔗
|
jtn2 |
(This may be news to nobody) |
22:06
🔗
|
xmc |
excellent, i assumed as much but didn't bother to check |
22:10
🔗
|
|
odemg has quit IRC (Remote host closed the connection) |
22:13
🔗
|
|
odemg has joined #archiveteam-bs |
22:15
🔗
|
|
JAA has quit IRC (Quit: Page closed) |
22:16
🔗
|
Kaz |
rocode: channel? |
22:16
🔗
|
pnJay |
#totheyard |
22:17
🔗
|
|
odemg has quit IRC (Remote host closed the connection) |
22:30
🔗
|
|
odemg has joined #archiveteam-bs |
22:42
🔗
|
|
GE has quit IRC (Remote host closed the connection) |
22:45
🔗
|
|
pizzaiol1 has joined #archiveteam-bs |
22:52
🔗
|
|
pizzaiolo has quit IRC (Remote host closed the connection) |
23:00
🔗
|
|
BlueMaxim has joined #archiveteam-bs |
23:34
🔗
|
|
pizzaiol1 has quit IRC (Read error: Operation timed out) |