#archiveteam-bs 2017-05-21,Sun

Who What When
***pizzaiolo has quit IRC (Read error: Connection reset by peer) [00:15]
........... (idle for 53mn)
Sk1d has quit IRC (Ping timeout: 250 seconds) [01:08]
Sk1d has joined #archiveteam-bs [01:15]
..... (idle for 21mn)
j08nY has quit IRC (Quit: Leaving) [01:36]
............. (idle for 1h4mn)
trs80 has quit IRC (Remote host closed the connection) [02:40]
.................. (idle for 1h29mn)
sheaf has joined #archiveteam-bs [04:09]
wp494GLaDOS: did you start up ipernity too on tracker? [04:19]
***SadDM has quit IRC (Read error: Operation timed out)
jspiros has quit IRC (Read error: Operation timed out)
[04:20]
wp494because the last time it was halted on 4/8 or 4/9, it was due to a bunch of 403s [04:21]
***dboard has quit IRC (Read error: Operation timed out) [04:21]
GLaDOSnah, i dont believe i did [04:23]
wp494well someone set it off again
because items are flowing
and they're all 0 MB, just like the last time
it was about 1300 ut today that they started flowing again
and the urls do still seem to be 403s
[04:26]
***ndiddy has quit IRC () [04:31]
........... (idle for 50mn)
jspiros has joined #archiveteam-bs
dboard has joined #archiveteam-bs
DFJustin has quit IRC (Remote host closed the connection)
SadDM has joined #archiveteam-bs
swebb sets mode: +o SadDM
[05:21]
.......... (idle for 46mn)
DFJustin has joined #archiveteam-bs [06:11]
...... (idle for 27mn)
yakfish has quit IRC (Remote host closed the connection) [06:38]
sheaf has quit IRC (Quit: sheaf) [06:51]
sheaf has joined #archiveteam-bs [07:03]
....... (idle for 30mn)
sheaf has quit IRC (Quit: sheaf) [07:33]
Gfy has joined #archiveteam-bs [07:47]
....... (idle for 30mn)
schbirid has joined #archiveteam-bs [08:17]
BlueMaxim has quit IRC (Quit: Leaving)
icedice has joined #archiveteam-bs
[08:27]
.......... (idle for 49mn)
vitzli has joined #archiveteam-bs [09:19]
GE has joined #archiveteam-bs [09:33]
DFJustin has quit IRC (Remote host closed the connection)
DFJustin has joined #archiveteam-bs
[09:38]
...... (idle for 25mn)
icedice has quit IRC (Quit: Leaving) [10:06]
...... (idle for 29mn)
j08nY has joined #archiveteam-bs [10:35]
.... (idle for 17mn)
GE has quit IRC (Remote host closed the connection) [10:52]
.... (idle for 19mn)
arkiverI set it off again
wp494 ^
the 403s are good
previously all items with a 403 failed, which left us with a bunch of items that would return 403
and now that items with 403 are not failing we have to go through them
[11:11]
..................... (idle for 1h43mn)
***GE has joined #archiveteam-bs [12:54]
sheaf has joined #archiveteam-bs [13:05]
.......... (idle for 48mn)
pizzaiolo has joined #archiveteam-bs [13:53]
.......... (idle for 45mn)
yakfish has joined #archiveteam-bs [14:38]
............ (idle for 57mn)
pizzaiolo has quit IRC (Ping timeout: 506 seconds) [15:35]
fie has quit IRC (Read error: Operation timed out) [15:40]
........ (idle for 35mn)
powerKitt has joined #archiveteam-bs [16:15]
....... (idle for 33mn)
vitzli has quit IRC (Quit: Leaving)
powerKitt has quit IRC (Quit: Page closed)
[16:48]
.... (idle for 15mn)
SketchCowWho's uploading "localnewsarchive" to FOS? [17:03]
***RichardG has quit IRC (Ping timeout: 370 seconds) [17:09]
...... (idle for 26mn)
RichardG has joined #archiveteam-bs [17:35]
t2t2what is wget "error -6" and is there anything I can do to fix/avoid it? [17:37]
***Akiva has joined #archiveteam-bs [17:38]
SketchCowhttps://archive.org/details/localnewsarchive [17:47]
.... (idle for 16mn)
***icedice has joined #archiveteam-bs [18:03]
....... (idle for 34mn)
GE has quit IRC (Remote host closed the connection)
rocode has quit IRC (Ping timeout: 600 seconds)
rocode has joined #archiveteam-bs
[18:37]
sheaf has quit IRC (Quit: sheaf) [18:53]
............. (idle for 1h3mn)
Lord_NighSketchCow: for https://archive.org/details/pdfy-QPCSwTWiFz1u9WU_ i technically own the copyright to that document (and it's way obsolete; the original version was a text file), it had a weird genesis being converted to html without permission, uploaded to scribd without permission, downloaded as 'david.pdf' and shoved on pdfY at some point. While i don't mind it being there for historical reasons, it is obsolete
and is superseded by a newer version of said document at this point
i actually DMCA'd the copy at scribd because i hate scribd with a passion, since they're making ad/subscription money off my work without my permission
i don't mind it being hosted at IA
[19:56]
***sheaf has joined #archiveteam-bs [19:58]
Lord_Nighhttps://www.dropbox.com/s/z334fcat4jal5qu/S14001A9.txt?dl=0 is the original version of that file before it got html-ified by some anonymous person
https://www.dropbox.com/s/j1lkjtkwjzlg6ko/S14001A10.txt?dl=0 is the latest version
[19:59]
https://www.dropbox.com/s/z334fcat4jal5qu/S14001A9.txt?dl=0 was maybe once located at http://www.netaxs.com/~gevaryah/S14001A9.txt but that would have had to be in 2006ish, I'm not even sure I uploaded it there before netaxs went bust
heh, if you actually want copies of the not-archived files visible on https://web-beta.archive.org/web/20050225051103/http://www.netaxs.com:80/~gevaryah/ i have them saved somewhere
[20:04]
..... (idle for 24mn)
godaneSketchCow: Please don't delete my 'Godane VHS Capture' folder
also your localnewsarchive item uploads are a complete mess in my mind
normally you're using the full item name
[20:31]
***Akiva has quit IRC (Remote host closed the connection) [20:31]
godane*file name for metadata [20:32]
***GE has joined #archiveteam-bs [20:32]
godanenot like this: https://archive.org/details/localnewsarchive_ABC
i only complain cause you told me that with the vhs vault stuff
[20:32]
***DFJustin has quit IRC (hub.efnet.us hub.dk)
bwn has quit IRC (hub.efnet.us hub.dk)
alfie has quit IRC (hub.efnet.us hub.dk)
acridAxid has quit IRC (hub.efnet.us hub.dk)
SpaffGarg has quit IRC (hub.efnet.us hub.dk)
Selavi has quit IRC (hub.efnet.us hub.dk)
kevinr has quit IRC (hub.efnet.us hub.dk)
tephra_ has quit IRC (hub.efnet.us hub.dk)
tsr has quit IRC (hub.efnet.us hub.dk)
ThisAsYou has quit IRC (hub.efnet.us hub.dk)
davidar has quit IRC (hub.efnet.us hub.dk)
Ctrl-S___ has quit IRC (hub.efnet.us hub.dk)
Sanqui has quit IRC (hub.efnet.us hub.dk)
deathy has quit IRC (hub.efnet.us hub.dk)
alembic has quit IRC (hub.efnet.us hub.dk)
BartoCH has quit IRC (hub.efnet.us hub.dk)
HCross2 has quit IRC (hub.efnet.us hub.dk)
hook54321 has quit IRC (hub.efnet.us hub.dk)
tuluu has quit IRC (hub.efnet.us hub.dk)
Famicoman has quit IRC (hub.efnet.us hub.dk)
Yoshimura has quit IRC (hub.efnet.us hub.dk)
zhongfu has quit IRC (hub.efnet.us hub.dk)
Kaz has quit IRC (hub.efnet.us hub.dk)
JSharp___ has quit IRC (hub.efnet.us hub.dk)
tklk has quit IRC (hub.efnet.us hub.dk)
floogulin has quit IRC (hub.efnet.us hub.dk)
jiphex has quit IRC (hub.efnet.us hub.dk)
FalconK has quit IRC (hub.efnet.us hub.dk)
t2t2 has quit IRC (hub.efnet.us hub.dk)
K4k has quit IRC (hub.efnet.us hub.dk)
Muad-Dib has quit IRC (hub.efnet.us hub.dk)
Meroje has quit IRC (hub.efnet.us hub.dk)
raphidae has quit IRC (hub.efnet.us hub.dk)
icedice has quit IRC (hub.efnet.us hub.dk)
JensRex has quit IRC (hub.efnet.us hub.dk)
Simpbrain has quit IRC (hub.efnet.us hub.dk)
antomatic has quit IRC (hub.efnet.us hub.dk)
Hecatz has quit IRC (hub.efnet.us hub.dk)
medowar has quit IRC (hub.efnet.us hub.dk)
Aoede has quit IRC (hub.efnet.us hub.dk)
Rai-chan has quit IRC (hub.efnet.us hub.dk)
Frogging has quit IRC (hub.efnet.us hub.dk)
Riviera has quit IRC (hub.efnet.us hub.dk)
SN4T14 has quit IRC (hub.efnet.us hub.dk)
i0npulse has quit IRC (hub.efnet.us hub.dk)
purplebot has quit IRC (hub.efnet.us hub.dk)
yuitimoth has quit IRC (hub.efnet.us hub.dk)
nyany has quit IRC (hub.efnet.us hub.dk)
Madchen has quit IRC (hub.efnet.us hub.dk)
PurpleSym has quit IRC (hub.efnet.us hub.dk)
altlabel has quit IRC (hub.efnet.us hub.dk)
RichardG has quit IRC (hub.efnet.us hub.dk)
j08nY has quit IRC (hub.efnet.us hub.dk)
brayden has quit IRC (hub.efnet.us hub.dk)
GLaDOS has quit IRC (hub.efnet.us hub.dk)
joepie91 has quit IRC (hub.efnet.us hub.dk)
cf has quit IRC (hub.efnet.us hub.dk)
chfoo has quit IRC (hub.efnet.us hub.dk)
eprillios has quit IRC (hub.efnet.us hub.dk)
tapedrive has quit IRC (hub.efnet.us hub.dk)
antonizoo has quit IRC (hub.efnet.us hub.dk)
Odd0002 has quit IRC (hub.efnet.us hub.dk)
HP has quit IRC (hub.efnet.us hub.dk)
dashcloud has quit IRC (hub.efnet.us hub.dk)
w0rp has quit IRC (hub.efnet.us hub.dk)
Kenshin has quit IRC (hub.efnet.us hub.dk)
Jon- has quit IRC (hub.efnet.us hub.dk)
SilSte has quit IRC (hub.efnet.us hub.dk)
espes__ has quit IRC (hub.efnet.us hub.dk)
kvieta has quit IRC (hub.efnet.us hub.dk)
Lord_Nigh has quit IRC (hub.efnet.us hub.dk)
kurt has quit IRC (hub.efnet.us hub.dk)
Fletcher has quit IRC (hub.efnet.us hub.dk)
yuitimoth has joined #archiveteam-bs
nyany has joined #archiveteam-bs
Madchen has joined #archiveteam-bs
PurpleSym has joined #archiveteam-bs
altlabel has joined #archiveteam-bs
irc.homelien.no sets mode: +o PurpleSym
[20:42]
icedice has joined #archiveteam-bs
JensRex has joined #archiveteam-bs
Simpbrain has joined #archiveteam-bs
antomatic has joined #archiveteam-bs
Hecatz has joined #archiveteam-bs
medowar has joined #archiveteam-bs
Aoede has joined #archiveteam-bs
Rai-chan has joined #archiveteam-bs
Frogging has joined #archiveteam-bs
Riviera has joined #archiveteam-bs
SN4T14 has joined #archiveteam-bs
i0npulse has joined #archiveteam-bs
purplebot has joined #archiveteam-bs
irc.underworld.no sets mode: +o antomatic
swebb sets mode: +o antomatic
powerKitt has joined #archiveteam-bs
[20:48]
powerKitthttp://www.dmoz.org/ apparently shut down on 2017/03/17 and left a static mirror at http://dmoztools.net/
I'm gonna throw the static mirror into ArchiveBot with the "--no-offsite-links" parameter to prevent it from making a massive WARC.
[20:51]
JAAYes, someone from here grabbed it back then. Haven't heard about the static mirror though.
powerKitt: Use --large for this one.
[20:54]
powerKittAlright. [20:54]
JAAIIRC, that grab in March was something like 3M URLs. [20:54]
powerKittSomeone needs to add --large to the ArchiveBot documentation. [20:56]
JAAHere's the relevant announcement regarding dmoztools.net, by the way: https://www.facebook.com/DMOZ/posts/10155889279717542
Hmm, I can't find the archive from March in the IA. masterX244 (the guy who grabbed it) hasn't been here since 22 March, it seems. :-/
[20:56]
***DFJustin has joined #archiveteam-bs
swebb sets mode: +o DFJustin
DFJustin has quit IRC (Read error: Connection reset by peer)
powerKitt has quit IRC (Quit: Page closed)
DFJustin has joined #archiveteam-bs
bwn has joined #archiveteam-bs
alfie has joined #archiveteam-bs
acridAxid has joined #archiveteam-bs
SpaffGarg has joined #archiveteam-bs
Selavi has joined #archiveteam-bs
kevinr has joined #archiveteam-bs
tephra_ has joined #archiveteam-bs
tsr has joined #archiveteam-bs
davidar has joined #archiveteam-bs
Ctrl-S___ has joined #archiveteam-bs
ThisAsYou has joined #archiveteam-bs
Sanqui has joined #archiveteam-bs
alembic has joined #archiveteam-bs
deathy has joined #archiveteam-bs
BartoCH has joined #archiveteam-bs
HCross2 has joined #archiveteam-bs
hook54321 has joined #archiveteam-bs
tuluu has joined #archiveteam-bs
Famicoman has joined #archiveteam-bs
Yoshimura has joined #archiveteam-bs
zhongfu has joined #archiveteam-bs
Kaz has joined #archiveteam-bs
JSharp___ has joined #archiveteam-bs
tklk has joined #archiveteam-bs
floogulin has joined #archiveteam-bs
jiphex has joined #archiveteam-bs
FalconK has joined #archiveteam-bs
t2t2 has joined #archiveteam-bs
K4k has joined #archiveteam-bs
raphidae has joined #archiveteam-bs
Muad-Dib has joined #archiveteam-bs
Meroje has joined #archiveteam-bs
swebb sets mode: +o DFJustin
[21:08]
powerKitt has joined #archiveteam-bs [21:23]
.... (idle for 15mn)
powerKitt has quit IRC (Ping timeout: 268 seconds) [21:38]
SketchCowOh Godane
Well, the local news archive thing is a VERY specific situation.
It wouldn't work for most things, but there's a finite amount of "buckets" for channels.
I.e. 600 and maybe eventually to something like 800
So as you're pulling down more videos from that guy and elsewhere, they can be shoved into this with relative ease.
A very specific choice
Also, this wanders handily into IA space and I don't want to make a big footprint
[21:40]
***Lord_Nigh has joined #archiveteam-bs [21:50]
.... (idle for 17mn)
SmileyG has quit IRC (Remote host closed the connection) [22:07]
.... (idle for 18mn)
icedice has quit IRC (Ping timeout: 268 seconds) [22:25]
godaneok [22:32]
i remember you saying you can only own like 30 something collections, if i remember correctly
btw i got your 6 hour video called "Jason Scott's Day of Archiving"
[22:40]
JAARingling Bros Circus will stream their final performance in about 16 minutes (23:00 UTC) via Facebook and YouTube. I can't take care of this (and also don't know how), but it would be great if we could grab it. According to one news article, the video should also be available for a short time (whatever that means) after the show, but why take any chances?
Oh, meant to send this to the main channel.
[22:43]
.... (idle for 18mn)
MrRadarI saw a post on HN earlier by someone who crawled and indexed all Gopher sites on the public Internet and asked him to upload the data to the IA
And he followed through! https://archive.org/details/gopher-may-2017.tar
So the IA now has a complete copy of the Gopher Internet as of this month
[23:02]
***GE has quit IRC (Remote host closed the connection) [23:07]
MrRadarThe blog post he wrote is a good read too since he chose to use the "Personal" version of AltaVista's search indexer (which is apparently something you could buy circa 1997) running in a Windows 98 VM: https://blog.benjojo.co.uk/post/building-a-search-engine-for-gopher [23:13]
***BlueMaxim has joined #archiveteam-bs [23:14]
powerKitt has joined #archiveteam-bs [23:25]
...... (idle for 26mn)
qwebirc30 has joined #archiveteam-bs
powerKitt has quit IRC (Ping timeout: 272 seconds)
qwebirc30 is now known as powerKitt
[23:51]
