Time |
Nickname |
Message |
00:04
🔗
|
odie5533 |
xmc: yeah, I figure he'd publish his email if he wanted people emailing him. And he clearly doesn't want people contacting him since he disabled the Issues section on the repo =/ |
00:04
🔗
|
odie5533 |
Does anyone know of a CDX indexer that outputs all records, not just response ones? |
01:14
🔗
|
ivan` |
underscor: if you could run hyves-grab on everything that would be great |
01:14
🔗
|
ivan` |
underscor: it's throttling per-IP |
06:56
🔗
|
SketchCow |
ivan`: I don't know of this specifically. |
07:47
🔗
|
SketchCow |
I'm heading out from Belfast in an hour. I'll be around tonight and it's all about catchup. |
08:34
🔗
|
ivan` |
thanks |
09:08
🔗
|
DFJustin |
SketchCow: looks like the archiveteam_archivebot_go_003 item needs a derive kicked off |
09:08
🔗
|
DFJustin |
no cdx files have been generated yet |
10:19
🔗
|
Nemo_bis |
Can someone try a telnet 1111.wiki-site.com 80 for me? I get: telnet: Unable to connect to remote host: Connection timed out |
10:20
🔗
|
Nemo_bis |
They probably blocked my server's IP but I don't understand at what level |
10:25
🔗
|
Tomcat_ |
Nemo_bis: Works for me, got an under construction site. |
10:39
🔗
|
nico_ |
Via: 1.0 www.wiki-site.com:3128 (squid/2.6.STABLE21) |
10:40
🔗
|
nico_ |
yes it works |
11:00
🔗
|
Nemo_bis |
oh, at least this guy followed my advice http://www.editthis.info/robots.txt |
12:46
🔗
|
GLaDOS |
Wait, did I show you guys my robots.txt? |
12:58
🔗
|
Nemo_bis |
GLaDOS: not that my logs know |
12:59
🔗
|
GLaDOS |
http://tropicraft.net/robots.txt |
13:02
🔗
|
Nemo_bis |
O_o |
13:32
🔗
|
odie5533 |
I created a Python WARC library named pylibwarc, so we now have yet another python warc library. This one does support CDX though, and I'm not sure the others do. https://github.com/odie5533/pylibwarc |
13:54
🔗
|
ersi |
odie5533: Nice work |
13:54
🔗
|
odie5533 |
thank you |
13:58
🔗
|
ersi |
Added it to http://archiveteam.org/index.php?title=The_WARC_Ecosystem |
22:39
🔗
|
Smiley |
heh, vlogbrothers viewing their youtube via wayback machine. |
23:46
🔗
|
SketchCow |
Hey. |
23:47
🔗
|
SketchCow |
DFJustin: drive now done |
23:47
🔗
|
SketchCow |
I mean derive. Kicked off. |
23:49
🔗
|
SketchCow |
Known bug in ia uploader,. will be fixing shortly. |
23:50
🔗
|
DFJustin |
yeah I keep finding affected items |
23:51
🔗
|
DFJustin |
you would think there would be a cron job that runs derives on underived stuff every once in a while though |
23:52
🔗
|
SketchCow |
Not really. |
23:52
🔗
|
SketchCow |
I wish! |
23:53
🔗
|
godane |
hey SketchCow |
23:57
🔗
|
xmc |
IA has cronjobs made out of meat |