#archiveteam 2013-11-17,Sun

↑back Search

Time Nickname Message
00:04 🔗 odie5533 xmc: yeah, I figure he'd publish his email if he wanted people emailing him. And he clearly doesn't want people contacting him since he disabled the Issues section on the repo =/
00:04 🔗 odie5533 Does anyone know of a CDX indexer that outputs all records, not just response ones?
01:14 🔗 ivan` underscor: if you could run hyves-grab on everything that would be great
01:14 🔗 ivan` underscor: it's throttling per-IP
06:56 🔗 SketchCow ivan`: I don't know of this specifically.
07:47 🔗 SketchCow I'm heading out from Belfast in an hour. I'll be around tonight and it's all about catchup.
08:34 🔗 ivan` thanks
09:08 🔗 DFJustin SketchCow: looks like the archiveteam_archivebot_go_003 item needs a derive kicked off
09:08 🔗 DFJustin no cdx files have been generated yet
10:19 🔗 Nemo_bis Can someone try a telnet 1111.wiki-site.com 80 for me? I get: telnet: Unable to connect to remote host: Connection timed out
10:20 🔗 Nemo_bis They probably blocked my server's IP but I don't understand at what level
10:25 🔗 Tomcat_ Nemo_bis: Works for me, got an under construction site.
10:39 🔗 nico_ Via: 1.0 www.wiki-site.com:3128 (squid/2.6.STABLE21)
10:40 🔗 nico_ yes it works
11:00 🔗 Nemo_bis oh, at least this guy followed my advice http://www.editthis.info/robots.txt
12:46 🔗 GLaDOS Wait, did I show you guys my robots.txt?
12:58 🔗 Nemo_bis GLaDOS: not that my logs know
12:59 🔗 GLaDOS http://tropicraft.net/robots.txt
13:02 🔗 Nemo_bis O_o
13:32 🔗 odie5533 I created a Python WARC library named pylibwarc, so we now have yet another python warc library. This one does support CDX though, and I'm not sure the others do. https://github.com/odie5533/pylibwarc
13:54 🔗 ersi odie5533: Nice work
13:54 🔗 odie5533 thank you
13:58 🔗 ersi Added it to http://archiveteam.org/index.php?title=The_WARC_Ecosystem
22:39 🔗 Smiley heh, vlogbrothers viewing their youtube via wayback machine.
23:46 🔗 SketchCow Hey.
23:47 🔗 SketchCow DFJustin: drive now done
23:47 🔗 SketchCow I mean derive. Kicked off.
23:49 🔗 SketchCow Known bug in ia uploader,. will be fixing shortly.
23:50 🔗 DFJustin yeah I keep finding affected items
23:51 🔗 DFJustin you would think there would be a cron job that runs derives on underived stuff every once in a while though
23:52 🔗 SketchCow Not really.
23:52 🔗 SketchCow I wish!
23:53 🔗 godane hey SketchCow
23:57 🔗 xmc IA has cronjobs made out of meat

irclogger-viewer