#archiveteam 2013-12-16,Mon

Time Nickname Message
02:09 🔗 bsmith094 what happens when i update a repo i'm currently executing code from?
02:10 🔗 DFJustin eldritch cries erupt from your cpu as beelzebub awakens
02:17 🔗 bsmith094 DFJustin: seriously, though a bad idea?
02:26 🔗 DFJustin would depend on the kind of repo I guess, on linux at least running executables are not affected by the underlying file being replaced, but there can be data files and such I guess
04:12 🔗 bsmith094 I could really use a hand with my ffnet grab, I can't possibly scan all 10 million links for updates in under a year or so. I've been running this download script for ~18 months now and I think I'm about 80 percent done
04:13 🔗 bsmith094 I've been rsync-ing it to wherever we're putting all these jobs, and I have about 200GB done
05:15 🔗 xmc bsmith094: depends on the interpreter. most will be ok, some will get fucked.
05:15 🔗 xmc binary executable, perl, python, ruby, etc: won't care
05:15 🔗 xmc php, shell: will shit the bed
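
[Editor's sketch, not part of the log] The point made above about running code surviving a replaced file can be illustrated on a POSIX system. This is a minimal sketch under assumptions: the function name `demo_replace_while_open` and the file names are hypothetical, and it assumes the update writes a new file and renames it over the old one. A tool that rewrites the file in place behaves differently, which is one plausible reason interpreters that read a script incrementally can break.

```python
import os
import tempfile


def demo_replace_while_open():
    """Return (text seen through the old handle, text seen by a fresh open).

    Assumes POSIX semantics: renaming a new file over an existing path
    leaves the old inode alive for anyone who already has it open.
    """
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "script.txt")  # hypothetical file name
        with open(path, "w") as f:
            f.write("old version\n")

        # Stands in for a running process that already opened the file.
        old_handle = open(path)

        # Simulate an update: write the new version beside the original,
        # then atomically rename it over the original path.
        new_path = path + ".new"
        with open(new_path, "w") as f:
            f.write("new version\n")
        os.replace(new_path, path)

        try:
            seen_by_old = old_handle.read()  # still reads the old inode
        finally:
            old_handle.close()

        with open(path) as f:
            seen_by_new = f.read()  # a fresh open sees the new file

        return seen_by_old, seen_by_new


if __name__ == "__main__":
    print(demo_replace_while_open())
```

If the update instead truncated and rewrote the same file in place, the already-open handle would share the modified contents, so whether a given repo update is safe depends on how the tool writes files and on how the interpreter reads them.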
06:13 🔗 chfoo your help is needed: https://github.com/ArchiveTeam/yahoo-blog-wretch-username-grab #shipwretched
11:27 🔗 arkiver I added the important websites section to the projects page: http://archiveteam.org/index.php?title=Projects
21:24 🔗 tephra http://www.telegraph.co.uk/news/worldnews/asia/northkorea/10520935/Is-North-Korea-now-erasing-history.html
21:25 🔗 tephra seems like we should make a grab of http://www.kcna.co.jp/index-e.htm , would archivebot be able to grab it?
21:28 🔗 tephra will start a grab of it
21:30 🔗 ersi They're already done modifying the site though
21:31 🔗 tephra not the co.jp mirror it seems
21:32 🔗 ersi from what I've read, they used the co.jp one first and later they moved to the north korean one
21:33 🔗 tephra co.kp seems to have news from yesterday
21:33 🔗 tephra *jp
21:33 🔗 ersi alright
21:40 🔗 m1das yesteryear
21:42 🔗 godane i grabbed a copy of that
21:44 🔗 DFJustin apparently people are already scraping it on an ongoing basis though
21:50 🔗 tephra right, will grab it now since i started already
21:55 🔗 xmc I nabbed kcna a year or so ago, I could probably dig it up
21:55 🔗 xmc take me a little while though
21:57 🔗 xmc september 2011: http://bl-r.com/trx/www.kcna.co.jp.tar.xz
21:58 🔗 xmc wayback also has crawled it rather heavily
22:01 🔗 xmc or maybe earlier, that was when I tarred it up
22:01 🔗 xmc 2010-11-26 is the dates inside the tar
22:02 🔗 xmc HTH
23:19 🔗 DFJustin http://kcnawatch.org/