[01:42] Oh man, we are NOT in good shape to take over matchmaking. [01:42] I really want short urls to be fixed soon. [01:42] 19:44 < fellowshi> Everyone shuts down Gameservers all the Time. [01:42] This is a misrepresentation [01:43] Maybe 5 get shut off a year [02:18] hi there [02:18] I have an old Eee PC [02:18] I’ve installed fedora on it, but I was wondering what the best distro would be [02:18] to archive stuff [02:18] (including running the Warrior) [02:31] Any is fine, we just use a virtual box instance on top of your fun. [02:31] If you want to get down and dirty and run the script we run, then it should still be fine. [02:40] you can’t run the warrior as an OS ? [02:51] I thought I was pretty clear on the point that doing the matchmaking would be HARD- should I have been more blunt about it? [03:06] Argue matchmaking in -bs [03:06] But yes, matchmaking as an enterprise should be done by an organization othr than archive team [03:06] We're not in a position, nor should we be, to provide vital internet services [03:07] We're good at gathering the data that someone else running vital internet services might need [03:07] i.e. wayback or upcoming.org [03:32] anarllama: you can; there's a Docker image for that [03:32] https://github.com/ArchiveTeam/warrior-dockerfile [04:23] I'm starting to think I'm one of the only people who uses screen + irc [09:09] Just wondering, if I'm backing up several government websites (agencies that are gonna be axed with Australia's new budget), should I upload each warc'd site as a separate item in IA or just upload all the warcs as a single IA item? [09:12] upload them as separate item [09:13] awesome, will do. thanks! [09:49] danneh: oog, good call [09:49] danneh: let me know if you need some help [14:33] !archive http://www.nebraskaweatherphotos.org/ [14:34] What..... is that. [14:35] wrong chan? [14:39] ok, that's a bot [15:35] ok, i downloaded the 21gb twop archive and it makes warcqtviewer go unresponsive. can it handle such big warc files? [16:08] and all those people saying "I have CD backups are now feeling sheepish... -> [16:08] http://www.theatlantic.com/technology/archive/2014/05/the-library-of-congress-wants-to-destroy-your-old-cds-for-science/370804/ [16:55] Long lost commentary tracks from recalled laserdisc releases of James Bond movies... check: https://archive.org/details/from_russia_with_love-criterion_laserdisc-commentary_track [16:57] https://twitter.com/textfiles/status/466618491207176193 [16:57] Well, that went better than expected [17:10] my provider is stopping it's homepage service, http://home.xmsnet.nl/username what would be the best way to provide? [17:10] google says about 9000 pages [17:11] SketchCow: He's always been kind of a dick... on behalf of the rest of Canada I'd like to aplologize. [17:12] I don't think Jian Ghomeshi is a dick, he's just stupid [17:13] lets archive that fragment forever [17:14] get on it! [17:14] :-D [17:14] and after that, im going to put a sticky bit on it and try to delete it a couple of thousand times [17:15] ha [17:20] midas: done [17:20] Punch a DJ in the face, doo dah, doo dah [17:22] http://thumbnails.cbc.ca/maven_legacy/thumbnails/15/449/qpodcast_20140514_14626_uploaded.mp3 [17:23] for how also wants to grab the bastard and delete it a couple of times, just to be sure [17:23] who* [17:23] midas: archivebot got it [17:23] perfect [17:23] SketchCow: you may also want to retweet the thumbnails.cbc.ca link; the podcasts.* link actually doesn't exist [17:25] anyway, about my provider [17:25] Nah, it's just me having fun. [17:25] they are stopping the homepage service [17:25] any idea's how to grab these ~9000 sites? [17:25] wget [17:26] make a list of user names, ???, warrior project, profit? [17:26] alternatively if you have a bunch of URLs and they're all self-contained you can shove them all into archivebot [17:26] yipdw: you know what really makes me sad? this: http://home.xmsnet.nl/berendbotje/ [17:26] ha [17:27] this guy put up a link to his archive, a frigging archive, and then dyndns fucking killed the free service. [17:27] that's just cruel [17:29] does someone know an existing script/tool to keep archives of online source code repositories? i would have a list of repos and want them to update daily [17:29] easy scripting task but hey, no need to replicate if it has already been written [17:31] http://urlm.co/www.atcarchive.dyndns.org has an ip address but it doesn't respond to http [17:32] someone wanna portscan the block lol [17:32] lol :p [17:32] xms is a provider with alot of FTTH connections, so yeah, probably alot of servers available [17:33] or you could tweet him https://twitter.com/ATCArchive [18:15] wrote it myself, quick and ugly and buggy https://github.com/SpiritQuaddicted/quake-code-archives [18:32] East Village Radio signing off - http://evgrieve.com/2014/05/exclusive-east-village-radio-is-signing.html - Website and Show archives here - http://www.eastvillageradio.com/ [18:40] right, so 8000 urls are actually about 600 websites [18:40] according to my sorting and such [18:41] "All of our archives will be available, eventually ...." [18:58] ATZ0: that's a whole lot of data [18:59] Hasn't stopped us before. [18:59] yeah... I guess I was just thinking that it would be big enough to warrant a group project [22:16] re. what ATZ0 brought up: time to unleash archivebot on it [22:16] in other news: !!! [22:16] http://www.cbc.ca/news/business/yahoo-buys-snapchat-rival-blink-1.2642954 [22:20] there's some flash audio players involved, not sure how that complicates things. [22:32] I'm in the process of tracking down the urls of all of the mp3s, but it's going to take a while. [22:33] Even most of the content on the pages is JS generated, so archivebot isn't going to get much [22:57] I thought Yahoo were already a Snapchat rival? you upload your precious data, and a few years later they delete it... [23:00] yes, but now they'll delete it much faster. [23:11] hah [23:12] I'm going to try downloading the first 252 East Village Radio shows yo see if I get banned... we can go from there. [23:24] all i ask is that if we unleash warrior on it, the project be called Village People [23:24] we want you, we want you, we want you as a new virtual machine automated website archiving recruit. [23:37] I'm not /totally/ convinced that quite fits the rhythm, but apart from that... [23:44] aw... I thought that EVR is staffed by terrible hipsters, we could do something like PBR. [23:45] Anyway, EVR is going to be huge. My back of the envelope math puts it at about 1.25TB [23:47] Is there anybody that could talk to me about setting up a warrior project?