#archiveteam 2014-05-14,Wed

↑back Search

Time Nickname Message
01:42 πŸ”— SketchCow Oh man, we are NOT in good shape to take over matchmaking.
01:42 πŸ”— SketchCow I really want short urls to be fixed soon.
01:42 πŸ”— SketchCow 19:44 < fellowshi> Everyone shuts down Gameservers all the Time.
01:42 πŸ”— SketchCow This is a misrepresentation
01:43 πŸ”— SketchCow Maybe 5 get shut off a year
02:18 πŸ”— anarllama hi there
02:18 πŸ”— anarllama I have an old Eee PC
02:18 πŸ”— anarllama IҀ™ve installed fedora on it, but I was wondering what the best distro would be
02:18 πŸ”— anarllama to archive stuff
02:18 πŸ”— anarllama (including running the Warrior)
02:31 πŸ”— SketchCow Any is fine, we just use a virtual box instance on top of your fun.
02:31 πŸ”— SketchCow If you want to get down and dirty and run the script we run, then it should still be fine.
02:40 πŸ”— anarllama you canҀ™t run the warrior as an OS ?
02:51 πŸ”— dashcloud I thought I was pretty clear on the point that doing the matchmaking would be HARD- should I have been more blunt about it?
03:06 πŸ”— SketchCow Argue matchmaking in -bs
03:06 πŸ”— SketchCow But yes, matchmaking as an enterprise should be done by an organization othr than archive team
03:06 πŸ”— SketchCow We're not in a position, nor should we be, to provide vital internet services
03:07 πŸ”— SketchCow We're good at gathering the data that someone else running vital internet services might need
03:07 πŸ”— SketchCow i.e. wayback or upcoming.org
03:32 πŸ”— yipdw anarllama: you can; there's a Docker image for that
03:32 πŸ”— yipdw https://github.com/ArchiveTeam/warrior-dockerfile
04:23 πŸ”— SketchCow I'm starting to think I'm one of the only people who uses screen + irc
09:09 πŸ”— danneh_ Just wondering, if I'm backing up several government websites (agencies that are gonna be axed with Australia's new budget), should I upload each warc'd site as a separate item in IA or just upload all the warcs as a single IA item?
09:12 πŸ”— godane upload them as separate item
09:13 πŸ”— danneh_ awesome, will do. thanks!
09:49 πŸ”— trs80 danneh: oog, good call
09:49 πŸ”— trs80 danneh: let me know if you need some help
14:33 πŸ”— ArhiveBot !archive http://www.nebraskaweatherphotos.org/
14:34 πŸ”— SketchCow What..... is that.
14:35 πŸ”— Smiley wrong chan?
14:39 πŸ”— yipdw ok, that's a bot
15:35 πŸ”— Wabadub ok, i downloaded the 21gb twop archive and it makes warcqtviewer go unresponsive. can it handle such big warc files?
16:08 πŸ”— Smiley and all those people saying "I have CD backups are now feeling sheepish... ->
16:08 πŸ”— Smiley http://www.theatlantic.com/technology/archive/2014/05/the-library-of-congress-wants-to-destroy-your-old-cds-for-science/370804/
16:55 πŸ”— SadDM Long lost commentary tracks from recalled laserdisc releases of James Bond movies... check: https://archive.org/details/from_russia_with_love-criterion_laserdisc-commentary_track
16:57 πŸ”— SketchCow https://twitter.com/textfiles/status/466618491207176193
16:57 πŸ”— SketchCow Well, that went better than expected
17:10 πŸ”— midas my provider is stopping it's homepage service, http://home.xmsnet.nl/username what would be the best way to provide?
17:10 πŸ”— midas google says about 9000 pages
17:11 πŸ”— SadDM SketchCow: He's always been kind of a dick... on behalf of the rest of Canada I'd like to aplologize.
17:12 πŸ”— yipdw I don't think Jian Ghomeshi is a dick, he's just stupid
17:13 πŸ”— midas lets archive that fragment forever
17:14 πŸ”— SadDM get on it!
17:14 πŸ”— SadDM :-D
17:14 πŸ”— midas and after that, im going to put a sticky bit on it and try to delete it a couple of thousand times
17:15 πŸ”— exmic ha
17:20 πŸ”— yipdw midas: done
17:20 πŸ”— SketchCow Punch a DJ in the face, doo dah, doo dah
17:22 πŸ”— midas http://thumbnails.cbc.ca/maven_legacy/thumbnails/15/449/qpodcast_20140514_14626_uploaded.mp3
17:23 πŸ”— midas for how also wants to grab the bastard and delete it a couple of times, just to be sure
17:23 πŸ”— midas who*
17:23 πŸ”— yipdw midas: archivebot got it
17:23 πŸ”— midas perfect
17:23 πŸ”— yipdw SketchCow: you may also want to retweet the thumbnails.cbc.ca link; the podcasts.* link actually doesn't exist
17:25 πŸ”— midas anyway, about my provider
17:25 πŸ”— SketchCow Nah, it's just me having fun.
17:25 πŸ”— midas they are stopping the homepage service
17:25 πŸ”— midas any idea's how to grab these ~9000 sites?
17:25 πŸ”— yipdw wget
17:26 πŸ”— DFJustin make a list of user names, ???, warrior project, profit?
17:26 πŸ”— yipdw alternatively if you have a bunch of URLs and they're all self-contained you can shove them all into archivebot
17:26 πŸ”— midas yipdw: you know what really makes me sad? this: http://home.xmsnet.nl/berendbotje/
17:26 πŸ”— yipdw ha
17:27 πŸ”— midas this guy put up a link to his archive, a frigging archive, and then dyndns fucking killed the free service.
17:27 πŸ”— midas that's just cruel
17:29 πŸ”— schbirid does someone know an existing script/tool to keep archives of online source code repositories? i would have a list of repos and want them to update daily
17:29 πŸ”— schbirid easy scripting task but hey, no need to replicate if it has already been written
17:31 πŸ”— DFJustin http://urlm.co/www.atcarchive.dyndns.org has an ip address but it doesn't respond to http
17:32 πŸ”— DFJustin someone wanna portscan the block lol
17:32 πŸ”— midas lol :p
17:32 πŸ”— midas xms is a provider with alot of FTTH connections, so yeah, probably alot of servers available
17:33 πŸ”— DFJustin or you could tweet him https://twitter.com/ATCArchive
18:15 πŸ”— schbirid wrote it myself, quick and ugly and buggy https://github.com/SpiritQuaddicted/quake-code-archives
18:32 πŸ”— ATZ0 East Village Radio signing off - http://evgrieve.com/2014/05/exclusive-east-village-radio-is-signing.html - Website and Show archives here - http://www.eastvillageradio.com/
18:40 πŸ”— midas right, so 8000 urls are actually about 600 websites
18:40 πŸ”— midas according to my sorting and such
18:41 πŸ”— ATZ0 "All of our archives will be available, eventually ...."
18:58 πŸ”— SadDM ATZ0: that's a whole lot of data
18:59 πŸ”— ATZ0 Hasn't stopped us before.
18:59 πŸ”— SadDM yeah... I guess I was just thinking that it would be big enough to warrant a group project
22:16 πŸ”— wp494 re. what ATZ0 brought up: time to unleash archivebot on it
22:16 πŸ”— wp494 in other news: !!!
22:16 πŸ”— wp494 http://www.cbc.ca/news/business/yahoo-buys-snapchat-rival-blink-1.2642954
22:20 πŸ”— ATZ0 there's some flash audio players involved, not sure how that complicates things.
22:32 πŸ”— SadDM I'm in the process of tracking down the urls of all of the mp3s, but it's going to take a while.
22:33 πŸ”— SadDM Even most of the content on the pages is JS generated, so archivebot isn't going to get much
22:57 πŸ”— Baljem I thought Yahoo were already a Snapchat rival? you upload your precious data, and a few years later they delete it...
23:00 πŸ”— garyrh yes, but now they'll delete it much faster.
23:11 πŸ”— exmic hah
23:12 πŸ”— SadDM I'm going to try downloading the first 252 East Village Radio shows yo see if I get banned... we can go from there.
23:24 πŸ”— ATZ0 all i ask is that if we unleash warrior on it, the project be called Village People
23:24 πŸ”— ATZ0 we want you, we want you, we want you as a new virtual machine automated website archiving recruit.
23:37 πŸ”— Baljem I'm not /totally/ convinced that quite fits the rhythm, but apart from that...
23:44 πŸ”— SadDM aw... I thought that EVR is staffed by terrible hipsters, we could do something like PBR.
23:45 πŸ”— SadDM Anyway, EVR is going to be huge. My back of the envelope math puts it at about 1.25TB
23:47 πŸ”— SadDM Is there anybody that could talk to me about setting up a warrior project?

irclogger-viewer