[00:00] lysobit: nope unless i win lotto. [00:00] >__> [00:03] to -bs! [00:14] SketchCow: I am still going to send the stuff to you(floppies, manuals and such) i just haven't had time yet. [00:15] well.. haven't made time. [00:15] big difference. lol [01:40] "Archive Team: Dammit Yahoo! And on the carpet too?!?" [01:53] RedType_: given the number of server problems these grabs are starting to cause, I'd suggest the small variation of "Archive Team: We Are Going To Fuck Up Your Shit" [02:55] Archive Team: Smash & Grab & Xerox [03:42] Archive Team: Sudo Save Your Shit [06:44] Archive Team: Five Years Ago They Laughed [08:05] would be cool to rip these http://www.ebay.com/itm/The-video-encyclopedia-of-physics-demonstrations-1-index-25-book-w-Laser-Disc-/271127397864?pt=US_Texbook_Education&hash=item3f2073c5e8 [11:16] Great idea, or greatest idea? [11:18] 5 years of Archive Team [11:24] no cake? [11:24] Just got accepted - I am now running the Game Preservation SIG at GDC 2014. [11:25] In other news, I just got a $995 pass to GDC 2014 for free [11:27] SketchCow: where should i send the cake, 300 Funston Avenue? [11:29] Those people get enough cake [11:29] lol [11:29] it's going to be bloody hard to send a cake to everybody. hmm we need food replicators [11:32] midas: they're called recipes [11:32] yw [12:12] haha Nemo_bis ;-) [17:16] just revising this: http://archiveteam.org/index.php?title=Google_Video [17:16] er, revisiting [17:16] it's funny how big 18 TB is in the context of now [17:17] e.g. http://tracker.archiveteam.org/wretch/ [17:18] I've read that again too recently [17:19] the news coverage was fun [17:40] SketchCow: emijrp wants to write a paper on wikiteam; you once mentioned it would be nice to have a project like wikiteam but for forums, do you think that an export feature for forums is something important to advocate for? [17:42] nowadays the biggest "forums" are Q&A sites and reddit-like things probably, some of those have an API (e.g. stackexchange) [18:22] There are lots and lots and lots of focums that are neither Q&A and reddit [18:27] As a table-top gamer, I concur... most of my community's folk-knowledge is tied up in forums. [18:43] Lots and lots of forums are still simple PHPBB and SMF boards with decades of content that can just disappear. [18:53] Exactly! [18:58] Sure [18:59] I only meant, they have some pseudo-competitors with export functions, so maybe that could be an argument to convince them too [18:59] though of course that won't help with the old forums nobody is upgrading [19:22] Nemo_bis: most paid forum software provide tools to scraping their competitors HTML and building a new post database from that. [19:22] I've actually done so to save an invisionfree forum. [19:22] Its not as good as a DB dump but its better than nothing [19:23] and of course the archive.org method of scrape all the things works fairly well unless there were sections of the site that are member only [19:23] it's not even the sites going offline that you have to worry about [19:24] all it takes is for an exploit to be released, and they're all fucked [19:24] archive.org's standard crawl has pretty poor forum coverage in my experience [19:24] this is why we dump forums into archivebot :P [19:24] soon it will get better at that job [19:25] right now it's a gamble on whether you're going to run out of memory [19:30] yipdw: 128GB RAM server ftw? :P [19:30] heh nah [19:30] the main problem with archivebot doing forums is that wget stores its URL graph in memory [19:31] chfoo's wpull can use an on-disk database, which should provide acceptable performance [19:31] and (I think) should give us constant memory usage per process [19:31] so why worry about memory [19:31] because archivebot isn't using wpull yet [19:31] ah ok, thus needing more memory [19:32] yeah [19:52] WHAT FORSOOTH, PRITHEE TELL ME [19:52] Requesting help archiving http://www.aux-penelope.com which is due to go dark on Feb 11, 2014 [19:53] mietek: yahoosucks [19:53] penelope is going down?!? [19:53] Check it: http://www.aux-penelope.com/aux_3.0.htm [19:55] Aha! Scott Kanne responded to my email [19:55] looks small, I sicced the archivebot on it [19:55] > Thanks for the kind words - I have added a tarball of the entire site here: [19:55] www.aux-penelope.com/aux-penelope.tar.gz [19:56] sweet [19:59] someone should try and grab that domain [19:59] it's over 10 years old, it's basically gonna get perma parked if scott lets it go [19:59] https://twitter.com/ATArchiveBot/status/426805475355938816 [20:00] hmm true [20:00] i can email him [20:00] shit i'll even pay for the first year and transfer it to SketchCow or someone [20:01] ++ [20:04] i know that's a dangerous (quickly expensive) road to go down, but even keeping it alive and point at archive.org for a year with a 301 would make a huge difference [20:05] > I'm planning to keep the domain for now... I may pass it on at a later date though [20:05] mietek: shit [20:05] i just sent the email [20:05] well at least he knows people are concerned [20:05] Yep [20:05] *donk* [20:07] http://christtrekker.users.sourceforge.net/doc/aux/faq.html [20:18] Jonimus: hah, competition, what a nice thing :)