[00:16] ooooooh shiiiit [00:16] http://www.theverge.com/2013/5/17/4342012/yahoo-reportedly-nearing-1-1-billion-deal-to-acquire-tumblr [00:17] is it already on fire drill? [00:26] we've already done proof-of-concept grabs in the past https://archive.org/details/archiveteam-tumblr-test-warc [00:26] we straight up don't have enough storage to hold tumblr though [02:21] so much porn [02:27] i'm finally finding more call for help episodes [02:28] in dialup format of course [05:39] Hey. [05:46] Everyone wants us to back up tumblr. [05:46] I may make some noise for attention, but we have lots of time. [05:46] I wonder how big it is. [06:09] Can we even get a rough estimate? [06:10] Of course. We just sample the whole [06:10] Pick a few hundred random blogs and download them [09:51] Is that a good idea to run multipe VM of the Warrior? [09:51] (on the same computer/IP I mean) [09:56] you can, [09:56] not a problem at all, just needs lots of resources. [09:56] tryphon: you on windows or linux? [09:56] os x [09:56] ah :D [09:56] 10.7.4 [09:57] then yeah multiple warriors is your easiest way if you want to do that. [09:57] But I also have a synology nas (powerpc though) [09:57] well yuo can try and run the scripts directly [09:57] instructions are on our wiki. [09:58] will have a look ;) thx [10:02] hmm two VM that target the same port 8001 is ok? [10:06] tryphon: Don't think so. [10:09] Either you won't be able to access one warrior's web interface, or one might not work at all. [10:09] Not sure how it behaves. [10:10] It seems that the second VM doesn't have any network activity at all. [10:10] And we can not (easely) change the port, right. [10:10] -. +? [10:14] I believe virtualbox has an option to route the port to a different port somewhere [10:28] or you change the port in /home/warrior/warrior-code2/warrior-runner.sh [10:28] @GLaDOS fount it :) 1/ Go to VM prefs > network > adaptater 1 - http://imgur.com/NPsa0He [10:29] then go to "port forwading" and change the "host port" - http://imgur.com/NPsa0He [10:29] oops, first picture would have been http://imgur.com/r8jrs7d sorry [11:07] yeah you change it in port forwarding [11:07] don't change the warrior code, i think it'll revert at next update. [14:45] :( uploading a 4GB item at 50kB/s [17:03] D: [18:06] I successfully torrented the TOSEC-PIX [18:09] cool [18:14] Yeah [18:15] So, I'm going to make the decision to unpack it and install it. [19:01] Tumblr is interesting but enormous. If you assume a similar average size of material per-user as Posterous (not necessarily safe) - say 2mb per user - then it's immediately 200TB before you even start. [19:02] Could easily be several times that, or worse. [19:04] /me steps out to buy some extra USB sticks, just in case [19:04] :) [19:08] That grab from 2010 averages at 72mb per user - so 7200TB [19:08] i grabbed g4tv tumbler [19:08] that was over 400mb [19:09] if i remember right [19:12] Mm, there are some enormous posterous blogs too, but the raw average is so low purely because there's so many users whose entries are tiny, or text-only, or spam. [19:13] Assume Moore's Law and say that user's Tumblrs double in size every 18 months (bigger/better pictures, more content, video, etc.) [19:13] That's 288mb per user, or 28 petabytes. [19:13] 14 times the size of the entire Wayback machine. [19:14] Wait, this can't be right.. [19:14] you just realized that [19:14] ha ha [19:14] most video is likely links to youtube [19:15] If you do decide to archive tumblr, I think I'll be washing my hair that day. But good luck with that. :) [19:16] the most likely tumbler accouts to archive if its to big is to go after ones that are linked to alot of wikis [19:18] "All tumblrs are equal, but some are more celebrilicious than others." :) [19:18] cause there most likely have a lot of info that is good [19:18] agree but at 28pb its like backing up facebook or youtube [19:18] its just not going to be alot of it [19:18] antomatic: not impossible, megaupload had 28pb when it went down [19:18] facebook has over 100pb of storage now [19:19] Coo.. [19:19] Even keeping up with the new content (about 70 million new posts each day) would be a stretch. [19:19] It does sound like fun, though.. :) [19:20] it almost have go like geocities did [19:21] be mostly died in 2 years and stay up for the next 11 years after that [19:25] i uploaded some more tekzilla episodes [19:29] SketchCow: I got a interview of Richard Garriott [19:32] Great [20:03] Tumblr has 107.8 million blogs adn 50.6 billion posts [20:07] interview with Michael Limbar from Angel Studios [20:07] *Limber [20:27] i found a 2 part interview with Yu Suzuki [20:28] and a intereview with Kevin Eastman [23:55] WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD [23:58] 'yahoosucks' [23:58] thank you fair sir