[01:02] Nuts. I was looking into a problem from yesterday, and the server now incorrectly believes that one's done - it's tauran, which Wyatt|Wor had problems with. [07:44] Coderjoe: ah, well - I lost the conversation in the backlog so I just thought you were asking what it meant :) [07:56] does the archive.org flash/javascript interface use chunked uploading? [11:58] i can never remember how to redirect stderr to devnull [11:58] 2>/dev/null [12:01] https://github.com/SpiritQuaddicted/fileplanet-file-download/blob/master/download_pages_and_files_from_fileplanet.sh is much nicer now [12:12] http://www.pastie.org/3867284 [12:15] working well [12:35] Schbirid: that's not #AT bizniz [12:36] IMO [12:46] Schbirid: need more people to help download fileplanet? [12:46] yes, definitely. i am just deciding on the final packaging then i would have asked [12:46] how big can one archive.org item become? [12:48] AFAIK it can be any size [12:49] preferebly it should be smaller though [12:58] argh, got a bug with 's [13:04] i am too dumb to figure out how to remove the last character from a string in bash or gnu coreutils [13:07] | rev | cut -c 2- | rev [13:07] heh [13:07] well, why not [13:37] Schbirid: We aim for 10GB [13:37] bigger than that and you can run into task issues, as there is only ~10GB guaranteed to be free on a datanode drive at any point [14:23] hm, anyone able to download http://www.fileplanet.com/52249/download ? [14:24] i always get a 403 forbidden [14:25] same [14:26] please refresh, check the source for the link (grep for default-file-download-link) and try pasting that into your address bar [14:27] same [14:28] cheers [14:28] (i like how they have single quotes in filenames and use single quotes in their javascript as well) [14:29] it's available at http://www.gamefront.com/files/13625/GRIST_MILLBY [14:30] 50000-54999 is 24G already, ugh [15:16] 20k-30k: ~7-8GB [15:16] 30k-40k: 10GB [15:16] 40k-50k: 18G [15:16] 50k-55k: 25G [15:17] i am scared. might mean that we'd need to do 1-2k increments. the end would be at 250k or something [15:17] bbl [15:22] Schbirid, put 5k files per item then [15:37] don't be scared! [16:36] mobileme news: http://arstechnica.com/apple/news/2012/05/free-20gb-cloud-storage-for-mobileme-subscribers-extended-to-sept-30.ars [16:40] http://archive.org/post/419499/chumbycom-is-going-away-request-for-archiving [16:44] oh neat [16:44] Github added organization display to user profiles [16:44] Archive Team needs a snazzy gravatar now [16:44] maybe we can reuse the unicorn one [16:46] (for those who don't know: http://archiveteam.org/images/0/05/Rejectedatlogo.jpg) [16:47] I vote yes! [16:48] I still wish Github let you follow organizations. [17:07] alright, who wants to download some fileplanet! [17:07] right now you will need to tar it manually in the end [17:08] i guess we will just upload each tar seperately and have someone put them together into a collection? [17:18] yes [17:18] * Nemo_bis is already downloading thousands of wikis [17:19] ok [17:19] https://github.com/SpiritQuaddicted/fileplanet-file-download/blob/master/download_pages_and_files_from_fileplanet.sh [17:20] run: download_pages_and_files_from_fileplanet.sh 55000 59999 [17:20] will be about 30G i guess [17:28] registering on the forums is still not possible? do we have a shared account i could use? [18:10] Apple is extending its free storage offer to paid MobileMe subscribers from June 30 to September 30, 2012 [18:11] http://venturebeat.com/2012/05/06/brazil-facebook-lies/ <-- fucking Black Mirror [18:16] also, props to whatever generated the URL slug, because it's totally apropos [18:46] Nemo_bis: you running it? [18:47] Schbirid, what? [18:48] if you mean your script I'm not, as I said I'm already busy with wikis, load was at 60 a few min ago [18:48] oh, i guess i misunderstood you [18:48] 7 now but still no disk space :) [18:48] heh [18:48] nice [19:59] The Swedish site http://www.resdagboken.se is closing down, from a press release by their owners (The large Norwegian(?) company/media conglomerate Schibsteds). The site is a "travel journey diary" for travelers, so it's mostly if not totally only user-made content [20:00] Unsure if the content is going to be deleted, but.. if something's on it's deathbed, most likely. They've disabled the ability to create new users/logins as well as new 'journey diaries'. But existing diaries can be updated until 15 June 2012 [20:04] There's at least 17 million images and 2 million "journey diaries" from users according to their stats in the press release [20:05] Think it'll be better to sweep through now, or wait for last call? [20:06] not sure, but earlier is always better [20:07] Might be better to wait a bit - it loooks like people are doing final entries now [20:09] hm, true. But starting out finding users diaries and such might be good [20:19] Doesn't look nicely structured, sadly [20:44] http://archiveteam.org/index.php?title=Fileplanet [20:46] Until account creation's back up, I'll probably give Schbird my credentials to keep the page count updated [20:47] Or it might be better not to, so someone's keeping track of all the downloads [21:51] If someone sees schibirid, tell him I have a list of all the valid ids [21:51] it's much more efficient than brute-forcing every number between 1 and 220000 [21:53] underscor: Spiffy; how many are there? [21:53] wc -l valid [21:53] 87190 valid [21:53] * shaqfu updates the page [21:54] Any estimates on size? [21:56] nope [21:56] I just extracted them from the sitemap XML files [21:56] Gotcha [22:24] underscor, email him? [23:56] HUZZAH ARCHIVETEAM [23:56] hello! [23:59] Con go well?