[03:43] regarding the GAE free limits: http://en.wikipedia.org/wiki/Google_App_Engine#Free_quotas
[03:45] the only problematic ones I see are the BW/day and the "to be removed" CPU time/day
[04:02] yeah... that broken file (friendster.000023000-000240000.tar.gz) breaks at 71%
[04:04] or possibly a little before. I'm not sure what it is checking, exactly, other than perhaps BFINAL and BTYPE
[04:06] and, on the off chance of a stored block, LEN vs NLEN
[04:06] recoverable, you think?
[04:07] i'd have to write a decoder to let me fully determine what's up.
[04:08] (which I don't have a problem doing. the algorithm is easy enough. it isn't LZMA after all)
[04:08] can you find the next good block?
[04:08] there isn't a block signature and crc like there is in bzip2, but it might be possible
[04:08] ah
[04:09] again, with custom tools and looking into what is up in the file
[04:09] I keep running into problems like this, and it always seems like there should be a tool for it
[04:10] a tool that lets you explore the data, and all you have to do is plug in the definition of the data structures
[04:10] and, depending on how bad things are, it might be possible to determine the correct data using the crc the gzip format adds at the end of the file
[04:10] hrm. not with a file that big, I bet
[04:11] well, hex workshop is really good, when working with plain old structures. not so good when you need to do decoding of the data, like with a compression algorithm
[04:13] * db48xOthe googles
[04:14] yea, that does look fairly nice
[04:16] i've vouched for hex workshop for at least 10 years :)
[04:20] heh
[04:21] it is a really nice hex editor all around. the structure viewer only gets you so far, though
[04:21] i think everyone in retro community has like 15 windows of it open at any given time...
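Aside on the 04:04-04:06 exchange: the fields named there (BFINAL, BTYPE, and LEN vs NLEN for a stored block) are about the only cheap sanity checks a DEFLATE block header offers. Below is a minimal Python sketch of that check. It assumes you already know the bit offset where a block starts, which is the hard part in a damaged file. The names read_bits and check_block_header are made up for illustration and do not come from any tool mentioned in the log.

```python
import struct


def read_bits(data, bit_pos, count):
    """Read `count` bits starting at absolute bit position `bit_pos`.

    DEFLATE packs header bits starting from the least significant bit
    of each byte, so bit 0 of the stream is the low bit of data[0].
    """
    value = 0
    for i in range(count):
        byte = data[(bit_pos + i) // 8]
        bit = (byte >> ((bit_pos + i) % 8)) & 1
        value |= bit << i
    return value


def check_block_header(data, bit_pos=0):
    """Return the header fields of a candidate DEFLATE block.

    'plausible' is False for the reserved BTYPE value 3, or for a
    stored block whose NLEN is not the one's complement of LEN.
    """
    bfinal = read_bits(data, bit_pos, 1)
    btype = read_bits(data, bit_pos + 1, 2)
    info = {"bfinal": bfinal, "btype": btype, "plausible": btype != 3}
    if btype == 0:
        # Stored block: skip to the next byte boundary, then read
        # LEN and NLEN as little-endian 16-bit values.
        byte_pos = (bit_pos + 3 + 7) // 8
        length, nlength = struct.unpack_from("<HH", data, byte_pos)
        info["len"] = length
        info["plausible"] = (length ^ 0xFFFF) == nlength
    return info
```

For example, check_block_header(bytes([0x01, 0x05, 0x00, 0xFA, 0xFF]) + b"hello") describes a final stored block of 5 bytes. For compressed blocks (BTYPE 1 or 2) the header alone says very little, which fits the remark that a real decoder would be needed to see what is wrong.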
[04:21] * Cowering slaps DFJustin around a bit with a large trout
[04:42] * DFJustin slaps Cowering around a bit with a large trout
[04:43] and yeah I generally have hex workshop at the ready....
[04:45] I seem to remember some emu author working on a general data file analysis tool
[04:49] Wasn't Tomato doing similar?
[04:51] (Also, for Linux/Mac users, I recently discovered wxHexEditor. Looks like it's pretty solid)
[05:38] The MC Frontalot concert with my video footage is going well!!
[05:41] Spiffy. Is that the porn without porn?
[05:41] This PARTICULAR one was for the song Just Once
[05:41] Which is about getting bored because you get too much sex
[05:41] So he wanted a VIDEO
[05:41] So I chose My Little Pony Friendship is Magic
[05:41] People liked
[05:42] For the OTHER song, called Pr0n Song, I have industrial videos of crushers, screws, and mixers
[05:42] Which are all slow, and really creepy with the music
[05:42] lol
[05:44] Awesome. Some industrial kit has the best advertising.
[05:47] reminds me of all the machinery in the video for Happiness in Slavery
[05:47] Caught a link to http://www.youtube.com/user/bigshredder from b3ta at one point. It's like Blendtec only bigger.
[05:48] SSI was doing it before Blendtec
[05:49] Point. I just found Blendtec first.
[05:51] well, SSI also wasn't using youtube until after Blendtec, iirc. it looks like they moved all of their videos over from their own server to youtube :-\
[05:57] Ah, I see. It's brilliant advertising, that's for sure.
[06:05] Coderjoe: Hey, are you the wget wizard with the different dot-options?
[06:06] Trying to figure out how to get these to look better
[06:06] There are waaaaay too many lines
[06:06] i'm not sure what you're asking
[06:06] couple of options
[06:06] I need like 1 line per MB or something
[06:06] --progress=dot:giga
[06:06] oh
[06:06] Just less than 50k per line
[06:06] haha
[06:06] progress options
[06:07] dot:mega will give you three megs per line
[06:07] Ooh, excellent
[06:07] dot:giga will give you one gig per 32 lines
[06:07] you can also control how many dots per line and how many bytes per dot by using the .wgetrc file
[06:07] i'll bbiam... http://wegetsignal.org/tmp/scumbagsetup.jpg
[06:08] db48xOthe: thanks
[06:08] you're welcome
[06:08] I wrote a patch that adds dot:tera, 8 lines per terabyte
[06:08] useful for downloading geocities, for instance
[06:17] haha
[07:04] haha
[07:04] two weeks in
[07:04] ,
[07:04] .
[07:04] I mean
[07:04] ..
[07:04] Two weeks later
[07:10] http://books.google.com/books?id=xc8TAQAAIAAJ&q=%22BBS+Documentary%22&dq=%22BBS+Documentary%22&hl=en&ei=FZhYTqLkL-PL0QHe6IXODA&sa=X&oi=book_result&ct=result&resnum=5&ved=0CD8Q6AEwBA
[07:11] two weeks later than what?
[07:18] The download
[07:18] Anyway, shelves are coming in, I better nap
[07:19] 'night
[08:31] jesus christ google groups discovery/download is going slow
[08:44] I've got a downloader running, but keep hitting 509s
[08:44] yeah
[11:02] Hiya. :-)
[11:04] hi
[13:57] I'm not one for gift-horse dental work
[13:57] But man, this Star Wars Forums result is a huge mess.
[13:57] I'm moving it over to the official teamarchive machine, but wow
[14:37] SketchCow: sounds bad
[14:37] SketchCow: is it fixable?
[14:46] arrg, I think that song made me dumber
[15:25] It's just massive, is all.
[15:27] On the first machine, flophouse, we have 1.7tb of data left
[15:27] Now, that's a good number less than the original 70tb of space, but still, I'm working to pull that out over to the other machine, and some things are completely loose and not easy to get to.
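Aside on the wget progress-dot exchange at 06:06-06:08: the per-line figures quoted there follow from two numbers per style, bytes per dot and dots per line. The short Python sketch below does that arithmetic using the values documented in the wget manual for the built-in styles. The dot:tera patch mentioned at 06:08 is not included, since its parameters are not given in the log. The names DOT_STYLES, bytes_per_line and lines_for are made up for this illustration.

```python
KIB = 1024
MIB = 1024 * KIB
GIB = 1024 * MIB

# (bytes per dot, dots per line) for wget's built-in dot styles,
# as documented for --progress=dot:STYLE in the wget manual.
DOT_STYLES = {
    "default": (1 * KIB, 50),
    "binary": (8 * KIB, 48),
    "mega": (64 * KIB, 48),
    "giga": (1 * MIB, 32),
}


def bytes_per_line(style):
    """Bytes represented by one full line of progress dots."""
    dot_bytes, dots_in_line = DOT_STYLES[style]
    return dot_bytes * dots_in_line


def lines_for(style, total_bytes):
    """Number of full progress lines printed for a download of total_bytes."""
    return total_bytes // bytes_per_line(style)
```

So the default style prints a line every 50 KiB, dot:mega every 3 MiB, and dot:giga every 32 MiB, which is 32 lines per GiB. Custom values can be set in .wgetrc with the dot_bytes, dots_in_line and dot_spacing commands.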
[15:28] I have to look at blindtiger shortly.
[19:58] Man, I keep taking naps
[20:24] naps are great
[21:25] ok, we got ALL of the Google Groups files
[21:25] we're harvesting the index for the second time for about two weeks now to make SURE we got EVERYTHING
[21:26] I was confused because I saw tons of new groups coming in
[21:26] turns out, they are in fact new
[21:26] heh, neat
[21:27] swebb1: read the log
[21:27] I've got like nine megs of groups files. shall I rsync them somewhere?
[21:27] talk to SketchCow
[21:28] k
[21:28] groups: chapterowls we-are-here-to-make-you-smile hot-male-models cider-workshop wearehere-to-make-you-smile--think h2kinfosysqabaenquiries education-job-motivation-discussion-entertainment rehman-tech5 sapml indiarealestates california_swingers LOGIC-ID latest-trendsngadgets firstsalary rehman-tech4
[21:28] AWESOME
[21:28] wait. we have a list of all google groups?
[21:28] o_O
[21:28] you could also upload those 9 MB to me
[21:29] yes, we absolutely do
[21:29] that's fantastic.
[21:29] the next thing I want to do is to grab the info from the about pages
[21:29] so that we can build a nice, ever-lasting index for the files
[21:31] http://gir.seattlewireless.net/~chronomex/ggroups-chronomex.tar.gz
[21:31] md5sum: 454d41a1d79a40beb16ce0cfcb01cfb5
[21:32] got it, thanks
[21:32] grand
[23:10] Hey, gang.
[23:10] How big are the google groups, ndurner?
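Aside on the md5sum posted at 21:31: whoever fetches a tarball like that would normally check it against the posted digest before using it. A minimal Python sketch of that check follows, using only hashlib from the standard library. The function names file_md5 and matches are made up for illustration, and the local filename in the usage note is an assumption.

```python
import hashlib


def file_md5(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path, expected_hex):
    """True if the file's MD5 equals the posted digest (case-insensitive)."""
    return file_md5(path) == expected_hex.strip().lower()
```

Assuming the download was saved as ggroups-chronomex.tar.gz in the current directory, matches("ggroups-chronomex.tar.gz", "454d41a1d79a40beb16ce0cfcb01cfb5") would report whether the transfer arrived intact. MD5 is fine for catching transfer damage, but it is not a defence against deliberate tampering.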