#archiveteam 2011-08-27,Sat

↑back Search

Time Nickname Message
03:43 🔗 Coderjoe regarding the GAE free limits: http://en.wikipedia.org/wiki/Google_App_Engine#Free_quotas
03:45 🔗 Coderjoe the only problematic ones I see are the BW/day and the "to be removed" CPU time/day
04:02 🔗 Coderjoe yeah... that broken file (friendster.000023000-000240000.tar.gz) breaks at 71%
04:04 🔗 Coderjoe or possibly a little before. I'm not sure what it is checking, exactly, other than perhaps BFINAL and BTYPE
04:06 🔗 Coderjoe and, on the off chance of a stored block, LEN vs NLEN
04:06 🔗 db48xOthe recoverable, you think?
04:07 🔗 Coderjoe i'd have to write a decoder to let me fully determine what's up.
04:08 🔗 Coderjoe (which I don't have a problem doing. the algorithm is easy enough. it isn't LZMA after all)
04:08 🔗 db48xOthe can you find the next good block?
04:08 🔗 Coderjoe there isn't a block signature and crc like there is in bzip2, but it might be possible
04:08 🔗 db48xOthe ah
04:09 🔗 Coderjoe again, with custom tools and looking into what is up in the file
04:09 🔗 db48xOthe I keep running into problems like this, and it always seems like there should be a tool for it
04:10 🔗 db48xOthe a tool that lets you explore the data, and all you have to do is plug in the definition of the data structures
04:10 🔗 Coderjoe and, depending on how bad things are, it might be possible to determine the correct data using the crc the gzip format adds at the end of the file
04:10 🔗 db48xOthe hrm. not with a file that big, I bet
04:11 🔗 Coderjoe well, hex workshop is really good, when working with plain old structures. not so good when you need to do decoding of the data, like with a compression algorithm
04:13 🔗 * db48xOthe googles
04:14 🔗 db48xOthe yea, that does look fairly nice
04:16 🔗 Cowering i've vouched for hex workshop for at least 10 years :)
04:20 🔗 db48xOthe heh
04:21 🔗 Coderjoe it is a really nice hex editor all around. the structure viewer only gets you so far, though
04:21 🔗 Cowering i think everyone in retro community has like 15 windows of it open at any given time...
04:21 🔗 * Cowering slaps DFJustin around a bit with a large trout
04:42 🔗 * DFJustin slaps Cowering around a bit with a large trout
04:43 🔗 DFJustin and yeah I generally have hex workshop at the ready....
04:45 🔗 DFJustin I seem to remember some emu author working on a general data file analysis tool
04:49 🔗 Wyatt|Wor Wasn't Tomato doing similar?
04:51 🔗 Wyatt|Wor (Also, for Linux/Mac users, I recently discovered wxHexEditor. Looks like it's pretty solid)
05:38 🔗 SketchCow The MC Frontalot concert with my video footage is going well!!
05:41 🔗 Wyatt|Wor Spiffy. Is that the porn without porn?
05:41 🔗 SketchCow This PARTICULAR one was for the song Just Once
05:41 🔗 SketchCow Which is about getting bored because you get too much sex
05:41 🔗 SketchCow So he wanted a VIDEO
05:41 🔗 SketchCow So I chose My Little Pony Friendship is Magic
05:41 🔗 SketchCow People liked
05:42 🔗 SketchCow For the OTHER song, called Pr0n Song, I have industrial videos of crushers, screws, and mixers
05:42 🔗 SketchCow Which are all slow, and really creepy with the music
05:42 🔗 db48xOthe lol
05:44 🔗 Wyatt|Wor Awesome. Some industrial kit has the best advertising.
05:47 🔗 Coderjoe reminds me of all the machinery in the video for Happiness in Slavery
05:47 🔗 Wyatt|Wor Caught a link to http://www.youtube.com/user/bigshredder from b3ta at one point. It's like Blendtec only bigger.
05:48 🔗 Coderjoe SSI was doing it before Blendtec
05:49 🔗 Wyatt|Wor Point. I just found Blendtec first.
05:51 🔗 Coderjoe well, SSI also wasn't using youtube until after Blendtec, iirc. it looks like they moved all of their videos over from their own server to youtube :-\
05:57 🔗 Wyatt|Wor Ah, I see. It's brilliant advertising, that's for sure.
06:05 🔗 underscor Coderjoe: Hey, are you the wget wizard with the different dot-options?
06:06 🔗 underscor Trying to figure out how to get these to look better
06:06 🔗 underscor There are waaaaay too many lines
06:06 🔗 Coderjoe i'm not sure what you're asking
06:06 🔗 db48xOthe couple of options
06:06 🔗 underscor I need like 1 line per MB or something
06:06 🔗 db48xOthe --progress=dot:giga
06:06 🔗 Coderjoe oh
06:06 🔗 underscor Just less than 50k per line
06:06 🔗 underscor haha
06:06 🔗 Coderjoe progress options
06:07 🔗 db48xOthe dot:mega will give you three megs per line
06:07 🔗 underscor Ooh, excellent
06:07 🔗 db48xOthe dot:giga will give you one gig per 32 lines
06:07 🔗 db48xOthe you can also control how many dots per line and how many bytes per dot by using the .wgetrc file
06:07 🔗 Coderjoe i'll bbiam... http://wegetsignal.org/tmp/scumbagsetup.jpg
06:08 🔗 underscor db48xOthe: thanks
06:08 🔗 db48xOthe you're welcome
06:08 🔗 db48xOthe I wrote a patch that adds dot:tera, 8 lines per terabyte
06:08 🔗 db48xOthe useful for downloading geocities, for instance
06:17 🔗 underscor haha
07:04 🔗 SketchCow haha
07:04 🔗 SketchCow two weeks in
07:04 🔗 SketchCow ,
07:04 🔗 SketchCow .
07:04 🔗 SketchCow I mean
07:04 🔗 SketchCow ..
07:04 🔗 SketchCow Two weeks later
07:10 🔗 SketchCow http://books.google.com/books?id=xc8TAQAAIAAJ&q=%22BBS+Documentary%22&dq=%22BBS+Documentary%22&hl=en&ei=FZhYTqLkL-PL0QHe6IXODA&sa=X&oi=book_result&ct=result&resnum=5&ved=0CD8Q6AEwBA
07:11 🔗 db48xOthe two weeks later than what?
07:18 🔗 SketchCow The download
07:18 🔗 SketchCow Anyway, sheleves are coming in, I better nap
07:19 🔗 Wyatt|Wor 'night
08:31 🔗 chronomex jesus christ google groups discovery/download is going slow
08:44 🔗 Wyatt|Wor I've got a downloader running, but keep hitting 509s
08:44 🔗 chronomex yeah
11:02 🔗 Eprillios Hiya. :-)
11:04 🔗 Schbirid hi
13:57 🔗 SketchCow I'm not one for gift-horse dental work
13:57 🔗 SketchCow But man, this Star Wars Forums result is a huge mess.
13:57 🔗 SketchCow I'm moving it over to the official teamarchive machine, but wow
14:37 🔗 db48xOthe SketchCow: sounds bad
14:37 🔗 db48xOthe SketchCow: is it fixable?
14:46 🔗 db48xOthe arrg, I think that song made me dumber
15:25 🔗 SketchCow It's just massive, is all.
15:27 🔗 SketchCow On the first machine, flophouse, we have 1.7tb of data left
15:27 🔗 SketchCow Now, that's a good number less than the original 70tb of space, but still, I'm working to pull that out over to the other machine, and some things are completely loose and not easy to get to.
15:28 🔗 SketchCow I have to look at blindtiger shortly.
19:58 🔗 SketchCow Man, I keep taking naps
20:24 🔗 Jofo naps are great
21:25 🔗 ndurner ok, we got ALL of the Google Groups files
21:25 🔗 ndurner we're harvesting the index for the second time for about two weeks now to make SURE we got EVERYTHING
21:26 🔗 ndurner I was confused because I saw tons of new groups coming in
21:26 🔗 ndurner turns out, they are in fact new
21:26 🔗 chronomex heh, neat
21:27 🔗 ndurner swebb1: read the log
21:27 🔗 chronomex I've got like nine megs of groups files. shall I rsync them somewhere?
21:27 🔗 ndurner talk to SketchCow
21:28 🔗 chronomex k
21:28 🔗 chronomex groups: chapterowls we-are-here-to-make-you-smile hot-male-models cider-workshop wearehere-to-make-you-smile--think h2kinfosysqabaenquiries education-job-motivation-discussion-entertainment rehman-tech5 sapml indiarealestates california_swingers LOGIC-ID latest-trendsngadgets firstsalary rehman-tech4
21:28 🔗 chronomex AWESOME
21:28 🔗 chronomex wait. we have a list of all google groups?
21:28 🔗 chronomex o_O
21:28 🔗 ndurner you could also upload those 9 MB to me
21:29 🔗 ndurner yes, we absolutely do
21:29 🔗 chronomex that's fantastic.
21:29 🔗 ndurner the next thing I want to do is to grab the info from the about pages
21:29 🔗 ndurner so that we can build a nice, ever-lasting index for the files
21:31 🔗 chronomex http://gir.seattlewireless.net/~chronomex/ggroups-chronomex.tar.gz
21:31 🔗 chronomex md5sum: 454d41a1d79a40beb16ce0cfcb01cfb5
21:32 🔗 ndurner got it, thanks
21:32 🔗 chronomex grand
23:10 🔗 SketchCow Hey, gang.
23:10 🔗 SketchCow How big are the google groups, ndurner?

irclogger-viewer