#archiveteam 2011-08-26,Fri

↑back Search

Time Nickname Message
00:13 🔗 chronomex wait there are people trying to sell this book?
00:13 🔗 chronomex http://www.amazon.com/gp/product/0966288912
00:16 🔗 Soojin http://contemporary-home-computing.org/1tb/wp-content/uploads/screenshot_200.png
00:16 🔗 Soojin rofl
00:16 🔗 Soojin nsfw
00:20 🔗 illunatic hahaha
00:20 🔗 illunatic i'm going to buy that book
01:02 🔗 Wyatt Hmm, ndurner isn't here and I haven't managed to get anything from groups in hours. Suboptimal.
01:19 🔗 swebb Wyatt: ditto. My discovery crawler has been going well, but the downloaders have all been idle for 20-ish hours
01:20 🔗 swebb https://skitch.com/scumola/fu7nu/cacti
02:05 🔗 underscor DFJustin: That's intriguing
02:05 🔗 underscor I can do that now with the hadoop infrastructure, but not publicly (yet, at least)
02:06 🔗 underscor It's very resource intensive
03:02 🔗 underscor hot damn
03:02 🔗 underscor it actually fucking worked
03:02 🔗 underscor I don't know who requested it
03:02 🔗 underscor but http://tracker.archive.org/torrent/torrents/FRIENDSTER-000000000-all.torrent is valid and everything
03:02 🔗 underscor That was a good stress test
04:36 🔗 DFJustin http://timemachine.6x.to/
04:40 🔗 SketchCow I'm so happy, these groups fo hams have been FLOODING me with descs.
04:41 🔗 SketchCow A 268 issue run may be totally described in no time at all.
06:07 🔗 DFJustin underscor: also is there a reason why archive.org labels all AVI files as Cinepack
06:07 🔗 SketchCow He's even less qualified to answer these questions than me.
06:07 🔗 SketchCow But AVI is a container format, which they called Cinepack.
06:07 🔗 SketchCow The video handling is ALL based off of ffmpeg, if that helps at all.
08:43 🔗 alard underscor: Sorry, couldn't restrain myself, had to try the Friendster torrent again. I hope it didn't cause too much trouble. Your new torrent thing works wonderfully.
09:01 🔗 SketchCow Well, it only kills MY machine
09:07 🔗 alard Oh, that's not a problem then. :)
09:18 🔗 SketchCow Well, technically you are rsyncing to a machine crippled by the torrent generation.
09:18 🔗 SketchCow So it's your problem too.
09:18 🔗 SketchCow They gave him a new machine to destroy, so the trouble will pass momentarily.
16:25 🔗 Drgonz0 what are you archiving exactely?
16:31 🔗 DFJustin EVERYTHING
16:33 🔗 Drgonz0 okay
16:35 🔗 Cowering archive all the HP Touchpad free apps before HP takes that site offline...
16:37 🔗 Drgonz0 good idea
16:38 🔗 illunatic i'm archiving the stanford engineering everywhere courses
16:39 🔗 illunatic the torrent links there are already dead so i'm having to get them from youtube
17:04 🔗 underscor SketchCow: It's off your machine now, has been for 2 days :P
17:05 🔗 underscor alard: Does it seem like a valid torrent/downloads okay?
17:33 🔗 SketchCow Oh thank good
17:33 🔗 SketchCow ness
17:35 🔗 db48xOthe heh
17:40 🔗 db48xOthe 47.9GB 106:58:36 [ 130kB/s]
17:40 🔗 db48xOthe Total bytes written: 51438039040 (48GiB, 131KiB/s)
17:40 🔗 db48xOthe very slow
17:42 🔗 SketchCow http://www.youtube.com/watch?v=5yrA-BpWECI&feature=player_embedded
17:46 🔗 alard underscor: Yes, my Deluge torrent client accepted it and started downloading from the web seed. I didn't wait for it to finish, of course, but everything seemed ok.
19:45 🔗 db48xOthe I've failed over a dozen times to download http://www.archive.org/download/FRIENDSTER-000000000/friendster.000023000-000240000.tar.gz
19:55 🔗 underscor db48xOthe: Via torrent or http?
20:20 🔗 alard db48xOthe: I have just downloaded that file, the sha1 checksum matches the one in the archive.org _files.xml
20:20 🔗 alard But it seems that there is a problem with the gzip format of the file.
20:21 🔗 alard gzip: stdin: invalid compressed data--format violated
20:22 🔗 alard after 1,63 GB.
20:36 🔗 underscor uh oh
20:38 🔗 chronomex hmmm.
20:40 🔗 alard According to the wiki, I uploaded that file. I don't have a copy, so I can't check where it went wrong.
20:54 🔗 underscor alard: hahaha, I love the 193M log file the friendster download generated
20:54 🔗 underscor http://tracker.archive.org/torrent/logs/
20:59 🔗 underscor ndurner: What is error 444 again?
20:59 🔗 chronomex Connecting to archiveteamorg.appspot.com|74.125.53.141|:80... connected.
20:59 🔗 chronomex HTTP request sent, awaiting response... 509 Bandwidth Limit Exceeded
20:59 🔗 chronomex who's in charge of this? we should consider self-hosting in future.
21:00 🔗 underscor chronomex: It's not really bw limit exceeded iirc
21:00 🔗 chronomex oh?
21:00 🔗 underscor It means there's no work or something like that
21:00 🔗 alard underscor: Ah, yes, I noticed that. :) -- db48x's wants to solve that, but no-one listens: https://savannah.gnu.org/bugs/index.php?33654
21:00 🔗 chronomex oh. damn.
21:00 🔗 underscor You have to ask ndurner
21:00 🔗 chronomex ok.
21:00 🔗 ndurner 444 = nothing to do ATM
21:00 🔗 * chronomex runs discover process then
21:01 🔗 alard The groups thing seems to be bandwith-limited a lot, these days.
21:01 🔗 ndurner 509 = throttled so we're not overshooting the quota
21:02 🔗 chronomex after ggroups pulls down their files permanently, we should use this list of group names to suck in messages too.
21:02 🔗 underscor ndurner: There's no way to increase the quota?
21:02 🔗 underscor (How do they track quota anyway?)
21:02 🔗 underscor Er, rather, what do they limit?
21:03 🔗 ndurner other than paying up, no
21:03 🔗 chronomex requests per hour I think
21:03 🔗 chronomex oh god no not the $12 fee
21:03 🔗 ndurner the quota is about using or blocking resources
21:03 🔗 chronomex aren't there opensource GAE-compatible environments available?
21:04 🔗 ndurner (Thread.sleep() doesn't actually *use* CPU, but blocks it, so you still pay)
21:04 🔗 chronomex huh
21:04 🔗 underscor ndurner: How much would it cost to open it up?
21:05 🔗 chronomex http://code.google.com/appengine/docs/billing.html
21:06 🔗 ndurner hard to say, because there are no real limits
21:06 🔗 underscor You can set a "Max Daily Budget" though, can't you?
21:06 🔗 chronomex yes
21:06 🔗 ndurner yes
21:07 🔗 underscor Oh, I see what you're saying
21:07 🔗 underscor Each additional scraper/downloader increases the resource requirements
21:09 🔗 swebb Jason's doing a 'hangout' on G+ if you guys are interested. I'm on there now too.
21:09 🔗 chronomex 1) link? 2) how is that different from irc exactly?
21:09 🔗 underscor Video, iirc?
21:10 🔗 swebb muelti-person video.
21:10 🔗 chronomex oh, video
21:10 🔗 chronomex I don't have any video gear, strangely enough
21:10 🔗 underscor http://ge.tt/#8kWBpB7
21:10 🔗 underscor Friend just finished mixing his cover of Skyscraper <3
21:12 🔗 chronomex I confuse huddle and hangout a lot
21:15 🔗 chronomex if anyone cares, this is me: https://plus.google.com/u/2/118060174030033503719/
21:22 🔗 underscor chronomex: Is that your real name?
21:22 🔗 chronomex my legal name is "Duncan Smith"; that is a Hangul transliteration of my legal name.
21:23 🔗 underscor Oh, I see
21:23 🔗 chronomex I do not normally look like a vintage tugboat
21:24 🔗 underscor lol
21:24 🔗 chronomex I don't approve of the conflation of "legal name" with "true name"
21:24 🔗 DFJustin swit deonkeon, eh
21:24 🔗 swebb chronomex: You're a boat? :)
22:20 🔗 db48x2 aww
22:20 🔗 db48x2 my Friendster archive topped out at 995GB
22:28 🔗 DFJustin 10^12 bytes is 931 gibibytes, so using hd manufacturer logic you could call it over a terabyte :)
22:28 🔗 db48x2 heh
22:29 🔗 * db48x2 throws a --si on there
22:29 🔗 db48x2 1.1T total
22:29 🔗 db48x2 :)
23:12 🔗 SketchCow Damn!
23:32 🔗 underscor Adding cmalp_00001 to the torrent generation queue
23:32 🔗 underscor Recieved request from 127.0.0.1 for cmalp_00001 - 2011-08-26 23:31:36.966885
23:32 🔗 underscor Satisfied request from 127.0.0.1 for cmalp_00001 - 2011-08-26 23:31:36.969984
23:32 🔗 underscor <3
23:32 🔗 underscor I love it
23:34 🔗 Aranje Nice and fast, too
23:34 🔗 db48x2 underscor: looks like you've done a good job on that
23:42 🔗 SketchCow Any other people need slots?
23:48 🔗 underscor db48x2: Aranje Thanks <3
23:48 🔗 Aranje <3

irclogger-viewer