[00:09] the core idea is that you have a huge number of eyeballs watching your feed, and advertisers LOVE eyeballs, and they figure people love money and will sell the eyeballs in their feed. [00:09] waoh [00:09] scrollbakc [00:12] that sounds disgusting [00:12] money is dirty and does not belong anywhere near eyeballs [00:14] http://achewood.com/index.php?date=12192002 [00:37] found out there is a pirate magazine in russia [00:37] turns out its porn [00:38] godane: IS IT PIRATE PORN [00:38] OR PIRATES IN PORN [00:38] YARRR ME BOOTY [00:39] there leather/latex in them [00:40] you mean, girls from somali? [00:40] thats the first thing that comes to my mind when I read "PIRATE PORN" [00:40] i think there russia [00:41] yes, but I never heard of pirates in Russia [00:41] Somalia, all the time [00:41] how it is pirate [00:42] I mean, is a pirate magazine because it is not legal? [00:46] it's got to be porn of pirates [00:46] I don't really want porn of warez kiddies [00:47] different strokes for different folks [00:51] YARRR ME BOOTY [00:52] i'm getting this error again: Did you FTP upload to subdirectory "TechTV_TSS_2002_Full_Episodes" on server "items-uploads.archive.org"? [00:52] Can you fix that SketchCow? [00:57] i'm going to backup the.scene episodes [00:57] looks like maybe 3 from the first season was uploaded to archive.org [01:07] SketchCow: i've been busy with some cleanup related stuff here but have not forgotten about checking the disk images [02:00] http://archive.org/details/TechTV_TSS_2002_Full_Episodes [04:00] SketchCow: dumping the proms from at&t thing is slow progress :( [04:00] also i need to hunt down a good 360k drive still [04:44] ok almost done with at&t thing [04:44] wish i could get the drive out of it, it looks relatively easy to mod [06:25] http://arstechnica.com/information-technology/2012/05/digital-archivists-technological-custodians-of-human-history [06:43] fuck yeah archiveteam [07:05] I feel like this should be used somewhere http://i.imgur.com/yBFtb.jpg [07:07] oh yes :] [09:39] Hi, what would be a good number of files per IA item? 18,000 items with up to 100 files each, or 1,800 items of 1000 files each? [09:40] (The files, Tabblo zips, are between 3 and 30 MB.) [09:48] hmmmm [09:48] so assume 10M average filesize [09:49] thought it would be handy to upload the Tabblo-provided zip files as-is, without tarring, so it's easy to look inside them. [09:49] 1000 files @ 10M each -> 1G itemsize -> this sounds reasonable [09:49] er, 10G, still reasonable [09:49] Yeah, it's probably even less, since not every id exists. [09:51] sounds like we have an answer ;) [09:54] Indeed, thanks! [10:02] Lord_Nigh: what at&t thing? [10:13] http://archive.org/details/archiveteam-tabblo-0 [11:06] interesting... [11:07] upcoming comic titled Wizzywig, which has themes of hacking, phreaking, and the like [11:08] cover art is modeled after a mac plus or so [11:08] hmm [11:09] i suck. longish running web comic. getting a book printing soon, though [11:20] alard: chronomex: above ~1,000ish items, the webnodes start failing to render [11:21] but you'll probably be okay as long as they're definitely not any more than 1k [12:03] If anyone wants to help downloading Tabblo, here's a script: http://archiveteam.org/index.php?title=Tabblo#Downloading_ZIPs [12:22] Greetings all! [12:23] Hi. [12:35] what the hell [12:35] /bin/cp: cannot create directory ‘./www.fileplanet.com/212045’: Too many links [12:36] ext3:( [12:37] ah, 32k limit [13:17] wait... the fireplanet script doesn't do multiple subdirs? [13:19] dont worry [13:19] that was my mirroring of meta stuff [13:19] i did run into that problem earlier but did not check why, simply did 5000 per dir [13:19] now i tried merging them ;) [14:04] Could the person downloading Tabblo zip range 10 also claim that range on the wiki? [14:09] is there a way to protect from silent deletion of items at archive.org, eg get a message from them beforehand? i do not have the space to mirror the fileplanet stuff locally too so it would be devastating if it got deleted by IGN or some other request [14:16] it's not deleted, only hidden [14:16] sweet [14:16] thanks [14:16] just moved to the darchives [14:20] sort alexa 5.000 by dmoz links: [14:20] alexa website dmoz-links 6 .wikipedia.org 28722 1937 .angelfire.com 23835 221 .free.fr 19709 1250 .topix.com 16883 11 .blogspot.com 15158 4 .yahoo.com 13863 47 .imdb.com 9623 3629 .freewebs.com 6525 575 .wunderground.com 5453 658 .ocn.ne.jp 5371 49 .bbc.co.uk 5164 316 .narod.ru 4690 727 .geocities.jp 4520 372 .nifty.com 4195 [14:21] i wonder how long angelfire has [14:21] yeah [14:21] i think i just discovered an early-alert algorithm for archiveteam ;) [14:22] has...? [14:22] apart from "case: owner == yahoo"? [14:22] has until it gets killed [14:31] stil... " Lycos Chat is different" - http://chat.lycos.co.uk/ [14:55] Going to my doctor's for a quick checkup, then back to work on things. [16:55] SketchCow: http://news.ycombinator.com/item?id=4002153 [18:11] whee. now I have to go to Monroe, MI for work on wednesday, so I'll be even closer, I suppose. I'm not sure when I'll be done, but it should be well before the presentation. [18:15] I'm not sure this meets the core mission, but it seems that fileplanet.com is being shut down. There are games/mods/etc that perhaps might only exist there. Or maybe not. [18:16] warthurto: we are way ahead of you :))) http://archiveteam.org/index.php?title=Fileplanet#Status [18:17] * warthurto egg on face [18:17] Must have made a search mistake [18:17] you made me feel warm and fuzzy though [18:18] people really do care! [18:26] Am I right in assuming if I had an S3 id, since incoming ec2 data is free and so is ec2 to s3, i could help out for very low cost on my ec2 machines? [18:30] warthurto: the "s3" we usually refer to here is the s3-api interface at the internet archive. that counts as external traffic from the POV of aws. [18:30] http://archive.org/help/abouts3.txt [18:30] Got it [18:32] Probably more bang for my buck with my fast home connection [18:38] warthurto: #fireplanet is our main channel [18:38] if you start a range, post it there (unless you have a working wiki account yourself) [18:39] Will do. I'll get things setup tonight [18:39] alard, how do i run multiple ranges at once, the scirpt will only login one at a time, it seems [18:39] for wabblo [18:39] *t [18:40] wabblot! [18:44] on second thought, let's not go to wabblot. it is a silly place. [18:44] heh http://en.wikipedia.org/wiki/Wabbit [19:12] NIGHT OF THE LEPUS [19:13] uploading my satliteview videos [19:13] also include metadata if you have it [19:23] I'm not gonna die! [19:27] bravo! [19:35] SketchCow: care to elaborate? [19:37] SketchCow: i got my 2002 videos of the screen savers are uploaded [19:37] SketchCow: http://archive.org/details/TechTV_TSS_2002_Full_Episodes [19:39] TSS Speed week 1 was only 24:36 [20:05] SketchCow: I'm reducing the repo size before resuming upload - creating svn dumps, hg bundles and git bundles helps a lot, and I guess that'll be faster than sending it all raw [20:21] http://www.pinballpetes.net/ [20:21] that fucking pink elephant [20:21] Understood, patrickg [20:21] I've been blasting away at the IUMA thing. [20:22] SketchCow: thinking of visiting the current Ann Arbor location of Pinball Pete's while you're there? [20:23] Ah fuck - is that there too? [20:23] there are two locations, one in AA and the other in East Lansing [20:23] What an awful page. [20:23] the AA one is at 1214 S University Ave, Ann Arbor, MI 48104 [20:24] here's the yelp: http://www.yelp.com/biz/pinball-petes-ann-arbor [20:24] I think the answer is no [20:24] Too much for one trip. [20:24] I'm going to Marvin's. [20:24] http://marvin3m.com/ [20:25] mmm [20:25] well-kept [20:25] yes [20:25] Marvin's a little wary about me. [20:26] the yelp for PP sounds like the current owners don't keep the pinball machines well maintained [20:26] http://archive.org/details/iuma-archiver [20:26] Oh look, a rescued an IUMA artist called "archiver" from a dead site [20:26] I R O N Y [20:27] shit. I've been just up the road from Marvin's a number of times, on orchard lake rd just north of where M10 ends into it (just north of 14 mile, iirc) [20:28] I assume you're not still in that area, right? [20:28] Otherwise you need to be getting to the Ann Arbor ting [20:28] i've never lived on that side of the state, but have been to the AA/Det area a number of times over the years [20:28] OK. [20:29] I had a bunch of people from the west side talking about visiting, but they all don't want to take that (straight, dependable) highway [20:29] I'll probably be going to the AADL talk, since I have to go to Monroe on Wednesday morning for work [20:29] Including the cutest little ex-girlfriend who broke up with me and married some gal [20:29] which highway is that? [20:30] 94. [20:30] ah [20:30] well, just a little north of marvins is Yotsuba: http://yotsuba-restaurant.com/ [22:20] FIRST IUMA TAKEDOWN! [22:21] Toronto band, guess someone didn't like his name's 2nd or 3rd hit on google being a band he played in in 2002 [22:21] That didn't take long [22:26] Yeah, that's fast. [22:33] Well, archive.org has a very specific thing with google. [22:33] If something's uploaded, it's in google's search engine within, I swear, 10 minutes or less. [22:33] I've seen it happen. [22:33] So I think this guy did a search on his name and KAPOW there's his music. [22:34] Hell of a bit of serendipity, given that it's been up for, what, 24 hours? [22:34] A week in some cases. [22:35] It's been about a week of work to get them all up. [22:45] The good news is I'm just about done adding stuff from it. [22:45] Yeah, the graph of google crawls is pretty insane [22:45] They actively have between 10 and 15 boxes crawling IA [22:45] at least [22:45] why don't they just use rsync like normal people [22:46] google? [22:46] yea [22:46] rsync for what? [22:46] They do web/http crawls? [22:46] :P [22:46] undersco2: Wait, dedicated, or 10-15 that just happen to be on IA usually? [22:47] the hostnames rotate [22:47] so I assume there's some sort of "check for new things" job queueing system that they run for sites like IA/reddit/etc [22:47] Ah, okay. I wasn't sure if they dedicated boxes just to keeping up [22:48] right, they crawl harder on things that change often [22:59] SketchCow: Uploaded: http://archive.org/details/Satliteview_BS_Tantei_Club_Yuki_ni_Kieta_Kako [23:15] http://archive.org/details/Satliteview_BS_Tantei_Club_Yuki_ni_Kieta_Kako [23:15] http://archive.org/catalog.php?history=1&identifier=Satliteview_BS_Tantei_Club_Yuki_ni_Kieta_Kako [23:36] http://archive.org/download/iuma-miscellaneous/Atomic_Duct_Tape_-_Touch_My_Monkey.mp3 [23:36] but he's covered in poo!