[02:32] Blorp
[02:34] Wow, they couldn't be bothered to really explain that torrent, huh.
[02:34] I can upload those items to archive.org
[02:34] I wish we had a proper documentation for it, though.
[02:35] http://lwn.net/Articles/285366/
[02:35] Found
[02:35] That was fast
[02:42] Installing it, emijrp.
[02:43] Or memories of Emihrp.
[03:06] http://twitter.com/#!/archiveteam/status/97865189625565184
[03:07] http://www.archive.org/details/git-history-of-linux
[03:10] SO MANY FRIENDSTER ARCHIVES
[03:10] oh
[03:10] * db48x facepalms
[03:10] 8.4T .
[03:10] root@teamarchive-0:/3/FRIENDSTER# du -sh .
[03:11] SketchCow: I goofed while uploading this google video to you
[03:11] Always great to hear
[03:11] I uploaded it all, rather than just the toupload directory :P
[03:11] Over it, doubles will be found.
[03:12] good news though, only 1038 files left to download
[03:13] 116G friendster.2500001-2600000.tar
[03:13] 148G friendster.14600000-14699999.tar
[03:13] 156G friendster.13500000-13599999.tar
[03:13] 158G friendster.13400000-13499999.tar
[03:13] 3.9G friendster.250001-260000.tar
[03:13] 3.9G friendster.320001-330000.tar
[03:13] 35G friendster.600001-700000.tar
[03:13] 24G friendster.9000000-9035999.tar
[03:13] Another set I'm uploading. AW YEAAAHHH
[03:13] cool
[03:14] damn
[03:14] pretty sizable
[03:15] I'm going to have to make a friendster sub-collection.
[03:15] Only way around it.
[03:15] I was allowed to put these in 100gb items.
[03:15] Wow
[03:15] But at 10tb or whatever, that's 100 items.
[03:16] haha
[03:16] Wait, so that's ALL of friendster?
[03:16] (that we have)
[03:17] Not sure, still finding out.
[03:18] I've got 1.5T to upload, but half of it isn't compressed
[03:20] 1.2TB, I mean
[03:21] so is there a limit on the number of items per collection or something?
[03:21] Not really.
[03:21] Just friendster, being a special case, needs treatment.
[03:22] And I personally believe, and therefore win, that the archiveteam collection isn't served well by "precious collection of website", "repository of Linux", "collected tapes of usenet", and then 100+ items of "Friendster"
[03:23] oooh
[03:24] I thought you meant that you were going to have to split Friendster into multiple sub-projects
[03:24] No.
[03:24] I just mean I make it into a collection.
[03:24] Then aim that collection at the other collection.
[03:24] cool
[03:24] So it's listed as being part of it.
[03:25] The biggest pain is renaming friendster archives to be a standard format.
[03:26] By the way, I tweeted that Timex thing.
[03:26] And it's gotten me, for reasons I completely don't understand, 100 new followers.
[03:30] Spent an hour or two researching a soy sauce company for archive team presentation.
[03:33] heh
[03:34] I'd just like to complain that the actual archiving of 3.5'' floppies is quite simple compared to just viewing what's on them before you image them (at least on Linux-based systems)
[03:39] I'm confused by this complaint.
[03:40] I continue to think Laurence Lessig is a bit of an idiot.
[03:40] I hope to meet him, chat one day.
[03:41] really?
[03:42] Yeah, I consider putting him in front of the supreme court to argue for copyright set us back by a century.
[03:42] That english was horrible, sorry.
[03:43] hrm
[03:45] This position isn't popular.
[03:45] I think what dashcloud meant was that the act of storing a floppy or copying its contents for archiving is much easier than viewing what is on it. Possibly because it's slow?
[03:45] or maybe because I'm slow :)
[03:46] SketchCow: what would you like to have seen instead?
[03:46] no- I spent many hours trying to get the floppy drive to auto mount across several distros, and failed until I stumbled onto the magic of mtools which just works
[03:49] sometimes you have to show fd0 a little love.
he can be very demanding
[03:49] fd0 is a whiny little bitch
[03:49] We all know it, I'm just saying what we're all thinking
[03:49] D-:
[03:49] heh
[03:51] It's just scary how long transfers take between the flophouse/blindtiger machines and the new home for them.
[03:52] Because they're going at 40MB/s
[03:52] And they're STILL taking hours
[03:52] That's just a lot of data we got
[03:52] Thank god nothing major's gone down for a while.
[03:52] do they use regular interconnects there, or have they moved into cluster & supercomputer territory?
[03:57] Not sure, I don't delve into it.
[03:58] I know they use instances, when we rebooted one machine it was back in 45 seconds
[03:58] A lot of it is KVM
[03:58] At least from what I've been able to discern and poke around
[03:59] Anything that's iwxxxxx is a virtual machine (and also a "worker" aka a deriver), anything that's iaxxxxx is a storage node/petabox
[03:59] Most (if not all) petaboxes have a worker VM running on them
[04:19] Grrr
[04:53] lol: http://quotes.burntelectrons.org/5902
[05:01] Spectacularly bad mood today
[05:01] I think this is the diet.
[05:02] My weight is being demolished.
[05:10] SketchCow: But you're not fat
[05:10] I'm pretty fat
[05:10] I'm not that sort of obesity where basic functionality isn't happening.
[05:10] I'm just outside the recommended weight for my height.
[05:11] I was up to 235, when I should be 180.
[05:11] Now I'm 220, I hope to get to 195.
[05:12] 180 would be unsafe.
[05:22] SketchCow: I'm 220, was 240
[05:22] just fyi
[05:22] throwing it out there
[05:22] I was 175 in high school, & was like a beanpole
[05:26] My highest was 255.
[05:26] But that 1997.
[05:26] That WAS 1997.
[05:26] My lowest since I was 16 was 205.
[05:26] So 195's a challenge.
[05:28] one of the easiest changes I made was not getting dressing on my salad
[05:28] I had no idea how much of the fresh flavor that I was missing out on
[05:28] Mine is not having sugar
[05:28] And raping virgins
[05:28] hrm
[05:28] I'll have to give that a shot
[05:28] Raping virgins solves everything, I learned this
[05:28] EVERYTHING
[05:28] shot in the fucking mouth lol
[05:28] http://www.pouet.net/prod.php?which=57406
[05:29] Someone ported nyan cat to commodore
[05:29] lol
[05:29] You know, in case you think there's a God
[05:30] I can't help but wonder, is that listed on their resume?
[05:30] Well we like what we see here... oh wait, what's this, you ported Nyan cat to Commodore? Can you start Tuesday?
[05:30] New to demoscene, I see
[05:31] Not necessarily new... I've seen some of the contestants at Notacon
[05:31] but I've seen very little
[05:32] how long have you been involved with Demo scene?
[05:33] Fan/Watcher, 1987
[05:33] Participant, 1995
[05:51] http://www.deviantart.com/download/244380299/nyan_cat_machine___papercraft_by_ddi7i4d-d41hx1n.gif
[05:51] The archiveteam server
[06:39] SketchCow: lol
[06:48] It's very ecologically sound.
[06:52] 100% compostable
[07:02] ok, so back to archiving this thing
[07:08] Which thing?
[07:13] Google Friends Newsletter
[07:16] ah, that explains that
[07:16] not all the warc-related command line options mentioned on the wiki page have been implemented
[07:34] but it still doesn't follow the links to the actual articles
[16:10] hrm
[16:10] there is no alard
[16:11] would someone ask him where tmpdir.h is, when he shows up?
[17:17] So what's the "something CRAZY" that's getting uploaded?
[17:18] Twaud.io
[17:19] Oooh, nice
[17:19] Twaudio's fuckin' crazy.
[17:20] In what sense(s)?
[17:21] Well, it's a collection of TRULY random mp3s
[17:21] !!
[17:21] Across 3 years.
[17:21] How big is it?
[17:21] 60gb, although I hope to get it down to 55 or 50.
[17:21] Unpacking it a little to see how the mp3s are.
[17:21] I need to download this. That's a really good sample set!
[17:21] "truly random mp3s"
[17:21] There's 21,000 mp3s
[17:22] (Probably)
[17:22] 60 gigabytes of line noise!
[17:22] Oh...only 21000?
[17:22] ha ha
[17:22] "only..."
[17:22] Hmm, probably still worthwhile.
[17:22] You go ahead and listen to them
[17:22] And then go, 3 months later "OK, some were good"
[17:22] someone catted /dev/hda to /dev/audio
[17:22] I'm not in it for the listening, in this case
[17:22] So you want mp3s but don't want to listen to them.
[17:22] actually, I wonder how well that audio file would compress
[17:23] I have some paintings for sale, you can't see them
[17:23] he just admires the waveforms
[17:23] I mean, yeah, maybe I'd listen to some, but I'm more interested in assessing the id3 coverage.
[17:23] bbot_: Problem is you're compressing random noise against a lossy format designed for human hearing.
[17:23] SketchCow: well, depends on what you had on /dev/hda
[17:23] I have no idea what the id3 tags are.
[17:23] 21,000 is a LOT SIR
[17:24] the few mp3s of that I've heard are fairly structured
[17:24] Yeah, it's about half of my current data set.
;)
[17:25] http://everything2.com/title/catting+weird+things+to+%252Fdev%252Faudio
[17:25] shockingly, the links in the 11 year old post are dead
[17:25] fuck that shit
[17:25] think I have the files somewhere
[17:26] boo http://everything2.com/robots.txt
[17:27] DFJustin: I downloaded everything on their site, so they changed their robots.txt to be insanely restrictive
[17:27] hahahah
[17:27] Careful man
[17:27] You'll go to federal prison
[17:27] Next to Aaron Swartz
[17:27] First Degree Downloading
[17:27] Pre-meditated
[17:27] I can't believe he's being prosecuted for that
[17:28] He's not, I'm being pithy
[17:28] aggravated wgetting
[17:28] it's a fbi case, I wgetted across state lines
[17:28] He's being prosecuted for breaking and enterting
[17:28] entering
[17:28] DFJustin: http://archiveteam.org/index.php?title=Everything2
[17:28] Oh, did they drop charges related to downloading, or was that just media hype?
[17:28] It's media misreporting. Somewhere, I have the full story.
[17:29] Probably just media and idiotchatter
[17:29] Let me quickly look.
[17:29] hehehehehe
[17:29] Be informed.
[17:29] SketchCow: Arstechnica summarizes it pretty well, I think
[17:29] the downside though is that the wayback machine won't be following links to files
[17:29] That everything2 archiveteam page uses a bootstrap tracker that's been dead for months
[17:30] the torrent link on it
[17:30] denis.stalker.h3q.com has been dead for a long ass time now
[17:30] http://mediafreedom.org/2011/07/larry-lessig-responds-says-swartzs-alleged-actions-crossed-ethical-line/
[17:30] Lessig throws Swartz under the bus
[17:31] http://chronicle.com/article/Rogue-Downloaders-Arrest/128439/
[17:31] that's the article I suggest.
[17:31] Awesome, thank you.
[17:31] Much better than the arstechnica
[17:33] http://batcave.textfiles.com/TWAUDIOTEST
[17:33] That link won't stick around forever.
[17:34] http://batcave.textfiles.com/TWAUDIOTEST/WbJ.mp3
[17:34] That's the sound of Aaron's Laptop quietly downloading JSTOR
[17:34] I got a half decent first link
[17:34] http://batcave.textfiles.com/TWAUDIOTEST/qFTz.mp3
[17:35] * DFJustin headbangs
[17:36] I will tell you not a lot of these have ID3 tags.
[17:36] I just ran a test.
[17:37] OK, if something has ANY id3 tags, it has a .txt now.
[17:37] Remember, this is a small number, 5gb out of the 60gb
[17:37] Oh dear...
[17:37] (in this directory)
[17:38] I just pulled a random...I guess it's a German morning radio programme that was pretty sparse, yeah.
[17:39] One moment, messed up.
[17:39] One frame; TCON with the usual id3v1 default of "Blues"
[17:40] you can just use fingerprinting tools for the rest
[17:40] I guess a lot is probably original though
[17:42] It might still be an interesting data set once I get my scripts involved, though. If nothing else, I may finally have the right conditions to determine which encoders add TSSE by default.
[17:42] Should run this through acoustiz and/or Echoprint :)
[17:43] OK, just re-ran it.
[17:43] Now any .txt file has actual data, if any
[17:44] ersi: Acoustiz? That's a new one to me.
[17:44] it is such a mess that id3 used 0 = blues or whatever caused that
[17:44] so many blues mp3s...
[17:46] Spirit_: Yeah, well, id3 is just a damn mess all around. :/
[17:46] Wyatt: Maybe that's not the name of the service.. I know it's open source acoustic fingerprinting though
[17:46] http://batcave.textfiles.com/TWAUDIOTEST/tsc.mp3
[17:46] i told jamendo.com about it years ago and i think they still tag 0 for "no tag"...
[17:47] And there's the sound of the archiveteam channel at night
[17:47] When I come around and tuck all you children in your beds
[17:48] ha that's from Sen to Chihiro no Kamikakushi
[17:48] That was a good film.
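Editor's note on the "0 = blues" remark: an ID3v1 tag is a fixed 128-byte trailer whose last byte is a genre index, and index 0 is "Blues", so a tagger that leaves that byte zeroed produces a "Blues" file. The Python sketch below only illustrates that layout. It is not the script used in the channel, and the names `parse_id3v1` and `genre_is_suspect` (and the heuristic in the latter) are hypothetical.

```python
from typing import Optional

ID3V1_SIZE = 128  # an ID3v1 tag occupies the last 128 bytes of the file


def parse_id3v1(data: bytes) -> Optional[dict]:
    """Return the ID3v1 fields found at the end of `data`, or None if absent."""
    if len(data) < ID3V1_SIZE:
        return None
    tag = data[-ID3V1_SIZE:]
    if tag[:3] != b"TAG":
        return None

    def text(raw: bytes) -> str:
        # Fields are fixed width, padded with NUL bytes (sometimes spaces).
        return raw.split(b"\x00", 1)[0].decode("latin-1").strip()

    return {
        "title": text(tag[3:33]),
        "artist": text(tag[33:63]),
        "album": text(tag[63:93]),
        "year": text(tag[93:97]),
        "comment": text(tag[97:127]),
        "genre": tag[127],  # single byte; 0 is "Blues"
    }


def genre_is_suspect(fields: dict) -> bool:
    # Hypothetical heuristic: genre 0 on an otherwise empty tag is more
    # likely an unset byte than an actual Blues track.
    return fields["genre"] == 0 and not (fields["title"] or fields["artist"])
```

For a real file, something like `parse_id3v1(open(path, "rb").read())` would do for a spot check. Reading only the final 128 bytes would be kinder to a 60gb collection.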
[17:49] http://batcave.textfiles.com/TWAUDIOTEST/tvZ.mp3
[17:49] Wow, these people are having a lot of trouble carrying their boxes during that piano concert
[17:53] heh
[17:56] a lot of these text files just say "No ID3 tag."
[17:56] hatsune miku spotted
[17:58] What are you using to export the tags?
[17:58] lol so many of these are in japanese
[17:58] Well, bear in mind these are a handful of folks who used this thing.
[17:59] So if motherfucker loves his J-Pop, motherfucker gets his J-Pop
[17:59] * SketchCow was just blasting the terrifying "Dream Island Contemplation Park"
[18:00] Without animation: http://www.youtube.com/watch?v=ma0pZLrXDHk
[18:00] With animation: http://www.youtube.com/watch?v=-anabfAg06U
[18:01] It is really a shame that guy died.
[18:01] Luckily, he gave the instructions to finish the last movie he was working on.
[18:02] Weird, id3.org may be hopeless, but it's not usually down.
[18:03] this is a decent rendition of 月時計 ~ ルナ・ダイアル http://batcave.textfiles.com/TWAUDIOTEST/r26y.mp3
[18:03] Aranje: aw dang
[18:04] SketchCow: heh, nice -- a lot of Susumu Hirasawa's work is best played loud
[18:04] bbot_, Sup?
[18:04] I'd totally make a live-action version of paranoia agent
[18:04] Rename the opening song Battery Park Contemplation and use 9/11 imagery
[18:04] Aranje: baking a new torrent file now
[18:04] any suggestions for trackers?
[18:05] Oh, awesome. I totally didn't know what your aw dang was about, but yeah thanks!
[18:05] openbittorrent.com is popular anymore (udp only though), publicbt.org runs both http and udp still
[18:05] I run one on a server if you want to throw that in there too for shits and giggles
[18:06] sure
[18:06] DFJustin: Sounds like it was recorded from the MIDI? Hard to say what patch set is in use here, but it sounds different from Microsoft's gm.sf2
[18:06] Aranje: openbittorrent.com serves at http/tcp as well
[18:06] Does it?
[18:06] It's been rejecting mine for ages
[18:06] They don't announce it publicly, but yes - they answer and give out peers there
[18:06] Huh. Neat.
[18:06] it's really flaky though
[18:07] I might confuse openbittorrent and that other one >_> I know one of them has HTTP/TCP even though they say they don't
[18:07] bbot_: udp://explodie.org:6969/announce or http://explodie.org:6969/announce is mine. It's never down.
[18:07] yes
[18:07] publicbt.org has both
[18:07] ah, d'oh :)
[18:07] obtt does not, it only runs udp
[18:07] :P
[18:08] publicbt.org doesn't respond from either my home IP or bbot.org's
[18:08] does it ignore regular web traffic?
[18:08] Nope, usually has a site
[18:08] a clone of openbittorrent's, even
[18:08] diff colors
[18:08] yeah looks down from here too
[18:09] damn
[18:09] I have only a single page for my tracker, lol. It's the same software app that openbittorrent and publicbt run
[18:09] http://explodie.org/opentracker.html is my infopage
[18:11] One day I intend to take it pro
[18:11] Once I'm out of college and have money to dunk into amazing things that have no monetary returns
[18:14] http://bbot.org/everything2-2M-v2.tbz.torrent now, with more trackers
[18:27] <3
[18:28] * Aranje pulls
[19:17] Gaaaa, still compressing.
[19:23] I want to make a joke about needing a compression box, but it wouldn't be funny
[19:24] heh winrar is having fun times trying to open this everything2 archive
[19:25] it's been several minutes and it's only at 400000 files
[19:26] doing it on a netbook probably doesn't help
[20:29] OK, the twaudio test directory's going away now.
[20:29] Unless someone was using it.
[20:32] RM-Gun..FIRE!
[20:35] DFJustin: it took something like 40 hours to compress
[20:35] two million files, broham
[20:48] -rw-r--r-- 1 jscott users 59067074560 Jun 19 17:43 twaud.io.tar
[20:48] -rw-r--r-- 1 jscott users 53199327232 Aug 1 20:48 twaud.io.tar.bz2
[20:48] Not a lot of savings. It's still compressing.
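Editor's note: to put a number on "not a lot of savings", the two byte counts in the listing can be compared directly. This is a rough Python sketch, not anything run in the channel; the function name is made up for illustration. Because the .bz2 was still being written when it was listed, the result is only a snapshot, and it can be read as an upper bound on the eventual savings, since the output file can only grow.

```python
def savings_fraction(original_bytes: int, compressed_bytes: int) -> float:
    """Fraction of the original size saved by compression (0.0 means none)."""
    if original_bytes <= 0:
        raise ValueError("original size must be positive")
    return 1.0 - compressed_bytes / original_bytes


# Byte counts copied from the ls -l listing in the log.
TAR_BYTES = 59067074560
BZ2_BYTES_SO_FAR = 53199327232  # still growing at the time of the listing

if __name__ == "__main__":
    saved = savings_fraction(TAR_BYTES, BZ2_BYTES_SO_FAR)
    print(f"saved at most {saved:.1%} so far")
```

A small gain is what one would expect here: MP3 audio is already compressed, so a general-purpose compressor such as bzip2 has little redundancy left to remove.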
[20:49] quick thing here, um how in Cthulhu's name are you guys finding names for these random mp2s?
[20:50] mp3s?
[20:53] bsmith093: From what I gather, we're not.
[20:53] It's just sort of as they came or something.
[21:02] some of them I just happened to recognize because I'm a huge weeaboo
[21:17] does this involve the internet? http://www.imdb.com/title/tt1740707/
[21:19] looks like a url to me
[21:19] the movie listed on the linked page
[21:25] I'm confused by the random mp3 question.
[21:39] All gone.
[22:36] http://danwebb.net/2009/5/20/massive-robot-launches-twaud-io
[22:49] http://www.archive.org/details/twaudio-2009-2011&reCache=1
[22:49] ....and there we go.
[23:19] http://i.imgur.com/GVdgM.jpg
[23:19] I love tmux
[23:20] (Also, I know Nemo_bis will love seeing all those uploads)
[23:20] why? how does it beat screen for you?
[23:24] tmux is pretty nice
[23:25] tsp: paning
[23:26] That means you can have more than one window showing on the terminal?
[23:27] yes
[23:28] and of arbitrary dimensions and position