[00:23] lame, http://geociti.es/ is gone already [00:35] is there some sort of way for me to get a list of original and derives file sizes? [00:36] i want a list of my g4video-web collection so i can check which one is brokening without having to play them all [00:37] sure, 1sec [00:37] crap, 10k items [00:37] you've been busy [00:37] jesus [00:37] good work [00:39] well you can get all of the files.xml , like http://archive.org/download/g4tv.com-video15759/g4tv.com-video15759_files.xml [00:40] if you have a list of the item names it'll probably be straightforward to gin up a thing to get all the _files.xml [00:41] then you want to do an xpath search across all of those for file[source="derivative"]/size or something [00:41] that might be a bit tricky [00:44] xml_grep is a great tool for getting started on that [00:48] * chronomex nods [04:22] do we know who ripped/uploaded http://archive.org/details/cdrom-golden-orchard-10 ? [04:22] the reason I ask is because it's an incomplete rip [04:26] balrog: here's a secret ... http://archive.org/catalog.php?history=1&identifier=cdrom-golden-orchard-10 [04:27] that would be dopefishj [04:27] ok [04:27] :) [04:27] the person reporting the issue may likely be wrong though [04:27] I'm checking things [04:30] I copied it from one of those apple 2 ftps [04:35] aaaaaah [04:35] ok [04:39] those apple 2 cd-roms have weird filesystems so it's quite possibly a software issue [04:41] some details here https://archive.org/details/golden-grail-10 [04:45] we have v1.0, v1.1, and v1.2 of golden orchard plus the repackaged golden grail so I'm pretty sure all the stuff is there one way or another [12:16] Hai! [12:16] o/ [12:17] Needs at least 3 players [12:19] http://pyz.socialgamer.net/game.jsp room password is goodtime [12:19] :< [12:19] Come on guys! [12:19] Play some Cards Against Humanity! [12:20] \o/ [12:20] norbert79: hi! [12:20] Hi :) [12:20] How does this go? [12:20] Click what you think is the funniest response. [12:20] btw I don't know how to play :D [12:21] ok [12:21] let's try [12:21] yeah hhmmm [12:21] playing this at work may not be the most sensible thing D: [12:23] gotta love the side boob [12:25] lol [12:25] LOL GLaDOS [12:25] \o/ [12:26] I have the worst cards.. [12:27] happens... Stephen was witzh luck too [12:27] but fitted so well [12:28] The last one wasn't easy to decide upon :) [12:30] how do I do this [12:30] You pick one of the cards [12:30] if it's funny enough [12:30] then you get rewarded [12:30] you need to 'confirm card' too [12:30] ooooh [12:31] Oh lord [12:31] thats low of me ¬_¬ [12:31] You need to select [12:31] and confirm [12:34] [10:34:22 PM] Error: timeout timeout [12:34] I love my internet [12:34] lol [12:35] haha [12:35] And you had to select [12:41] Smiley: are you still there? [12:41] Guess not [12:41] btw you can also use the in-game chat too :) [12:42] Cameron_D: [12:50] AW COME ON [12:50] [13.50.20] Error: error Service Unavailable [12:50] lol [12:57] huh? [12:57] http://pyz.socialgamer.net/game.jsp -> GLADOS game -> password: goodtime [12:57] join [12:59] sorry,dropped out being busy. [12:59] g2g anyway [12:59] ok [12:59] o/ [13:04] [9:03:47 PM] Error: error [13:04] How helpful [13:05] GLaDOS: Sorry, was called away, brb [13:09] heh [13:09] I am sorry, it's my working hours [13:13] True [13:55] I uploaded my first file to IA as a normal user the other day [13:55] Just the warc.gz of a site [13:55] Is there anything else I am supposed to do? [13:57] let someone know :D [13:58] It'll go into archive team collection [13:58] yeah I put it in the wrong collection [13:58] you would, you can't access the archiveteam stuff. [13:59] Should I be making an uploading a cdx file as well [14:01] I did a few sites without making the cdx file [14:01] they'll survive without it from what I understand [14:01] I thought there was a process to make the cdx from the warc [14:11] No worries man, can always do a CDX index file afterwards [14:12] omf_: Poke underscor or SketchCow to put it into an AT collection and stuff [14:15] yeah I just wanted to check that before I start uploading other sites I pulled down [15:03] T-7min to SpaceX launch \o [15:04] the heck is SpaceX [15:04] http://www.ustream.tv/channel/nasa-media-channel [15:04] You don't know SpaceX? They're a private space industry company [15:04] one of the private space companies [15:04] Currently doing their second launch, which will dock at the International Space Station [15:13] ia should generate a cdx automatically [15:51] tracker goes down at the same time posterous bans boxes? suspicious [15:52] sep332: that was my doing. [15:52] Which is why you don't run 4000 downloaders! [15:53] lol [15:54] haha, did they all try and pull a username at the same time? [15:54] Possibly. [15:54] You should stagger the startups [15:55] Could've happened when closure's script cleared my assigned tasks out at :50 [15:55] instead of one pipeline with 200 threads start 4 with 50 each and delay them by 10 seconds or something [15:55] I'm not controlling the instances like that [15:55] soult [15:56] 's AMI has the number of threads in user data [15:56] Guess it's time to create a new AMI then [15:56] or maybe the run-pipeline can stagger the starting of concurrent threads [15:56] why would you do that :c [15:57] only by 500ms or something [15:57] just to stop there being 200 connections to the tracker at once [15:57] Not you, soult [15:57] The tracker usually can handle reboot time fine [15:57] Why don't we simply wrap all calls to tracker into a limtconcurrent? [15:57] oh [15:58] I mean, it handled 8k threads like a steel beam on crack [15:58] No, a polar bear on rocket boots [15:58] POINT IS, it survives. [15:58] it is back now, I think [15:59] It is [15:59] Watch the assigned items number [15:59] ..it died before they launched that time [16:06] is someone going to reset the punchfork tracker? [16:06] 649 items are stuck [16:06] alard ^ [16:17] also do you care if they are working? [16:18] I have a feeling I may have a pile of non working drives too. [16:18] I prefer working drives :D [16:18] These should all be working, [16:18] looks like it maybe around £25 [16:18] which isn't bad to be fair [16:18] Then, sure! I guess :) [16:18] :D [16:19] about 240 SEK ish [16:19] I need to find a box, measure it, weigh it, get a quot etc [16:19] I may do it this weekend, [16:19] why you want them anyway? [16:20] I guess I should ask how much the total capacity is :D [16:20] D: [16:20] Lord only knows? [16:22] also, got any left over drive cards? Like extender cards or such for SCSI/IDE/S-ATA? [16:32] on the tracker page, what does the icon next to sep332 and erazmus mean? there's a tooltip but it's moving too fast to hover [16:33] sep332: It means that the user is running it from the warrior vm [16:33] oh ok [16:33] the tooltip would just show the warrior version [19:57] so my wifi when out [20:16] godane: if we sent you a long ethernet cable, would it help? [20:19] i don't want long ethernet [20:19] my dad my freak out about it [20:19] *may freak [20:20] ok, no probs. [20:43] soooo [20:46] so guys i have this other side project going [20:47] go on... [20:47] the idea of a full source dvd that can recompile itself [20:48] ooo i think you mentioned this before [20:48] yes i have [20:48] its just i think i need other people to help with this distro [21:02] godane, is it linux base [21:02] based [21:02] yes [21:03] i tried using slitaz [21:03] a guy help with the compiling tools cause there server was a mess [21:03] everything was installed in there chroot [21:04] i figure this distro would have some sort of archiveteam feel to it [21:04] since it will be able to recompile the distro offline [21:09] i heard tcc is so fast, it can compile a kernel at boot time [21:10] here's the demo http://bellard.org/tcc/tccboot.html [21:23] o_O [21:23] I don't know if joking or not [21:23] :O [21:26] Smiley: me, tcc, or godane? lol [21:27] tcc, but wow [21:27] i see it's not joking, thts impressive. [21:28] The Screen Savers: Suzanne Vega Slams File Sharing: https://archive.org/details/g4tv.com-video25735 [21:28] bellard is a quiet, friendly, genius [21:28] Right, I'm watching the URL count on teh warrior [21:28] maxing out at 700+Kbs/ now [21:29] i didn't think that interview was on g4tv.com [21:29] Finished PrepareDirectories for Item user-farrytale [21:29] Received item 'user-farrytale' from tracker [21:29] Starting GenerateSeedURL for Item user-farrytale [21:29] Starting GetItemFromTracker for Item [21:29] Starting PrepareDirectories for Item user-farrytale [21:29] Finished GenerateSeedURL for Item user-farrytale [21:29] Starting WgetDownload for Item user-farrytale - Downloaded: 17960 URLs. [21:29] best one [21:29] wanna boot a full linux kernel in js on your browser? he's got that http://bellard.org/jslinux/ [21:29] Only 212 likes, that shouldn't be that many? http://punchfork.com/farrytale [21:31] alard: is # of URLs supposed to match up with likes? [21:32] I'd expect a linear relation. [21:32] user-kurdyla has 1,183 likes and so far 20k+ URLs [21:35] 17960 urls / 212 recipes = 85 urls per recipe. [21:36] Starting WgetDownload for Item user-becme01 - Downloaded: 15610 URLs. [21:36] tarting WgetDownload for Item user-zibbyxo - Downloaded: 13760 URLs. [21:36] tarting WgetDownload for Item user-TLPaniciCorujo - Downloaded: 13180 URLs [21:38] We'll see. [21:38] hmmm [21:38] this Jim beam black == nice [22:39] PLEASE HELP [22:40] g4tv.com is killing me [22:40] i don't know how to get all 71 images here: http://www.g4tv.com/images/4923/comic-con-2012-new-york-comic-con-2012-cosplay-pictures/83909/ [22:40] tarting WgetDownload for Item user-becme01 - Downloaded: 26560 URLs. [22:41] http://images.g4tv.com/rimg_606x0/ImageDb3/313600_l/.jpg [22:41] they are numbered godane [22:41] go from 599-however much you find? [22:42] for x in 1..500 do wget http://images.g4tv.com/rimg_606x0/ImageDb3/313$x_l/.jpg done [22:43] just know if it 404 si goes to this: http://cache.g4tv.com/rimg_606x0/logo.jpg [22:43] urgh that needs fixing butr my brain is dead. [22:49] alard: ? [22:49] we have a failure. [22:50] http://pastebin.com/rXvRgixt [22:57] godane i assume you have other links like that you need to grab all the images? [22:58] i'm doing the first 10000 [22:59] do you want to save the entire page with all of the images or just dont care and want all the images seperately? [22:59] i'm just grabing the images [22:59] there is java on the pages [23:00] so i could only get the first 15 images of a collection [23:04] - Downloaded: 34050 URLs. [23:04] do wget http://images.g4tv.com/rimg_606x0/ImageDb3/313"$i"_l/.jpg [23:04] done [23:04] for i in {1..10} [23:09] i'm using seq but its the same [23:09] i use wget -x -i index.txt --warc-file=$website-images-$start-to-$end-$(date +%Y%m%d) --warc-cdx -E -o wget-images-$start-to-$end.log [23:37] I see from the scrollback folks were talking about the genius that is Fabrice Bellard [23:42] he's also done FFmpeg, and QEMU from what I remember [23:42] ffmpeg is an abotion [23:43] abortion [23:43] try working on the code some time [23:43] he started both of them [23:43] Fabrice keeps his own fork cause how screwball shit is [23:43] there's the libav fork if you're unhappy with FFmpeg [23:43] and then there's FFmbc that's focused on broadcasting needs [23:44] oh they are way worse. Trying to steal things and then play the blame game. The problem is not the work people have done. It is the non-separation of IP infringing material. Now they have worked on separating it but it is still a ways off [23:44] ffmbc is what I am talking about as the other fork [23:44] the only problem it has was changing the command flag structure [23:45] so it shipped by default partially incompatible [23:45] Fabrice is the only good part of the whole process [23:46] I followed development quite extensively for a while, and I haven't seen his name on either list in years [23:47] so I'm glad he's working on it, because he does amazing stuff, but I don't know where he's contributing [23:47] mainly on ffmbc [23:47] and consulting [23:47] there is no way to account for all the consulting gigs with ffmpeg he does [23:47] that is also important. It gets the word out about non-MS windows solutions [23:48] that would make a lot of sense since I heard he was working at a French telecom [23:48] He used to have that info on one of his web pages [23:48] I'm rather curious about the IP infringing material bit [23:48] well that is easy [23:48] mp3 [23:48] x264 [23:48] really? [23:48] yes [23:49] really [23:49] mp3 is still patented [23:49] and so is x264 [23:49] x264 has a few dozen patents in it [23:49] this is why no linux distro will ship a full ffmpeg by default [23:50] but you can easily disable any part you don't want at compile time [23:50] they will get sued. I talked about this at the OpenVideoConference and the AlliedMediaConference [23:50] no they won't [23:50] they only do source distribution [23:50] and x264 has a lot of commercial users, some of whom are fairly large names in their field [23:51] you mean like google who owns all those patents now [23:51] they released vp8 to start getting us away from patents [23:52] which is good- and why you should support Xiph in their efforts to develop Daala, their next gen video codec to go along with Opus [23:52] after the crash and burn that was theora I do not have much faith in them [23:53] theora is actually a big deal in some unexpected markets [23:53] https://sphotos-b.xx.fbcdn.net/hphotos-ash3/559815_480818148640108_981970963_n.jpg [23:53] They finally got a decent hardware decoder then? [23:53] the batcave.... [23:53] I've heard it's used a lot in games for cut-scenes- it's knocked Bink & Smacker out of that niche [23:54] yeah but theora was design for internet streaming to replace x264/h264 and it didn't really get in there [23:54] What do we all use now [23:54] h264 [23:55] youtube, flash video, hulu, netflix, amazon [23:55] It sucks for us as consumers [23:55] I want something to break out and take [23:55] over [23:55] nite [23:56] But lets be positive. Flac is pretty fucking awesome [23:56] and you can buy whole albums in it [23:56] even 24bit [23:56] unlikely- just be thrilled that FLAC and Opus are available, and are best-in-class or nearly so [23:56] and Vorbis has kicked the crap out of MP3 for years now [23:57] and the very finest H264 encoder you get is open-source (GPL even) [23:57] the Avid one is still better. That is what they use for movies [23:57] not really [23:57] Really. Lets see the proof then [23:57] they've used x264 for Blu-rays- the people at Criterion Collection used it [23:58] yeah I read about that [23:58] A few films from one studio does not make it the best