#archiveteam-bs 2013-03-01,Fri

Time Nickname Message
00:23 🔗 chronomex lame, http://geociti.es/ is gone already
00:35 🔗 godane is there some sort of way for me to get a list of original and derivative file sizes?
00:36 🔗 godane i want a list of my g4video-web collection so i can check which ones are broken without having to play them all
00:37 🔗 chronomex sure, 1sec
00:37 🔗 chronomex crap, 10k items
00:37 🔗 chronomex you've been busy
00:37 🔗 chronomex jesus
00:37 🔗 chronomex good work
00:39 🔗 chronomex well you can get all of the files.xml , like http://archive.org/download/g4tv.com-video15759/g4tv.com-video15759_files.xml
00:40 🔗 chronomex if you have a list of the item names it'll probably be straightforward to gin up a thing to get all the _files.xml
00:41 🔗 chronomex then you want to do an xpath search across all of those for file[source="derivative"]/size or something
00:41 🔗 chronomex that might be a bit tricky
00:44 🔗 omf_ xml_grep is a great tool for getting started on that
00:48 🔗 * chronomex nods
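
[Editor's sketch, not part of the log] The approach chronomex describes (fetch each item's _files.xml, then pull out the sizes per file) could look roughly like the Python below. It assumes each <file> element carries its name and source ("original" or "derivative") as attributes and its size in a <size> child element; if the real layout differs, the lookups need adjusting. The helper names and the 1024-byte threshold are made up for illustration, and the URL pattern is copied from the example link above.

```python
import xml.etree.ElementTree as ET


def files_xml_url(identifier):
    """Build the _files.xml URL for an item, following the example link in the log."""
    return f"http://archive.org/download/{identifier}/{identifier}_files.xml"


def file_sizes(xml_text):
    """Return (name, source, size) tuples for every <file> in a _files.xml document.

    Assumption: name and source are attributes, size is a child element.
    A missing or non-numeric size is reported as None.
    """
    root = ET.fromstring(xml_text)
    rows = []
    for entry in root.iter("file"):
        size_text = entry.findtext("size")
        size = int(size_text) if size_text and size_text.isdigit() else None
        rows.append((entry.get("name"), entry.get("source"), size))
    return rows


def suspicious(rows, min_bytes=1024):
    """Names of files whose size is missing or below a (hypothetical) threshold."""
    return [name for name, _source, size in rows if size is None or size < min_bytes]
```

With a list of item names, one would download files_xml_url(name) for each item and feed the text to file_sizes(); the download step is left out here on purpose.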
04:22 🔗 balrog do we know who ripped/uploaded http://archive.org/details/cdrom-golden-orchard-10 ?
04:22 🔗 balrog the reason I ask is because it's an incomplete rip
04:26 🔗 chronomex balrog: here's a secret ... http://archive.org/catalog.php?history=1&identifier=cdrom-golden-orchard-10
04:27 🔗 chronomex that would be dopefishj
04:27 🔗 balrog ok
04:27 🔗 chronomex :)
04:27 🔗 balrog the person reporting the issue may likely be wrong though
04:27 🔗 balrog I'm checking things
04:30 🔗 DFJustin I copied it from one of those apple 2 ftps
04:35 🔗 balrog aaaaaah
04:35 🔗 balrog ok
04:39 🔗 DFJustin those apple 2 cd-roms have weird filesystems so it's quite possibly a software issue
04:41 🔗 DFJustin some details here https://archive.org/details/golden-grail-10
04:45 🔗 DFJustin we have v1.0, v1.1, and v1.2 of golden orchard plus the repackaged golden grail so I'm pretty sure all the stuff is there one way or another
12:16 🔗 GLaDOS Hai!
12:16 🔗 Smiley o/
12:17 🔗 GLaDOS Needs at least 3 players
12:19 🔗 GLaDOS http://pyz.socialgamer.net/game.jsp room password is goodtime
12:19 🔗 Smiley :<
12:19 🔗 GLaDOS Come on guys!
12:19 🔗 GLaDOS Play some Cards Against Humanity!
12:20 🔗 Smiley \o/
12:20 🔗 GLaDOS norbert79: hi!
12:20 🔗 norbert79 Hi :)
12:20 🔗 norbert79 How does this go?
12:20 🔗 GLaDOS Click what you think is the funniest response.
12:20 🔗 Smiley btw I don't know how to play :D
12:21 🔗 norbert79 ok
12:21 🔗 norbert79 let's try
12:21 🔗 Smiley yeah hhmmm
12:21 🔗 Smiley playing this at work may not be the most sensible thing D:
12:23 🔗 Smiley gotta love the side boob
12:25 🔗 Smiley lol
12:25 🔗 Smiley LOL GLaDOS
12:25 🔗 GLaDOS \o/
12:26 🔗 GLaDOS I have the worst cards..
12:27 🔗 norbert79 happens... Stephen was with luck too
12:27 🔗 norbert79 but fitted so well
12:28 🔗 norbert79 The last one wasn't easy to decide upon :)
12:30 🔗 Cameron_D how do I do this
12:30 🔗 norbert79 You pick one of the cards
12:30 🔗 norbert79 if it's funny enough
12:30 🔗 norbert79 then you get rewarded
12:30 🔗 norbert79 you need to 'confirm card' too
12:30 🔗 Cameron_D ooooh
12:31 🔗 Smiley Oh lord
12:31 🔗 Smiley thats low of me ¬_¬
12:31 🔗 norbert79 You need to select
12:31 🔗 norbert79 and confirm
12:34 🔗 Cameron_D [10:34:22 PM] Error: timeout timeout
12:34 🔗 Cameron_D I love my internet
12:34 🔗 Smiley lol
12:35 🔗 norbert79 haha
12:35 🔗 norbert79 And you had to select
12:41 🔗 GLaDOS Smiley: are you still there?
12:41 🔗 norbert79 Guess not
12:41 🔗 norbert79 btw you can also use the in-game chat too :)
12:42 🔗 GLaDOS Cameron_D:
12:50 🔗 norbert79 AW COME ON
12:50 🔗 norbert79 [13.50.20] Error: error Service Unavailable
12:50 🔗 norbert79 lol
12:57 🔗 soultcer huh?
12:57 🔗 norbert79 http://pyz.socialgamer.net/game.jsp -> GLADOS game -> password: goodtime
12:57 🔗 norbert79 join
12:59 🔗 Smiley sorry,dropped out being busy.
12:59 🔗 Smiley g2g anyway
12:59 🔗 norbert79 ok
12:59 🔗 GLaDOS o/
13:04 🔗 GLaDOS [9:03:47 PM] Error: error
13:04 🔗 GLaDOS How helpful
13:05 🔗 norbert79 GLaDOS: Sorry, was called away, brb
13:09 🔗 GLaDOS heh
13:09 🔗 norbert79 I am sorry, it's my working hours
13:13 🔗 GLaDOS True
13:55 🔗 omf_ I uploaded my first file to IA as a normal user the other day
13:55 🔗 omf_ Just the warc.gz of a site
13:55 🔗 omf_ Is there anything else I am supposed to do?
13:57 🔗 Smiley let someone know :D
13:58 🔗 Smiley It'll go into archive team collection
13:58 🔗 omf_ yeah I put it in the wrong collection
13:58 🔗 Smiley you would, you can't access the archiveteam stuff.
13:59 🔗 omf_ Should I be making and uploading a cdx file as well?
14:01 🔗 omf_ I did a few sites without making the cdx file
14:01 🔗 Smiley they'll survive without it from what I understand
14:01 🔗 omf_ I thought there was a process to make the cdx from the warc
14:11 🔗 ersi No worries man, can always do a CDX index file afterwards
14:12 🔗 ersi omf_: Poke underscor or SketchCow to put it into an AT collection and stuff
14:15 🔗 omf_ yeah I just wanted to check that before I start uploading other sites I pulled down
15:03 🔗 ersi T-7min to SpaceX launch \o
15:04 🔗 BlueMax the heck is SpaceX
15:04 🔗 ersi http://www.ustream.tv/channel/nasa-media-channel
15:04 🔗 ersi You don't know SpaceX? They're a private space industry company
15:04 🔗 Coderjoe one of the private space companies
15:04 🔗 ersi Currently doing their second launch, which will dock at the International Space Station
15:13 🔗 DFJustin ia should generate a cdx automatically
15:51 🔗 sep332 tracker goes down at the same time posterous bans boxes? suspicious
15:52 🔗 GLaDOS sep332: that was my doing.
15:52 🔗 GLaDOS Which is why you don't run 4000 downloaders!
15:53 🔗 sep332 lol
15:54 🔗 Cameron_D haha, did they all try and pull a username at the same time?
15:54 🔗 GLaDOS Possibly.
15:54 🔗 Cameron_D You should stagger the startups
15:55 🔗 GLaDOS Could've happened when closure's script cleared my assigned tasks out at :50
15:55 🔗 Cameron_D instead of one pipeline with 200 threads start 4 with 50 each and delay them by 10 seconds or something
15:55 🔗 GLaDOS I'm not controlling the instances like that
15:55 🔗 GLaDOS soult's AMI has the number of threads in user data
15:56 🔗 soultcer Guess it's time to create a new AMI then
15:56 🔗 Cameron_D or maybe the run-pipeline can stagger the starting of concurrent threads
15:56 🔗 GLaDOS why would you do that :c
15:57 🔗 Cameron_D only by 500ms or something
15:57 🔗 Cameron_D just to stop there being 200 connections to the tracker at once
15:57 🔗 GLaDOS Not you, soult
15:57 🔗 GLaDOS The tracker usually can handle reboot time fine
15:57 🔗 soultcer Why don't we simply wrap all calls to the tracker in a LimitConcurrent?
15:57 🔗 Cameron_D oh
15:58 🔗 GLaDOS I mean, it handled 8k threads like a steel beam on crack
15:58 🔗 GLaDOS No, a polar bear on rocket boots
15:58 🔗 GLaDOS POINT IS, it survives.
15:58 🔗 Cameron_D it is back now, I think
15:59 🔗 GLaDOS It is
15:59 🔗 GLaDOS Watch the assigned items number
15:59 🔗 GLaDOS ..it died before they launched that time
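
[Editor's sketch, not part of the log] The two ideas floated above (stagger the startups, and cap how many calls reach the tracker at once) can be combined in a few lines of Python. This is a generic threading illustration, not the pipeline's own code: run_staggered and its parameters are hypothetical names, and each "worker" stands in for whatever function would contact the tracker.

```python
import threading
import time


def run_staggered(workers, max_concurrent=4, stagger_seconds=0.5):
    """Start one thread per worker, pausing between starts, with a concurrency cap.

    workers: list of zero-argument callables (stand-ins for tracker requests).
    Returns the workers' return values in the original order.
    """
    gate = threading.BoundedSemaphore(max_concurrent)
    results = [None] * len(workers)

    def guarded(index, worker):
        # At most max_concurrent workers are inside this block at any moment.
        with gate:
            results[index] = worker()

    threads = []
    for index, worker in enumerate(workers):
        thread = threading.Thread(target=guarded, args=(index, worker))
        thread.start()
        threads.append(thread)
        # Delay the next start so the requests do not all arrive together.
        time.sleep(stagger_seconds)

    for thread in threads:
        thread.join()
    return results
```

The stagger spreads out the initial burst, while the semaphore keeps a hard upper bound even if many workers become ready at the same moment later on.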
16:06 🔗 balrog is someone going to reset the punchfork tracker?
16:06 🔗 balrog 649 items are stuck
16:06 🔗 GLaDOS alard ^
16:17 🔗 Smiley also do you care if they are working?
16:18 🔗 Smiley I have a feeling I may have a pile of non working drives too.
16:18 🔗 ersi I prefer working drives :D
16:18 🔗 Smiley These should all be working,
16:18 🔗 Smiley looks like it may be around £25
16:18 🔗 Smiley which isn't bad to be fair
16:18 🔗 ersi Then, sure! I guess :)
16:18 🔗 Smiley :D
16:19 🔗 ersi about 240 SEK ish
16:19 🔗 Smiley I need to find a box, measure it, weigh it, get a quote etc
16:19 🔗 Smiley I may do it this weekend,
16:19 🔗 Smiley why you want them anyway?
16:20 🔗 ersi I guess I should ask how much the total capacity is :D
16:20 🔗 Smiley D:
16:20 🔗 Smiley Lord only knows?
16:22 🔗 ersi also, got any left over drive cards? Like extender cards or such for SCSI/IDE/S-ATA?
16:32 🔗 sep332 on the tracker page, what does the icon next to sep332 and erazmus mean? there's a tooltip but it's moving too fast to hover
16:33 🔗 soultcer sep332: It means that the user is running it from the warrior vm
16:33 🔗 sep332 oh ok
16:33 🔗 soultcer the tooltip would just show the warrior version
19:57 🔗 godane so my wifi went out
20:16 🔗 Smiley godane: if we sent you a long ethernet cable, would it help?
20:19 🔗 godane i don't want long ethernet
20:19 🔗 godane my dad may freak out about it
20:20 🔗 Smiley ok, no probs.
20:43 🔗 Smiley soooo
20:46 🔗 godane so guys i have this other side project going
20:47 🔗 Smiley go on...
20:47 🔗 godane the idea of a full source dvd that can recompile itself
20:48 🔗 Smiley ooo i think you mentioned this before
20:48 🔗 godane yes i have
20:48 🔗 godane its just i think i need other people to help with this distro
21:02 🔗 omf_ godane, is it linux based?
21:02 🔗 godane yes
21:03 🔗 godane i tried using slitaz
21:03 🔗 godane a guy helped with the compiling tools because their server was a mess
21:03 🔗 godane everything was installed in their chroot
21:04 🔗 godane i figure this distro would have some sort of archiveteam feel to it
21:04 🔗 godane since it will be able to recompile the distro offline
21:09 🔗 sep332 i heard tcc is so fast, it can compile a kernel at boot time
21:10 🔗 sep332 here's the demo http://bellard.org/tcc/tccboot.html
21:23 🔗 Smiley o_O
21:23 🔗 Smiley I don't know if joking or not
21:23 🔗 Smiley :O
21:26 🔗 sep332 Smiley: me, tcc, or godane? lol
21:27 🔗 Smiley tcc, but wow
21:27 🔗 Smiley i see it's not joking, that's impressive.
21:28 🔗 godane The Screen Savers: Suzanne Vega Slams File Sharing: https://archive.org/details/g4tv.com-video25735
21:28 🔗 sep332 bellard is a quiet, friendly, genius
21:28 🔗 Smiley Right, I'm watching the URL count on the warrior
21:28 🔗 Smiley maxing out at 700+Kb/s now
21:29 🔗 godane i didn't think that interview was on g4tv.com
21:29 🔗 Smiley Finished PrepareDirectories for Item user-farrytale
21:29 🔗 Smiley Received item 'user-farrytale' from tracker
21:29 🔗 Smiley Starting GenerateSeedURL for Item user-farrytale
21:29 🔗 Smiley Starting GetItemFromTracker for Item
21:29 🔗 Smiley Starting PrepareDirectories for Item user-farrytale
21:29 🔗 Smiley Finished GenerateSeedURL for Item user-farrytale
21:29 🔗 Smiley Starting WgetDownload for Item user-farrytale - Downloaded: 17960 URLs.
21:29 🔗 Smiley best one
21:29 🔗 sep332 wanna boot a full linux kernel in js on your browser? he's got that http://bellard.org/jslinux/
21:29 🔗 alard Only 212 likes, that shouldn't be that many? http://punchfork.com/farrytale
21:31 🔗 sep332 alard: is # of URLs supposed to match up with likes?
21:32 🔗 alard I'd expect a linear relation.
21:32 🔗 sep332 user-kurdyla has 1,183 likes and so far 20k+ URLs
21:35 🔗 alard 17960 urls / 212 recipes = 85 urls per recipe.
21:36 🔗 Smiley Starting WgetDownload for Item user-becme01 - Downloaded: 15610 URLs.
21:36 🔗 Smiley Starting WgetDownload for Item user-zibbyxo - Downloaded: 13760 URLs.
21:36 🔗 Smiley Starting WgetDownload for Item user-TLPaniciCorujo - Downloaded: 13180 URLs
21:38 🔗 alard We'll see.
21:38 🔗 Smiley hmmm
21:38 🔗 Smiley this Jim beam black == nice
22:39 🔗 godane PLEASE HELP
22:40 🔗 godane g4tv.com is killing me
22:40 🔗 godane i don't know how to get all 71 images here: http://www.g4tv.com/images/4923/comic-con-2012-new-york-comic-con-2012-cosplay-pictures/83909/
22:40 🔗 Smiley Starting WgetDownload for Item user-becme01 - Downloaded: 26560 URLs.
22:41 🔗 Smiley http://images.g4tv.com/rimg_606x0/ImageDb3/313600_l/.jpg
22:41 🔗 Smiley they are numbered godane
22:41 🔗 Smiley go from 599-however much you find?
22:42 🔗 Smiley for x in {1..500}; do wget "http://images.g4tv.com/rimg_606x0/ImageDb3/313${x}_l/.jpg"; done
22:43 🔗 godane just know that if it 404s it goes to this: http://cache.g4tv.com/rimg_606x0/logo.jpg
22:43 🔗 Smiley urgh that needs fixing but my brain is dead.
22:49 🔗 Smiley alard: ?
22:49 🔗 Smiley we have a failure.
22:50 🔗 Smiley http://pastebin.com/rXvRgixt
22:57 🔗 S[h]O[r]T godane i assume you have other links like that you need to grab all the images?
22:58 🔗 godane i'm doing the first 10000
22:59 🔗 S[h]O[r]T do you want to save the entire page with all of the images, or do you not care and just want all the images separately?
22:59 🔗 godane i'm just grabbing the images
22:59 🔗 godane there is java on the pages
23:00 🔗 godane so i could only get the first 15 images of a collection
23:04 🔗 Smiley - Downloaded: 34050 URLs.
23:04 🔗 S[h]O[r]T for i in {1..10}
23:04 🔗 S[h]O[r]T do wget http://images.g4tv.com/rimg_606x0/ImageDb3/313"$i"_l/.jpg
23:04 🔗 S[h]O[r]T done
23:09 🔗 godane i'm using seq but its the same
23:09 🔗 godane i use wget -x -i index.txt --warc-file=$website-images-$start-to-$end-$(date +%Y%m%d) --warc-cdx -E -o wget-images-$start-to-$end.log
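
[Editor's sketch, not part of the log] The index.txt that godane feeds to wget -i could be generated with something like the Python below. The URL template and the placeholder address are taken from the messages above; the helper names and the idea of comparing the final (post-redirect) URL against the placeholder are assumptions for illustration, not a description of what anyone in the channel actually ran.

```python
URL_TEMPLATE = "http://images.g4tv.com/rimg_606x0/ImageDb3/{number}_l/.jpg"
# Per the log, a missing image ends up at this logo instead of returning a plain 404.
PLACEHOLDER_URL = "http://cache.g4tv.com/rimg_606x0/logo.jpg"


def candidate_urls(start, end):
    """Image URLs for every number from start to end, inclusive."""
    return [URL_TEMPLATE.format(number=n) for n in range(start, end + 1)]


def is_missing(final_url):
    """True if a request ended up at the placeholder logo (assumed to mean 'not found')."""
    return final_url.rstrip("/") == PLACEHOLDER_URL


def write_index(path, start, end):
    """Write one candidate URL per line to path and return how many were written."""
    urls = candidate_urls(start, end)
    with open(path, "w") as handle:
        for url in urls:
            handle.write(url + "\n")
    return len(urls)
```

For example, write_index("index.txt", 313599, 313700) would produce a file suitable for the wget -i invocation quoted above; the range itself is only a guess based on the numbers mentioned in the channel.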
23:37 🔗 dashcloud I see from the scrollback folks were talking about the genius that is Fabrice Bellard
23:42 🔗 dashcloud he's also done FFmpeg, and QEMU from what I remember
23:42 🔗 omf_ ffmpeg is an abortion
23:43 🔗 omf_ try working on the code some time
23:43 🔗 dashcloud he started both of them
23:43 🔗 omf_ Fabrice keeps his own fork because of how screwball shit is
23:43 🔗 dashcloud there's the libav fork if you're unhappy with FFmpeg
23:43 🔗 dashcloud and then there's FFmbc that's focused on broadcasting needs
23:44 🔗 omf_ oh they are way worse. Trying to steal things and then play the blame game. The problem is not the work people have done. It is the non-separation of IP infringing material. Now they have worked on separating it but it is still a ways off
23:44 🔗 omf_ ffmbc is what I am talking about as the other fork
23:44 🔗 omf_ the only problem it had was changing the command flag structure
23:45 🔗 omf_ so it shipped by default partially incompatible
23:45 🔗 omf_ Fabrice is the only good part of the whole process
23:46 🔗 dashcloud I followed development quite extensively for a while, and I haven't seen his name on either list in years
23:47 🔗 dashcloud so I'm glad he's working on it, because he does amazing stuff, but I don't know where he's contributing
23:47 🔗 omf_ mainly on ffmbc
23:47 🔗 omf_ and consulting
23:47 🔗 omf_ there is no way to account for all the consulting gigs with ffmpeg he does
23:47 🔗 omf_ that is also important. It gets the word out about non-MS windows solutions
23:48 🔗 dashcloud that would make a lot of sense since I heard he was working at a French telecom
23:48 🔗 omf_ He used to have that info on one of his web pages
23:48 🔗 dashcloud I'm rather curious about the IP infringing material bit
23:48 🔗 omf_ well that is easy
23:48 🔗 omf_ mp3
23:48 🔗 omf_ x264
23:48 🔗 dashcloud really?
23:48 🔗 omf_ yes
23:49 🔗 omf_ really
23:49 🔗 omf_ mp3 is still patented
23:49 🔗 omf_ and so is x264
23:49 🔗 omf_ x264 has a few dozen patents in it
23:49 🔗 omf_ this is why no linux distro will ship a full ffmpeg by default
23:50 🔗 dashcloud but you can easily disable any part you don't want at compile time
23:50 🔗 omf_ they will get sued. I talked about this at the OpenVideoConference and the AlliedMediaConference
23:50 🔗 dashcloud no they won't
23:50 🔗 dashcloud they only do source distribution
23:50 🔗 dashcloud and x264 has a lot of commercial users, some of whom are fairly large names in their field
23:51 🔗 omf_ you mean like google who owns all those patents now
23:51 🔗 omf_ they released vp8 to start getting us away from patents
23:52 🔗 dashcloud which is good- and why you should support Xiph in their efforts to develop Daala, their next gen video codec to go along with Opus
23:52 🔗 omf_ after the crash and burn that was theora I do not have much faith in them
23:53 🔗 dashcloud theora is actually a big deal in some unexpected markets
23:53 🔗 Smiley https://sphotos-b.xx.fbcdn.net/hphotos-ash3/559815_480818148640108_981970963_n.jpg
23:53 🔗 omf_ They finally got a decent hardware decoder then?
23:53 🔗 Smiley the batcave....
23:53 🔗 dashcloud I've heard it's used a lot in games for cut-scenes- it's knocked Bink & Smacker out of that niche
23:54 🔗 omf_ yeah but theora was designed for internet streaming to replace x264/h264 and it didn't really get in there
23:54 🔗 omf_ What do we all use now
23:54 🔗 omf_ h264
23:55 🔗 omf_ youtube, flash video, hulu, netflix, amazon
23:55 🔗 omf_ It sucks for us as consumers
23:55 🔗 omf_ I want something to break out and take over
23:55 🔗 Smiley nite
23:56 🔗 omf_ But let's be positive. Flac is pretty fucking awesome
23:56 🔗 omf_ and you can buy whole albums in it
23:56 🔗 omf_ even 24bit
23:56 🔗 dashcloud unlikely- just be thrilled that FLAC and Opus are available, and are best-in-class or nearly so
23:56 🔗 dashcloud and Vorbis has kicked the crap out of MP3 for years now
23:57 🔗 dashcloud and the very finest H264 encoder you can get is open-source (GPL even)
23:57 🔗 omf_ the Avid one is still better. That is what they use for movies
23:57 🔗 dashcloud not really
23:57 🔗 omf_ Really. Let's see the proof then
23:57 🔗 dashcloud they've used x264 for Blu-rays- the people at Criterion Collection used it
23:58 🔗 omf_ yeah I read about that
23:58 🔗 omf_ A few films from one studio does not make it the best
