[00:04] SketchCow: i'm looking at your coverdisks collection [00:04] looks like we have alot of repeated pc gamer disks [00:04] or not [00:05] i was looking at issue 143 [00:05] one in collection say its christmas 2004 dvd release [00:06] another not in the pc gamer collection will say its a march 2004 release [00:06] ho [00:07] i think that one got screwed up [00:07] anyways i plan on put all 'loose' pc gamer cds in the pc gamer collection if thats all right [00:48] hi, has anyone been able to run this: https://github.com/odie5533/WarcQtViewer under Linux? I can't figure out what packages I need to install [01:21] so i just found something interesting [01:22] turns out cbsnews has full archive xml data [02:13] cbsnews got to be joking [02:15] dashcloud: $ sudo aptitude install python3-pyside.qtxml python3-pyside.qtwebkit python3-pyside.qtuitools python3-pyside.qtnetwork python3-pyside.qtgui python3-pyside.qtcore [02:15] looks like full episodes of 60 minutes are there and can be downloaded [02:18] dashcloud: WarcQtViewer launch after running $ sudo aptitude install python3-pyside.qtxml python3-pyside.qtwebkit python3-pyside.qtuitools python3-pyside.qtnetwork python3-pyside.qtgui python3-pyside.qtcore python-pyside python-pyside.qtopengl python-twisted-web python-twisted python-twisted-bin python-twisted-runner [02:19] but it fail to render te webpage [02:19] once i upload this stuff we can point to xml data to show that full episodes are there for download on there site [02:31] so it definitely needs python3 and not python2? [02:34] okay- trying that out now [02:41] anyone here know how to manipulate the dom [02:42] to remove a body * {visibility: 'hidden'} ? [02:42] because i am exploring a blogsport warc [02:42] and there are the ****** blogger login that hide everything [02:43] i can remove the iframe with window.document.getElementById("injected-iframe").style.visibility="hidden"; [02:43] in Firefox, View -> Page Style -> No Style [02:43] nico_32: the install of the packages went just fine, but I get this result: http://paste.archivingyoursh.it/fuluvoduxu.rb when doing python warcqtviewer.py [02:44] document.body.style.visibility = "visible"; [02:45] dashcloud: old/too recent version of twisted ? [02:46] it works on debian sid [02:46] with Python 2.7.6 [02:52] 12.0.0 here [02:52] for twisted [02:53] * nico_32 is going to sleep [02:53] thanks for your help! [02:53] (twisted 13.2.0-1 here) [06:24] does someone here know how I can upload an item together with the metadata through FTP to the archive so that it also derives, but that I don't have to fill in all the metadata in in the browser? [06:26] I tried adding a meta.xml file and it looked like it worked (I didn't have to add metadata by hand when checkin in the file) but it didn't derive and when I started the derivation it said there was an error [06:26] I think that was because I didn't upload a files.xml file, but for the files.xml file I need to have quite a few hashes and other things from the file [06:27] so I wanted to know if someone here knows a good way to do this? [08:00] Hard day. [08:02] hey SketchCow [08:03] i moved pc gamer cds into pcgamer_cdroms collection [08:03] some of them anyways [08:07] OK. [08:09] i'm starting to grab cbsnews.com video xml [08:09] then from there i can grab the video pages to get all video urls [08:10] fun fact: you can take part of rtsp url and change domain to download.cbsnews.com to get the flash file [08:11] SketchCow: there also have full 60 minutes episodes [08:20] Smiley: I had to drop the MTU to 1496, rsync is now working. [08:20] cc joepie91 [08:21] hah wat [08:21] yeah [08:21] that was my first reaction [08:21] reminds me of my alienvps woes [08:21] IRC wouldn't work [08:21] had something to do with the DDoS filtering apparently [08:21] setting an MTU made it link to the hub properly [08:21] never understood why that worked [08:26] me neither [08:30] 1496, must be adsl [08:35] Aranje: actually, it's more like 10Gbit :/ [08:36] midas: 500 ADSL connections [08:36] :) [08:36] hahahhaa [08:36] there ya go [08:36] :3 [08:36] somewhere along the path is a lowly adsl connection :3 [08:36] racks full of adsl modems [08:36] somebody's dslam is routing your 10g [08:57] joepie91: going to fosdem? [08:57] * midas isnt [09:17] midas: probably nort [09:17] not * [09:39] /dev/sda2 1.8T 349G 1.4T 20% / [09:39] lol [09:53] filling up your new drive :p [09:57] load average: 6.49, 6.39, 6.43 [09:57] midas: yes :P [09:57] downloading 3 FTPs atm [09:57] one of which is ftp.microsoft.com [09:57] so, eh [10:07] nice, wgetting FTP's has a nice effect on my system load :p [10:09] currently grabbing ftp.tu-chemnitz.de ftp.uni-muenster.de gatekeeper.dec.com [10:09] ftp.uni-erlangen.de ftp.warwick.ac.uk [10:25] i need this help using sed to fix this url: rtmp:\\cp98363.edgefcs.net\ondemand\media\2006\12\11\video2244953.flv [10:25] i need it to have / instead of \ [10:25] but sed keeps giving me problems [10:41] s/\\/\// maybe? [10:41] that's close o.o [10:57] this worked: sed 's|\\|/|g' | sed 's|//|/|g' [10:58] original url: rtmp:\/\/cp98363.edgefcs.net\/ondemand\/?auth=cbs&aifp=v001&slist=media\/2014\/01\/17\/119033923999\/media\/2014\/01\/17\/119033923999\/EN_0112WhalesNEW_796.mp4 [10:58] the url before that was what i could get by seding [10:59] thanks [11:02] :) [11:25] midas@semafoor:/storage/archive-sites/massive-ftp$ fg [11:25] du -sh [11:25] 1.8T . [11:26] lets see who is the biggest [11:31] Sending email > disconnects SCP session [11:31] this network is strange :p [11:40] .tw 423854814364069888 [11:40] I want to reiterate that "ICANN has .cheeseburger" is a real sentence with profound technical implications and wtf how did we get here (@mhoye) [12:19] right, there is a mirror of CCC on one of the FTP's [12:19] i hope IA has deduplication [12:58] SketchCow: question... i changed my email once and wanted to change it back to the old. can't do that. it says it's already used by another, which in fact is me... is there anything i can do? [12:59] I think i just fucked up big time right here :< [13:11] oh, and a similiar problems is that my uploads are on the old email, instead on the new one (where it should belong) [14:03] i've uploaded something! [14:04] https://archive.org/details/RaveRadio [14:04] see, i did! [14:04] trying to find more raveradio stuff, back in the day it was awesome [14:05] all grabbed from tape [14:24] tape?! [14:24] hah [14:24] I wonder how far along my FTP jobs are [14:25] * joepie91 screen -r's [14:25] ... oh ... [14:25] There is no screen to be resumed. [14:25] right, crap [14:25] :( [14:25] (crash) [14:25] :< [14:25] * joepie91 -nc's... [14:26] \o/ [14:26] joepie91: wich did you grab? [14:27] maybe we should put them on the wiki, just to see who is grabbing what [14:28] http://sprunge.us/ONGf [14:28] what now [14:30] excuse me? [14:30] it doesn't work [14:30] but idfk why [14:30] maybe no resume? [14:30] that would suck tho [14:30] no such thing as "resume" for ftp, is there? [14:31] wtf [14:31] without -nc it works fine [14:31] what's this [14:31] yeah [14:31] c is continue [14:31] right? [14:31] -nc == no clobber [14:31] skip existing files [14:32] yeah [14:32] microsoft doensnt like it it seems [14:32] fuck it [14:32] --mirror ignores files that weren't updated [14:32] will just do that then [14:33] uh oh [14:33] getting The Stare from the cat [14:33] oh dear. [14:33] bitching about lack of catfood in 3... 2... 1... [14:33] mew mew mew :+ [14:34] yup [14:34] first meow is in [14:34] i should have send you catfood instead of a harddrive [14:34] haha [14:34] just waiting for paypal transfer to come in [14:35] waiting for one of my employers to pay me again :P [14:35] kinda sucks, they are always late [14:36] what is this? proassist [14:38] ... [14:38] it came in [14:38] literally like 2 minutes ago [14:38] hah [14:38] brb Lidl [14:38] :D [14:51] :P [14:51] hello? proassist? yes, why are you stealing my money? [14:51] apperently i have my phone insured there, for the last 5 years [14:51] without me knowing [14:52] and i kept paying [16:16] midas: wat :| [20:45] o_o [20:46] O.o [20:47] O-O [21:19] http://paulrobertson.mechafetus.com/expedition.gif [22:44] so i'm looking at download.cbsnews.com/media folder in wayback machine [22:45] and it looks like this video was grab 892 times: http://download.cbsnews.com/media/2011/12/18/60_reef_1218_1296.m4v [22:45] also this video is 124mb [23:01] o_O [23:01] why so much o_O [23:01] (the grabs I mean) [23:14] if you'd like to see strange, check this out: https://archive.org/details/Usb2xFdd has 12 grabs and this has none: https://archive.org/details/Mousesuitedeluxe I can't figure out why people are downloading the first one, and not the second one