[00:13] i'm less then 10 episodes away from backing up all of Destructoid episodes [02:14] SketchCow: Destructoid is fully uploaded now [02:16] i'm starting to uploaded Not Mainstream Typical Videos [02:16] or notmtv [11:46] Not directly directed to Archives, but still interesting: http://www.bbc.co.uk/news/technology-24534864 [16:03] 50 uploaded IA items \o/ [16:25] go ersi go! [16:25] ^_^ [16:26] Here is an interesting little one-liner I have been using lately [16:26] find . -type f -size +1073741824c -printf "%s:%h%f\n" [16:27] It finds files in the current and sub dirs that is over 1 GB in size [16:27] then prints the size in bytes followed by the filename [16:28] here is a 3.7 gb file [16:28] 3876216249:.2anhhaj3560l402ercezt6zdg-20131008-102405.warc.gz [16:30] :. is the separator [16:50] du -ck | sort -n | something... [17:50] uploaded: http://archive.org/details/GeekBeat.TV_109 [17:50] uploaded: http://archive.org/details/GeekBeat.TV_108 [17:50] uploaded: http://archive.org/details/GeekBeat.TV_92 [17:50] these are the 3 missing geekbeat.tv episodes [17:51] episode 92 is from rev3 but the page doesn't even exist anymore [17:51] 108 and 109 are from youtube 720p hd version [18:10] i found episode 401 [18:10] even geekbeat.tv website doesn't have it [18:10] but its out there [18:43] I'M ALIVE [18:43] SketchCow: How was Chuck Peddle? [18:47] uploaded: http://archive.org/details/GeekBeat.TV_401 [18:47] now we have all 4 missing episodes [18:49] Peddle is the fuckin' man [18:49] Godane, I will sort and clear you later today [18:50] ok [19:02] SketchCow: i'm backing up isohunt.com forums [19:10] Do it [19:10] SketchCow: on the wiki, these images listed don't work: http://archiveteam.org/index.php?title=Category:Error . can you fix them or will they need to be reuploaded? [19:12] I just checked [19:13] The image is fine - the problem is that some genius put # in the filename. [19:13] (I'm on the server, looking at these) [19:15] # in a filename is possible under ext3/4 [19:16] maybe a Linux user [19:16] :) [19:27] Trying to think of the best way to fix these [19:33] for cycle in bash? [19:33] and using sed [19:37] find and mv [19:37] probably [19:37] http://archiveteam.org/images/4/47/ [19:37] No, I'm doing that. [19:38] I just wrote a regex to do some find and replace in files [19:38] How about this. I'll make you a deal, chfoo or whoever esle [19:38] I drop the links to fixed versions of all files, you replace them in the wiki. [19:38] Deal? [19:38] http://archiveteam.org/images/4/47/Akoha_Welcome_to_Akoha_1313392691673.png [19:38] First one [19:38] sure [19:48] i can't embed images by url. reupload them? [19:55] Yes. [19:56] Re-upload and then re-link [19:56] That seems to be it [19:56] http://www.flickr.com/photos/textfiles/10333719586/ [19:56] ok, will do [20:06] i'm going after the lounge topic on isohunt.com [20:07] since its not public [20:07] i signed up for a account and i'm going to do by topic since it maybe faster [20:08] the akoha image has been reuploaded and the page linking it was fixed [20:16] so i decided to do what i did with underground gamer to isohunt [20:16] make a forum index first [20:17] then grab the urls from that [20:31] what about the torrent descriptions [20:54] i have no idea other then blute force it [20:55] but i think in the US they only allow what is thought to be public domain or creative commons [21:57] Floppy drive music http://www.youtube.com/attribution_link?u=%2Fwatch%3Fv%3DXk_XaJ7gE4Q%26feature%3Dshare&a=iVeXmXSPjFxTY-4LlUrAjg [22:00] omf_: http://www.worldcat.org/title/usb-floppy-disk-drive-1/oclc/711691219&referer=brief_results [22:02] wow [22:02] and there is only 1 copy of it [22:21] I adore that "other title" [22:21] I wonder if I can get that via interlibrary loan. [22:21] I was thinking the same thing [22:22] Ever notice how broken most of the interlibrary systems are [22:22] search sucks [22:23] No one really needs to discover things at the library right [22:24] that's what the stacks are for! [22:24] Physical books are nice, there is a feel to them. However full text search is just a killer feature of ebooks [22:24] forget hunting around in the index [22:35] i'm getting redirect from isohunt.com to the login screen [22:45] looks like i may have fixed that problem [22:47] Got throttled on ID 526969088, retrying in 300 seconds and killing a thread... [22:47] Killed a thread, have 5 threads left. [22:47] ffs [22:53] so now i'm doinga bear grab [22:54] this is just going to be the first pages of each thread [22:54] then from there i will look for all the start pages and make a dump of that [22:54] even with a bear grab i'm still getting redirects [22:55] to login screen