[00:06] *** MrRadar has joined #archiveteam
[00:19] *** DoomTay has joined #archiveteam
[00:27] *** nertzy has joined #archiveteam
[00:37] *** nertzy has quit IRC (Read error: Operation timed out)
[00:40] *** BlueMaxim has joined #archiveteam
[00:40] *** nertzy has joined #archiveteam
[01:12] *** schbirid2 has joined #archiveteam
[01:14] *** schbirid has quit IRC (Read error: Operation timed out)
[01:34] *** dashcloud has quit IRC (Remote host closed the connection)
[01:36] *** dashcloud has joined #archiveteam
[01:45] *** Stiletto has quit IRC (Ping timeout: 246 seconds)
[01:47] *** kristian_ has quit IRC (Leaving)
[02:18] *** hive-mind has quit IRC (Ping timeout: 260 seconds)
[02:20] *** hive-mind has joined #archiveteam
[02:29] *** philpem has quit IRC (Ping timeout: 260 seconds)
[03:17] *** Stiletto has joined #archiveteam
[03:43] *** db48x has joined #archiveteam
[04:27] *** Sk1d has quit IRC (Ping timeout: 194 seconds)
[04:32] *** Stiletto has quit IRC (Ping timeout: 246 seconds)
[04:33] *** Sk1d has joined #archiveteam
[04:46] *** Stiletto has joined #archiveteam
[05:36] *** SmileyG has quit IRC (Read error: Operation timed out)
[05:41] *** Smiley has joined #archiveteam
[05:46] *** DoomTay has quit IRC (Quit: Page closed)
[07:36] *** BlueMaxim has quit IRC (Quit: Leaving)
[07:37] *** odie5533 has joined #archiveteam
[07:47] The Firefall MMORPG is possibly going to shut down at any time. They've fired everyone and gone dark. I'm not sure if it needs a custom crawl or if Archive.org's crawler is handling it, but they have a website and forum at http://firefall.com/
[07:51] *** Honno has joined #archiveteam
[08:08] On it. Thank you
[08:18] *** dashcloud has quit IRC (Read error: Operation timed out)
[08:21] Thanks. I wish I didn't have to make this request, but I always knew I would.
[08:22] *** dashcloud has joined #archiveteam
[09:22] arkiver: likely to have high amounts of sketchy and warez-y stuff (given that it originates from gigatribe which is not exactly known for its legitimate userbase)
[09:22] arkiver: but probably not *all* sketchy
[09:22] it will be hard, however, to find all the URLs
[09:22] I don't think they have a public index at all
[09:22] and they seem to be kinda doing the mega.co.nz thing
[09:22] (which, again, is not surprising given that it originates from gigatribe)
[09:44] *** BlueMaxim has joined #archiveteam
[10:26] *** kristian_ has joined #archiveteam
[10:39] *** ZoeB has joined #archiveteam
[10:40] Hi! So I'm trying to archive a site like this:
[10:40] wget -mbc --warc-file=musicfromouterspace --warc-cdx http://www.musicfromouterspace.com
[10:40] ...but it looks like it's missing some of the content, I think due to the frames.
[10:40] Is there any way around this, does anyone know?
[10:41] The creator of that site died recently, so I don't know how much longer it'll be around. :/
[10:42] link to pages with frames?
[10:42] pls
[10:43] One sec, it's my partner that found them. Asking...
[10:43] http://www.musicfromouterspace.com/index.php?MAINTAB=SYNTHDIY&VPW=1109&VPH=500
[10:44] oh god i see http://www.musicfromouterspace.com/index.php?MAINTAB=SYNTHDIY&PROJARG=HOT_TIPS/led_drivers.html&VPW=1670&VPH=762
[10:44] Yeah. :/
[10:45] you might need to build those urls yourself :\
[10:45] or someone can help with headless browser javascript stuff maybe
[10:45] Well I'm about to head off to a wedding for a week, so I'm out of time right now, unfortunately.
[10:47] Which is really bad timing, as this guy was a pillar of the synth DIY community, had an O'Reilly book published on it etc (Make Analog Synthesizers).
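A rough sketch of the "build those urls yourself" suggestion, assuming the site's pages all follow the query-string pattern seen in the two links pasted above. The function name `build_url` and the default viewport values are illustrative choices, not anything taken from the site; the list of `PROJARG` paths would still have to be collected by hand or from the site's JavaScript.

```python
from urllib.parse import urlencode

# Base script path taken from the links pasted in the channel.
BASE = "http://www.musicfromouterspace.com/index.php"


def build_url(main_tab, proj_arg=None, vpw=1670, vph=762):
    """Build one page URL in the pattern seen in the log.

    main_tab and proj_arg mirror the MAINTAB and PROJARG parameters;
    vpw and vph mirror VPW and VPH (assumed here to be viewport sizes).
    """
    params = {"MAINTAB": main_tab}
    if proj_arg is not None:
        params["PROJARG"] = proj_arg
    params["VPW"] = vpw
    params["VPH"] = vph
    # safe="/" keeps the slash in paths like HOT_TIPS/led_drivers.html unescaped.
    return BASE + "?" + urlencode(params, safe="/")


if __name__ == "__main__":
    # Hypothetical list of project pages; only the first entry appears in the log.
    proj_args = ["HOT_TIPS/led_drivers.html"]
    for arg in proj_args:
        print(build_url("SYNTHDIY", arg))
```

The printed URLs could be saved to a file and handed to wget with `--input-file`, alongside the `--warc-file` option already used above.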
[10:47] i am sure someone will make this high priority
[10:47] the site looks amazing
[10:47] Indeed
[10:47] Thanks
[10:48] More generally, if wget doesn't support frames, that might be something worth looking into fixing, in the long term. I know frames are a bad old idea, but clearly some treasure troves of information still use them.
[10:48] it's not the frames, it's that the urls are generated in javascript i think
[10:48] Ah
[10:49] So wget would need to be more like a fully functional browser in order to interpret them? Yeah, that's not a small task, ack.
[10:49] There's another thing I need to ask too. Is there a wiki division of Archive Team?
[10:50] I vaguely remember something like that.
[10:50] I've been asked to keep backups of http://www.sdiy.info/w/Main_Page
[10:50] #wikiteam!
[10:50] Ah, thanks
[10:52] OK, I need to go and try on fancy outfits and pack. Thanks for your help!
[10:52] (Not a fancy outfit fan...)
[10:52] *** ZoeB has left
[11:01] hey guys https://developers.google.com/freebase/ is closing
[11:02] there are a couple of files, 30gb each, that i think archivebot will run into a content length timeout on
[11:16] *** z00nx has quit IRC (Remote host closed the connection)
[11:30] i'll try it on archivebot anyway
[11:39] *** Gfy has quit IRC (Ping timeout: 250 seconds)
[11:57] *** WinterFox has joined #archiveteam
[11:57] *** redlob has quit IRC (Read error: Operation timed out)
[11:58] *** redlob has joined #archiveteam
[12:06] Is there a way on Archive.org to see if an item you upload has been deleted or hidden? I thought I uploaded something a long time ago but don't see it so am not sure.
[12:08] odie5533: If you know the identifier, use archive.org/history/identifier instead of archive.org/details/identifier
[12:14] *** davidar has joined #archiveteam
[12:25] PurpleSym: thanks, but I'm afraid I don't know the identifier as I uploaded it (I think) quite a few years ago
[12:25] But perhaps I didn't upload it? Should I just upload it again?
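The history-versus-details tip from 12:08 can be written as a tiny helper. This is only a sketch: the `https://` scheme and the name `item_urls` are assumptions, and the identifier in the example is a made-up placeholder.

```python
ARCHIVE_BASE = "https://archive.org"


def item_urls(identifier):
    """Return the details and history page URLs for an Archive.org identifier."""
    return {
        "details": f"{ARCHIVE_BASE}/details/{identifier}",
        "history": f"{ARCHIVE_BASE}/history/{identifier}",
    }


if __name__ == "__main__":
    # "example-identifier" is a hypothetical placeholder, not a real item.
    for name, url in item_urls("example-identifier").items():
        print(name, url)
```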
hah
[12:29] *** nertzy has quit IRC (Ping timeout: 244 seconds)
[12:36] *** Gfy has joined #archiveteam
[12:41] *** Gfy has quit IRC (Ping timeout: 250 seconds)
[12:48] *** BlueMaxim has quit IRC (Quit: Leaving)
[12:58] *** Gfy has joined #archiveteam
[13:22] *** Gfy has quit IRC (Ping timeout: 250 seconds)
[14:00] *** Gfy has joined #archiveteam
[14:17] *** laufwerkf has joined #archiveteam
[14:40] *** laufwerkf has quit IRC (Remote host closed the connection)
[14:59] *** laufwerkf has joined #archiveteam
[15:00] *** philpem has joined #archiveteam
[15:00] *** laufwerkf has quit IRC (Read error: Connection reset by peer)
[15:02] *** WinterFox has quit IRC (Read error: Operation timed out)
[15:11] *** laufwerkf has joined #archiveteam
[15:11] *** laufwerkf has quit IRC (Connection closed)
[15:14] *** laufwerkf has joined #archiveteam
[15:15] *** laufwerk_ has joined #archiveteam
[15:15] *** laufwerkf has quit IRC (Read error: Connection reset by peer)
[15:24] *** laufwerk_ has quit IRC (Remote host closed the connection)
[15:34] *** laufwerkf has joined #archiveteam
[15:35] Any way to add an item to a collection, like the Windows 3.1 Games collection, so it can be streamed?
[15:37] *** laufwerk_ has joined #archiveteam
[15:42] *** laufwerkf has quit IRC (Read error: Operation timed out)
[15:44] *** laufwerkf has joined #archiveteam
[15:46] *** laufwerk_ has quit IRC (Read error: Operation timed out)
[15:48] *** laufwerk_ has joined #archiveteam
[15:52] *** laufwerkf has quit IRC (Read error: Operation timed out)
[15:53] *** laufwerk_ has quit IRC (Read error: Connection reset by peer)
[15:56] *** laufwerkf has joined #archiveteam
[16:04] *** andromed1 has joined #archiveteam
[16:05] *** DoomTay has joined #archiveteam
[16:05] It looks like Nokia has shut down all of the useful stuff on bell-labs.org.
I don't know about any other domains, but plan-9.bell-labs.org and cs.bell-labs.org are both gone
[16:05] Google still has most of the pages in cache, though
[16:05] *** laufwerkf has quit IRC (Remote host closed the connection)
[16:06] is there some way that we can grab those before they expire?
[16:08] (sorry, bell-labs.com)
[16:09] swtch.com (Russ Cox) and its source directories appear to be in trouble, but that might be temporary (no idea)
[16:09] *** laufwerkf has joined #archiveteam
[16:09] unfortunately, it looks like a bunch of the historical stuff on plan-9.bell-labs.com at least (/historic/) and a few other interesting things were blocked from Googlebot, so that won't work..
[16:13] *** laufwerkf has quit IRC (Remote host closed the connection)
[16:48] *** Simpbrain has quit IRC (Read error: Operation timed out)
[16:53] *** AlexLehm has quit IRC (Remote host closed the connection)
[16:55] *** Simpbrain has joined #archiveteam
[17:04] *** AlexLehm has joined #archiveteam
[17:18] *** Morbus has joined #archiveteam
[17:22] *** morbus_ has quit IRC (Ping timeout: 961 seconds)
[17:27] *** VADemon has joined #archiveteam
[17:27] *** Zialus has quit IRC (Read error: Operation timed out)
[17:29] *** Zialus has joined #archiveteam
[17:30] *** Ymgve has quit IRC ()
[17:48] bit.ly looks down
[17:48] Works for me
[17:49] *** TC01 has quit IRC (Read error: Operation timed out)
[17:50] *** TC01 has joined #archiveteam
[17:54] *** dashcloud has quit IRC (Read error: Operation timed out)
[17:57] *** dashcloud has joined #archiveteam
[17:58] *** Ymgve has joined #archiveteam
[18:11] *** dashcloud has quit IRC (Read error: Operation timed out)
[18:14] *** dashcloud has joined #archiveteam
[18:18] do we have a channel for torrent sites projects?
[18:18] if not, does anyone have a nice idea?
[18:20] #what.thefuck
[18:22] arkiver: #torrentsitearchiving ?
[18:24] #torrential
[18:35] ^^
[18:35] magnetictorrents?
[18:37] I like "torrential" better, since it's somewhat storm-related
[18:54] *** Simpbrain has quit IRC (Remote host closed the connection)
[19:14] so arkiver pls end voting when ready
[19:15] *** tomwsmf has joined #archiveteam
[19:47] let's do #torrential
[19:59] *** Zialus has quit IRC (Read error: Operation timed out)
[20:01] *** Zialus has joined #archiveteam
[20:14] *** kristian_ has quit IRC (Leaving)
[20:21] *** fie_ has quit IRC (Read error: Connection reset by peer)
[20:47] *** RichardG_ has joined #archiveteam
[20:47] *** RichardG has quit IRC (Read error: Connection reset by peer)
[21:20] *** pguth_ has quit IRC (Remote host closed the connection)
[21:22] *** pguth_ has joined #archiveteam
[21:26] Oh silly me, I had not even noticed this die https://en.wikipedia.org/wiki/Yahoo!_Babel_Fish
[21:28] Oh wow, that takes me back
[21:31] pff, that's an altavista project
[21:31] Okay, maybe it doesn't take me THAT far back, but I've come by babelfish translator before
[21:32] :)
[21:41] *** jk[SVP] has quit IRC (Ping timeout: 244 seconds)
[21:41] *** jk[SVP] has joined #archiveteam
[21:42] *** d_rebel has quit IRC (Ping timeout: 244 seconds)
[21:42] *** d_rebel has joined #archiveteam
[21:42] Sorry to ask again, but does anyone know how to add an Archive.org item to a collection, like adding a game item to the Windows 3.1 collection so it can be played with the javascript streamer?
[21:43] *** TC01 has quit IRC (Read error: Connection reset by peer)
[21:44] *** TC01 has joined #archiveteam
[21:51] odie5533: it doesn't have to be added to a collection to be made playable, but there are some metadata fields you have to fill out to tell it which emulator to use.
if you join #jsmess we can walk you through it there
[21:52] *** RichardG_ is now known as RichardG
[22:00] *** Gfy has quit IRC (Ping timeout: 250 seconds)
[22:09] *** Gfy has joined #archiveteam
[22:37] odie5533: In case you didn't join the #jsmess channel: http://digitize.archiveteam.org/index.php/Making_Software_Emulate_on_IA
[22:39] Thanks.
[22:47] *** DoomTay has quit IRC (Quit: Page closed)
[22:48] you'll want to test it before telling people about it- I've had a few that worked just fine offline, and then failed horribly online
[23:05] *** TC01 has quit IRC (Ping timeout: 244 seconds)
[23:05] *** fie has joined #archiveteam
[23:06] *** TC01 has joined #archiveteam
[23:07] *** AlexLehm has quit IRC (Ping timeout: 260 seconds)
[23:31] *** Honno has quit IRC (Read error: Operation timed out)