[00:04] Kazaa supported Gnutella.
[00:30] not sure how many people know about https://guerrillamail.com , but it's been great when I've needed to get email accounts real quick for testing- it auto-creates a box, and any email only lasts 60 minutes
[00:35] hahahaha
[00:35] "blablabla@sharklasers.com"
[00:36] I love it
[00:47] dashcloud: sounds like mailinator.com in a way
[00:48] or tenminutemail
[01:20] Helvetica in the streets but a Wingdings in the sheets.
[01:24] A crazy mess of characters? :P
[02:01] oh wow didn't know DownThemAll could export lists of links
[02:04] anyone willing to take these lists of links and download/upload them to somewhere else? would do it myself if it wouldn't take two weeks to upload
[02:14] They're servers hosting DOOM singleplayer and multiplayer WADs.
[02:34] http://paste.archivingyoursh.it/heferiroha.avrasm | http://paste.archivingyoursh.it/yurerewogu.avrasm | http://paste.archivingyoursh.it/kuxacifaxe.avrasm forgot that GLaDOS had a pastebin.
[02:35] those could just be crawled with archivebot
[02:35] point it at the folder
[02:35] granted it's tied up with winamp jobs for a while
[02:36] didn't know that's what archivebot was for.
[02:36] wait.
[02:36] I'm dumb aren't I
[02:38] Yeah, that's pretty non-observant
[02:38] it'll make a warc but you can just warctozip it later for hosting more directly
[02:38] Fully deserving of the bollocking I'd usually get for that one
[02:39] DFJustin, what do you mean? Where does it host the warc it makes?
[02:39] fos
[02:39] then eventually to IA
[02:40] assuming it doesn't crash from overload
[02:40] (it's happened a couple of times)
[02:40] they eventually get dumped in items like this https://archive.org/details/archiveteam_archivebot_go_003
[02:40] which the wayback machine can then pull from
[02:40] ah OK
[02:40] Archivebot was the result of xmc and I brainstorming where archive team could use some automation. We decided that it was one-off, smaller (sub-gigabyte) websites that people would mention and then we had whoever was sitting around do however they thought WARCing was done.
[02:41] These servers aren't sub-gigabyte
[02:41] And then yipdw really made it his own, and the bot does the best practices, and then gives it to IA to add into the wayback.
[02:41] BlueMax: neither is most of the stuff we've been cramming down archivebot's maw
[02:41] The bot has limits. Larger things should be done elsewhere, but people use it that way anyway, because easy.
[02:42] I'm just saying what it was designed for.
[02:42] This pair of scissors is designed to cut paper, but I'm going to stab you with them anyway
[02:42] Fair enough, I just don't want to overload the bot if it's doing anything important like WinAMP
[02:42] I don't care what it was designed to do, I care about what it can do
[02:42] * BlueMax hides
[02:42] It's not how much you want to eat, it's how much you CAN eat
[02:43] anyway, we're doing okay on archivebot so far
[02:43] I can turn on another swap file
[02:43] heh
[02:43] alright, well, if you're fine with it, how do I load the URLs into the archivebot
[02:43] oh, uh
[02:43] currently there is no mass load thing
[02:43] I can do that for now
[02:44] does it work if I link a single page like http://static.best-ever.org/wads/ to the bot
[02:44] yes
[02:44] actually, that's the recommended usage
[02:44] that's how I got the text lists I posted above
[02:45] https://github.com/ArchiveTeam/ArchiveBot/issues/14 is a mass-loader but I haven't really gotten around to it
[02:45] fair enough
[02:46] oh hey
[02:46] it finished winamp
[02:46] neat
[02:46] cool, can I jump in then? :P
[02:49] yeah
[04:04] I talked to one of the WAD sources I wanted to back up, but he seemed unwilling to let me attempt to make a backup of his files. http://paste.archivingyoursh.it/kanowicejo.xml only reason I asked was because there's no public link list for his server, it's pure cluster-bomb guesswork to know what files he does have on there
[04:07] should I try talking to him again later on or leave it
[04:41] I'm all up for more BBS material.
[04:41] k
[04:41] :)
[04:41] Yeah. I was really excited to come home and tell you(might be weird lol)
[04:42] and he has about 150 more 3.25" floppy disks
[04:42] but he said look through them and let him know. :)
[04:43] he had a lot of old manuals from 80's too
[04:43] Sniffin' for treasure me hearty.
[04:44] To all AT: I strongly recommend posting an ad on Craigslist in the Computers by owner and the Wanted section looking for FREE floppies or other stuff.. People have a ton of stuff that they'd otherwise throw out. but can be rescued
[04:45] should add that to the -bs topic.
[04:45] sometimes your ad will get flagged by some people and removed. but just repost :) Also, I was sad to find out my Dad recycled a shitload of 5.25" floppy games I played with my sister when we were little. heh like spellbound and midnight rescue by the learning company
[04:47] he got rid of probably 30.. and a few years ago(like 5?) I recycled a shitload of stuff(my sis and i computer when i was 7, old computers with a turbo button haha, floppies) before I started getting into this stuff.
[04:48] But, SketchCow, can you dump/digitize them? I can mail them this week if you'd like. :)
[04:49] I can
[04:51] cool. Can I also send about 500 more(commodore 64, apple 5.25" disks, and such. ) Or do you recommend me sending it to Cowering/some guy named Al at the Silicon valley computer museum, still?
[04:51] Or me.
[04:51] I have a hell of a backlog but I will work through said backlog
[04:51] https://www.youtube.com/watch?v=E9XQ2MdNgKY
[05:31] did the winamp grab include the program, or just plugins and skins and the like?
[05:32] it's getting the program too
[05:33] watch download.nullsoft.com at http://archivebot.at.ninjawedding.org:4567/
[05:47] Another group wrote me, with, essentially "So, we'd love to have a chat about DOWNLOADING FACEBOOK AND TWITTER"
[05:47] I sent them to #archiveteam, we'll see if they show
[06:25] Hoho, that'll be interesting
[07:11] grr
[07:12] i've been using noscript, and have hit a couple of sites that display absolutely nothing without javascript. there have been others that display nearly nothing but a message to turn JS on.
[07:13] and i'm not talking about things like the leaderboard or warrior dashboard
[07:26] i know. it's annoying as hell
[07:26] noscript itself has built in workarounds for some sites
[07:26] but it doesn't cover everything
[12:36] BlueMax: uploading the ftp.fu-berlin.de idgames grab now (will be a little while at 32 GB)
[12:42] jeez, that idgames folder takes up 2/3rds of the FTP
[12:49] dashcloud, what's your opinion on this: I talked to one of the WAD sources I wanted to back up, but he seemed unwilling to let me attempt to make a backup of his files. http://paste.archivingyoursh.it/kanowicejo.xml only reason I asked was because there's no public link list for his server, it's pure cluster-bomb guesswork to know what files he does have on there
[12:55] don't know
[20:15] I was trying to take a copy of 240GB of raw photos at work today, and I was handed this hard disk I just could not get to work with anything I tried. I tried two Linux machines and a Mac desktop. It apparently works fine on the guy's Mac laptop.
[20:16] It was also somehow a NAS, and it was from some company I've never heard of before.
[21:06] Coderjoe: for blogspot dynamic view sites, you can give google cache the URL and it will respond with HTML
[21:06] Coderjoe: I've been thinking about making some sort of HTTP proxy that uses a headless webkit to render and sends the resulting DOM to Firefox
[21:23] uploaded: https://archive.org/details/cdrom-linuxformatmagazine-175
[21:36] godane: can you help me ?
[21:36] i am trying to upload an item to archive.org
[21:36] with the old ftp interface
[21:36] i went to the https://archive.org/checkin/ url
[21:36] first time i got an empty page
[21:36] now i got The identifier chosen is already taken. You will need to try an alternate identifier
[21:37] the unit name is CedricBlancherTribute
[21:49] http://techcrunch.com/2013/11/21/source-microsoft-in-talks-to-buy-shoutcast-and-winamp-from-aol/
[21:50] this is the important part: We have also learned that AOL has been planning to announce the closure of Shoutcast next week
[21:50] not terribly surprised
[21:51] nope, was to be expected
[22:03] Oh, cool.
[22:03] That thing I linked in #archiveteam an hour ago
[22:04] anyone can help me with my ia issue ?
[22:06] What is your ia issue.
[22:08] i am uploading a warc+cdx with the ftp interface
[22:08] and i got an empty page when i tried to checkin it
[22:09] it means it's taking a little time.
[22:09] going to https://archive.org/details/CedricBlancherTribute
[22:09] tell me to pick a collection
[22:09] Pick any one.
[22:15] CHANGING sid.cdx source="" to source="original"
[22:15] ASSIGNING "sid.cdx" to format "Unknown"
[22:15] normal ?
[22:22] hu
[22:22] i uploaded the generated cdx file
[22:22] and ia is regenerating a cdx file
[22:23] it always does, it's actually kinda pointless to upload a cdx
[22:24] it will take some time :(
[22:24] wget generated a 51mb cdx file
[22:32] okay
[22:32] task complete
[22:33] should i delete the cdx i uploaded ?