#archiveteam-bs 2013-11-21,Thu

↑back Search

Time Nickname Message
00:04 🔗 SketchCow Kazaa supported Gnutella.
00:30 🔗 dashcloud not sure how many people know about https://guerrillamail.com , but it's been great when I've needed to get email accounts real quick for testing- it auto-creates a box, and any email only lasts 60 minutes
00:35 🔗 BlueMax hahahaha
00:35 🔗 BlueMax "blablabla@sharklasers.com"
00:36 🔗 BlueMax I love it
00:47 🔗 Coderjoe dashcloud: sounds like mailinator.com in a way
00:48 🔗 xmc or tenminutemail
01:20 🔗 SketchCow Helvetica in the streets but a Wingdings in the sheets.
01:24 🔗 BlueMax A crazy mess of characters? :P
02:01 🔗 BlueMax oh wow didn't know DownThemAll could export lists of links
02:04 🔗 BlueMax anyone willing to take these lists of links and download/upload them to somewhere else? would do it myself if it wouldn't take two weeks to upload
02:14 🔗 BlueMax They're servers hosting DOOM singleplayer and multiplayer WADs.
02:34 🔗 BlueMax http://paste.archivingyoursh.it/heferiroha.avrasm | http://paste.archivingyoursh.it/yurerewogu.avrasm | http://paste.archivingyoursh.it/kuxacifaxe.avrasm forgot that GLaDOS had a pastebin.
02:35 🔗 DFJustin those could just be crawled with archivebot
02:35 🔗 DFJustin point it at the folder
02:35 🔗 DFJustin granted it's tied up with winamp jobs for a while
02:36 🔗 BlueMax didn't know that's what archivebot was for.
02:36 🔗 BlueMax wait.
02:36 🔗 BlueMax I'm dumb aren't I
02:38 🔗 SketchCow Yeah, that's pretty non-observant
02:38 🔗 DFJustin it'll make a warc but you can just warctozip it later for hosting more directly
02:38 🔗 BlueMax Fully deserving of the bollocking I'd usually get for that one
02:39 🔗 BlueMax DFJustin, what do you mean? Where does it host the warc it makes?
02:39 🔗 yipdw fos
02:39 🔗 yipdw then eventually to IA
02:40 🔗 yipdw assuming it doesn't crash from overload
02:40 🔗 yipdw (it's happened a couple of times)
02:40 🔗 DFJustin they eventually get dumped in items like this https://archive.org/details/archiveteam_archivebot_go_003
02:40 🔗 DFJustin which the wayback machine can then pull from
02:40 🔗 BlueMax ah OK
02:40 🔗 SketchCow Archivebot was the result of xmc and I brainstorming where archive team could use some automation. We decided that it was one-off, smaller (sub-gigabyte) websites thast people would mention and then we had whoever was sitting around do however they thought WARCing was done.
02:41 🔗 BlueMax These servers aren't sub-gigabyte
02:41 🔗 SketchCow And then yipdw really made it his own, and the bot does the best practices, and then gives it to IA to add into the wayback.
02:41 🔗 DFJustin BlueMax: neither is most of the stuff we've been cramming down archivebot's maw
02:41 🔗 SketchCow The bot has limits. Larger things should be done elsewhere, but people use it that way anyway, because easy.
02:42 🔗 SketchCow I'm just saying what it was designed for.
02:42 🔗 SketchCow This pair of scissors is designed to cut paper, but I'm going to stab you with them anyway
02:42 🔗 BlueMax Fair enough, I just don't want to overload the bot if it's doing anything important like WinAMP
02:42 🔗 yipdw <GeneKrantz> I don't care what it was designed to do, I care about what it can do
02:42 🔗 * BlueMax hides
02:42 🔗 SketchCow It's not how much you want to eat, it's how much you CAN eat
02:43 🔗 yipdw anyway, we're doing okay on archivebot so far
02:43 🔗 yipdw I can turn on another swap file
02:43 🔗 yipdw heh
02:43 🔗 BlueMax alright, well, if you're fine with it, how do I load the URLs into the archivebot
02:43 🔗 yipdw oh, uh
02:43 🔗 yipdw currently there is no mass load thing
02:43 🔗 yipdw I can do that for now
02:44 🔗 BlueMax does it work if I link a single page like http://static.best-ever.org/wads/ to the bot
02:44 🔗 yipdw yes
02:44 🔗 yipdw actually, that's the recommended usage
02:44 🔗 BlueMax that's how I got the text lists I posted above
02:45 🔗 yipdw https://github.com/ArchiveTeam/ArchiveBot/issues/14 is a mass-loader but I haven't really gotten around to it
02:45 🔗 BlueMax fair enough
02:46 🔗 yipdw oh hey
02:46 🔗 yipdw it finished winamp
02:46 🔗 yipdw neat
02:46 🔗 BlueMax cool, can I jump in then? :P
02:49 🔗 yipdw yeah
04:04 🔗 BlueMax I talked to one of the WAD sources I wanted to back up, but he seemed unwilling to let me attempt to make a backup of his files. http://paste.archivingyoursh.it/kanowicejo.xml only reason I asked was because there's no public link list for his server, it's pure cluster-bomb guesswork to know what files he does have on there
04:07 🔗 BlueMax should I try talking to him again later on or leave it
04:41 🔗 SketchCow I'm all up for more BBS material.
04:41 🔗 arkhive k
04:41 🔗 arkhive :)
04:41 🔗 arkhive Yeah. I was really excited to come home and tell you(might be weird lol)
04:42 🔗 arkhive and he has about 150 more 3.25" floppy disks
04:42 🔗 arkhive but he said look through them and let him know. :)
04:43 🔗 arkhive he had a lot of old manuals from 80's too
04:43 🔗 BlueMax Sniffin' for treasure me hearty.
04:44 🔗 arkhive To all AT: I strongly recommend posting an ad on Craigslist in the Computers by owner and the Wanted section looking for FREE floppies or other stuff.. People have a ton of stuff that they'd otherwise throw out. but can be rescued
04:45 🔗 BlueMax should add that to the -bs topic.
04:45 🔗 arkhive sometimes your ad will get flagged by some people and removed. but just repost :) Also, I was sad to find out my Dad recycled a shitload of 5.25" floppy games I played with my sister when we were little. heh like spellbound and midnight rescue by the learning company
04:47 🔗 arkhive he got rid of probably 30.. and a few years ago(like 5?) I recycled a shitload of stuff(my sis and i computer when i was 7, old computers with a turbo button haha, floppies) before I started getting into this stuff.
04:48 🔗 arkhive But, SketchCow, can you dump/digitize them? I can mail them this week if you'd like. :)
04:49 🔗 SketchCow I can
04:51 🔗 arkhive cool. Can I also send about 500 more(commodore 64, apple 5.25" disks, and such. ) Or do you recommend me sending it to Cowering/some guy named Al at the Silicon valley computer museum, still?
04:51 🔗 SketchCow Or me.
04:51 🔗 SketchCow I have a hell of a backlog but I will work through said backlog
04:51 🔗 SketchCow https://www.youtube.com/watch?v=E9XQ2MdNgKY
05:31 🔗 Coderjoe did the winamp grab include the program, or just plugins and skins and the like?
05:32 🔗 DFJustin it's getting the program too
05:33 🔗 DFJustin watch download.nullsoft.com at http://archivebot.at.ninjawedding.org:4567/
05:47 🔗 SketchCow Another group wrote me, with, essentially "So, we'd love to have a chat about DOWNLOADING FACEBOOK AND TWITTER"
05:47 🔗 SketchCow I sent them to #archiveteam, we'll see if they show
06:25 🔗 ersi Hoho, that'll be interesting
07:11 🔗 Coderjoe grr
07:12 🔗 Coderjoe i've been using noscript, and have hit a couple of sites the display absolutely nothing without javascript. there have been others that display nearly nothing but a message to turn JS on.
07:13 🔗 Coderjoe and i'm not talking about things like the leaderboard or warrior dashboard
07:26 🔗 Lord_Nigh i know. its annoying as hell
07:26 🔗 Lord_Nigh noscript itself has built in workarounds for some sites
07:26 🔗 Lord_Nigh but it doesn't cover everything
12:36 🔗 dashcloud BlueMax: uploading the ftp.fu-berlin.de idgames grab now (will be a little while at 32 GB)
12:42 🔗 BlueMax jeez, that idgames folder takes up 2/3rds of the FTP
12:49 🔗 BlueMax dashcloud, what's your opinion on this: I talked to one of the WAD sources I wanted to back up, but he seemed unwilling to let me attempt to make a backup of his files. http://paste.archivingyoursh.it/kanowicejo.xml only reason I asked was because there's no public link list for his server, it's pure cluster-bomb guesswork to know what files he does have on there
12:55 🔗 dashcloud don't know
20:15 🔗 w0rp I was trying to take a copy of 240GB of raw photos at work today, and I was handed this hard disk I just could not get to work with anything I tried. I tried two Linux machines and a Mac desktop. It apparely works fine on the guy's Mac laptop.
20:16 🔗 w0rp It was also somehow a NAS, and it was from some company I've never heard of before.
21:06 🔗 ivan` Coderjoe: for blogspot dynamic view sites, you can give google cache the URL and it will respond with HTML
21:06 🔗 ivan` Coderjoe: I've been thinking about making some sort of HTTP proxy that uses a headless webkit to render and sends the resulting DOM to Firefox
21:23 🔗 godane uploaded: https://archive.org/details/cdrom-linuxformatmagazine-175
21:36 🔗 nico_32 godane: can you help me ?
21:36 🔗 nico_32 i am trying to upload an item to archive.org
21:36 🔗 nico_32 with the old ftp interface
21:36 🔗 nico_32 i went to the https://archive.org/checkin/ url
21:36 🔗 nico_32 first time i got a empty page
21:36 🔗 nico_32 now i got The identifier chosen is already taken. You will need to try an alternate identifier
21:37 🔗 nico_32 the unit name is CedricBlancherTribute
21:49 🔗 midas1 http://techcrunch.com/2013/11/21/source-microsoft-in-talks-to-buy-shoutcast-and-winamp-from-aol/
21:50 🔗 midas1 this is the important part: We have also learned that AOL has been planning to announce the closure of Shoutcast next week
21:50 🔗 Coderjoe not terribly surprised
21:51 🔗 midas1 nope, was to be expected
22:03 🔗 SketchCow Oh, cool.
22:03 🔗 SketchCow That thing I linked in #archiveteam an hour ago
22:04 🔗 nico_32 anyone can help me with my ia issue ?
22:06 🔗 SketchCow What is your ia issue.
22:08 🔗 nico_32 i uploading a warc+cdc with the ftp interface
22:08 🔗 nico_32 and i got a empty page when i tried to checkin it
22:09 🔗 SketchCow it means it's taking a little time.
22:09 🔗 nico_32 going to https://archive.org/details/CedricBlancherTribute
22:09 🔗 nico_32 tell me to pick a collection
22:09 🔗 SketchCow Pick any one.
22:15 🔗 nico_32 CHANGING sid.cdx source="" to source="original"
22:15 🔗 nico_32 ASSIGNING "sid.cdx" to format "Unknown"
22:15 🔗 nico_32 normal ?
22:22 🔗 nico_32 hu
22:22 🔗 nico_32 i uploaded a the generated cdx file
22:22 🔗 nico_32 and ia is regenerating a cdx file
22:23 🔗 DFJustin it always does, it's actually kinda pointless to upload a cdx
22:24 🔗 nico_32 it will take some time :(
22:24 🔗 nico_32 wget generated a 51mb cdx file
22:32 🔗 nico_32 okay
22:32 🔗 nico_32 task complete
22:33 🔗 nico_32 should i delete the cdx i uploaded ?

irclogger-viewer