[00:19] this piss me off: http://www.5isucai.com/ftpsearch/ [00:19] i search webuser [00:19] there is like 17 of them there [00:19] mostly form 2005 [00:25] http://www.5isucai.com/ftpsearch/?showftp=0 [00:26] chronomex: any idea on how to find the real ip for the files? [00:26] sorry, I already poked at it a bit and had no luck [00:29] actually be a scene member? [00:30] no way! [00:32] only m2v.ru and 5isucai.com have it [00:36] um [00:36] here's a hint: that web server also has an FTP server [00:36] it does [00:37] its password protected though [00:37] but better [00:38] it is probably somewhere on the server, though [00:38] but I am not going digging for you [00:38] i can't read chinese [00:38] me either [00:38] also, enjoy your malware [00:38] yaaay [00:41] http://www.5isucai.com/robots.txt [00:49] DFJustin: I'm checking something. [00:51] Is there any particular protocol for verifying a WARC file before upload? [00:52] DFJustin: I've been talking to this guy for no small amount of time. [00:52] I told him I was just going to pay for him to mail me a hard drive, but he's been slow. [00:52] To get back to me. [00:52] Obviously in the meantime, he's been busy. I told him I don't like ad-click-though things. [00:53] If someone wants to help me come up with a command-line way to grab the stuff, I'll do it. [00:53] SketchCow: What stuff? [00:53] I have a few free hours and I'm itching for something to do [00:54] http://zippyshare.com/Hallfiry [00:54] All of them. [00:54] I want all of them. [00:54] He was doing a direct upload to me on fos, but he switched to this because it was easier for him. [00:55] Make it easy for me to use a wget. [00:55] Go. [00:56] * undersco2 goes [00:56] I don't know how the fuck that was easier [00:56] For him [00:57] No idea either. Probably a drag and drop client. 
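On the 00:51 question about verifying a WARC before upload: the log gives no official procedure, so the following is only a minimal sanity-check sketch, assuming the file is a gzip-compressed WARC whose records start with a "WARC/1.0" line. The function names and the demo file are hypothetical. It reads the whole file so a truncated or corrupt gzip stream raises an error, and it returns a rough record count (a payload line that happens to start with "WARC/1.0" would be counted too).

```python
import gzip
import os
import tempfile


def warc_sanity_check(path):
    """Read a .warc.gz end to end and return a rough record count.

    Raises EOFError or OSError if the gzip stream is truncated or corrupt.
    """
    records = 0
    with gzip.open(path, "rb") as stream:
        for line in stream:
            # Each WARC record begins with a version line.
            if line.startswith(b"WARC/1.0"):
                records += 1
    return records


def write_demo_warc(path, count):
    """Hypothetical helper: write `count` tiny made-up records, one gzip member each."""
    record = b"WARC/1.0\r\nWARC-Type: warcinfo\r\nContent-Length: 0\r\n\r\n\r\n\r\n"
    for _ in range(count):
        with gzip.open(path, "ab") as out:
            out.write(record)


if __name__ == "__main__":
    demo = os.path.join(tempfile.mkdtemp(), "demo.warc.gz")
    write_demo_warc(demo, 3)
    print(warc_sanity_check(demo))
```

A check like this only shows the file is readable and non-empty; it says nothing about whether the crawl captured everything.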
[01:01] indeed [01:01] http://www.zippyshare.com/sites/uploader.jsp [01:02] * chronomex pokes at it desultorily [01:02] it's a goddamn data motel [01:03] like a library with all the books glued to the shelves [01:06] what was the way that was more difficult for him? [01:07] the upload tool supposedly can create a report of the file urls... [01:07] might be handy if he created that [01:10] http://www9.zippyshare.com/v/23031924/file.html is the index [01:11] what's a ctf? [01:13] signature: Catalog 3.00 [01:17] whee. need to decipher minified javascript [01:17] hmm [01:17] or not [01:18] haha [01:18] Update: Down to 8,000 IUMA artists to upload [01:18] there's supposed to be a captcha, but I never saw one [01:18] Update: The first 4 FORTUNECITY .tar files have been generated and now being uploaded. [01:18] oh [01:18] it only shows if you don't have flash [01:19] i think I see how to download a file given the file.html page. [01:20] but the next problem is walking the directory tree there and getting all of those [01:22] There will be 26 FortuneCity archives. [01:22] Each 50gb. [01:23] and fail. [01:30] I've got a relatively small WARC downloaded for WFNX - anything I should do to verify it before uploading? [01:33] Nah. Just go for it. [01:34] oh [01:34] that was easy [01:34] Coderjoe: how do you download? [01:34] I've been trying, but I'm stuck on the swf's generation of the "time" param [01:35] SketchCow: OK, thanks. Anything special to do for uploading? 
Collection names, etc [01:37] undersco2: still working on the download, but I have the file tree data [01:37] get this json data: http://zippyshare.com/rest/public/getTree?user=Hallfiry&ident=3w9cyej0 [01:37] yeah, I got that [01:38] I have everything except it's kicking me over to the download page instead of sending the file [01:38] it's via json [01:38] I wonder if they whitelist by ip [01:38] i gave up on the file download for the time being [01:38] no [01:38] we [01:38] ll [01:38] the link in the tree gives you the file download page [01:40] it looks like the recaptcha thing might add a temp whitelist [01:40] when it's calling /rest/captcha/test [01:40] yeah [01:40] i thought i could get in via the recaptcha, but when I grab the file page with wget, the recaptcha just says about adding to my file list [01:41] http://hastebin.com/teyeyetata.bash [01:42] yeah [01:42] well, there's a per server JSESSIONID [01:42] I wonder if its checking that [01:43] (set in cookies) [01:45] So wait, misty - you're in canada? or UK? [01:46] mmm shaw [01:46] Coderjoe: we could decompose the swf and see how "time" gets set from "seed" [01:47] :D [01:49] SketchCow: Canada [01:55] undersco2: so just download?key=(fileid) isn't enough? [02:00] hmm [02:06] As one critic pointed out: "I've never seen a show jump 1000 sharks before." 
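For the "walking the directory tree" problem, here is a rough sketch of pulling file page links out of tree JSON such as the getTree response. The real layout of that JSON is not shown in the log, so the walker assumes nothing about key names: it visits every dict and list and keeps any string ending in "/file.html", the shape of the index URL posted at 01:10. The sample tree and its keys ("name", "items", "link") and the example.com URL are made up purely to exercise the function.

```python
import json


def collect_file_pages(node, found=None):
    """Recursively gather every string value that looks like a file page URL."""
    if found is None:
        found = []
    if isinstance(node, dict):
        for value in node.values():
            collect_file_pages(value, found)
    elif isinstance(node, list):
        for item in node:
            collect_file_pages(item, found)
    elif isinstance(node, str) and node.endswith("/file.html"):
        found.append(node)
    return found


# Hypothetical sample; not the actual getTree output.
SAMPLE = json.loads("""
{
  "name": "root",
  "items": [
    {"name": "index", "link": "http://www9.zippyshare.com/v/23031924/file.html"},
    {"name": "sub", "items": [
      {"name": "a", "link": "http://example.com/v/1/file.html"}
    ]}
  ]
}
""")

if __name__ == "__main__":
    for url in collect_file_pages(SAMPLE):
        print(url)
```

This only yields the file pages. As the discussion goes on to say, fetching the file itself still needs the "time" parameter.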
[02:18] Coderjoe: nope [02:18] you need the time param too [02:18] well, i have no flash tools atm [02:19] otherwise it 302s to the download page [02:35] mistym: Just checking, I have another misty I've been doing stuff with [02:35] And I was wondering why the hell she'd be up at 3am in the UK [02:38] SketchCow: my project for tonight is twofold: [02:38] 1: verify all the rwts18 dumps at least pass data checksum checks and note any which do not [02:39] Coderjoe: me either [02:39] 2: find and prepare a 360k shugart floppy drive for further imaging since the wider drive i sent has some issues with flaky 'wide track' disks like the a2 ones (it works fine with freshly formatted ones though) [02:39] oh and 3: pack a 5.25" cleaning disk for you [02:39] someone could just manually download it all and send to SketchCow :D [02:40] undersco2: ugh [02:40] you'll need some 90% isopropyl alcohol to use it, but that's easy enough to get at CVS [02:43] Thanks, LordNlptp [02:43] Then I'd like to go to the box and start ripping. 
[02:44] er correction for 2: find and prepare a 360k DD/40track shugart floppy drive for further imaging since the NARROWER HD/80track drive i sent has some issues with flaky 'wide track' disks like the a2 ones (it works fine with freshly formatted ones though) [02:44] i now know why there was problems with that site [02:44] the a2 drives were a completely brilliant design by woz; they're simple as hell inside and very reliable [02:45] they want you to pay for a account on there ftp site [02:46] also the karateka proto disk which you did 4 passes on was probably written by a very badly misaligned drive, that would explain the extreme noise [02:46] even on a 40track drive that one may not read right, its possible the only way to image that one is to deliberately misalign a drive on purpose [02:47] undersco2: http://www.swftools.org/download.html [02:51] oh joy [02:51] haha [02:52] SketchCow: It IS Saturday night :V [02:53] Yes [02:55] OK, what collection should I toss wfnx in? Just "community texts" or smth for now? No "web" or "software" option. [02:57] Yeah, call it whatever and I'll transfer it when you give me the item name. [02:57] How big? [02:57] LordNlptp: or perhaps if we figure out these drivetec drives. [02:58] Like 80 megs compressed. I'm thinking it may not have captured everything. [02:59] Well, it's not a big site to begin with. [03:00] True. [03:01] http://archive.org/details/archiveteam-wfnx [03:02] Moved. [03:03] meh [03:03] I have no idea how to make sense of this actionscript disassembly output [03:03] Thanks! 
[03:04] i can see where it gets the "seed" property, but I don't know what it does with it [03:11] Don't murder yourself on it [03:17] I mean, we're essentially hacking a filesharing site [03:17] And those poor bastards, how long before they're all in Gitmo [03:17] Gitmo by Sony [03:17] Sony Presents Gitmo [03:19] well, i'd done it before with mediafire [03:21] yeah [03:22] the swf thing is pretty clever [03:22] but that didn't use flash :-\ [03:22] lot harder to decompose than some javascript [03:22] which is funny, because actionscript3 is derived from ecmascript [03:40] ftp://61.147.109.66/ [04:37] does anyone know what this means: ftp://4guest:4guest@81.176.65.96 [04:38] user:password@host [04:39] the site doesn't ping anymore [04:42] Fortunecity happening! [04:45] http://archive.org/details/archiveteam-fortunecity [04:48] SketchCow: Awesome! [04:49] i'm thinking archive.org could become like a web seed [04:49] yep, that it can [04:49] Web seed? [04:49] it's a pseudostandard whereby torrent files can point to http urls as alternative sources [04:50] that was the original idea of bittorent anyways [04:51] most likely magnet links will have to be used [04:52] to same space on the server end [04:52] as I recall, undersco2 hacked up support at IA [04:53] Gotcha. So using BT to spread around archive.org material, or using it as a backstop to BT material? [04:53] yep [04:53] one torrent/magnet link for item also [04:54] First 50gb set uploaded. [04:54] there maybe just one with original source file archive and one with all files [04:54] Looks pretty good - should upload quickly. [04:54] Going to get the rest up, THEN test, THEN delete originals. [04:54] Yay [04:55] Seeder of last resort - it'd be the Federal Reserve of data :P [04:57] > done [04:57] > sh cityofgold 000000${each} [04:57] Nobody is watching! Now's the big chance! 
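The 04:38 answer, user:password@host, can be checked mechanically. This is a small sketch using Python's standard urllib.parse on the URL from 04:37. The function name is hypothetical, and the fallback to port 21 is the standard FTP default, not something stated in the log.

```python
from urllib.parse import urlsplit


def describe_ftp_url(url):
    """Split an ftp:// URL into its login and host parts."""
    parts = urlsplit(url)
    return {
        "scheme": parts.scheme,
        "user": parts.username,
        "password": parts.password,
        "host": parts.hostname,
        # No port in the URL means the FTP default, 21.
        "port": parts.port or 21,
    }


if __name__ == "__main__":
    print(describe_ftp_url("ftp://4guest:4guest@81.176.65.96"))
```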
[04:57] Yes, that's what my script says [05:00] hahahaha [05:00] chronomex: a little bit, but one of my projects is making it a first class citizen [05:01] like, a thing that can be derived to? [05:01] yeah [05:01] cool story bro [05:02] :) [05:02] this is cool: http://doc.hackbbs.org [05:08] ...I just uploaded a user that was over 100G to fos. [05:09] NotGLaDOS: It was actually 100G and not a recursion bug? [05:09] Could be either. [05:09] It happens [05:09] What's his name [05:09] rilo69 [05:16] Weird. [05:17] do you guys have circuit celar magazine? [05:17] *circuit cellar [05:25] No. [05:25] http://141.105.33.55/~lomov/library/bigdvd/ [05:25] Godane, I love you're so energetic about this, but driving tons of things into the dark archives is fun, but it's just not a priority. [05:26] i know [05:26] "The Dark Archives" sounds like some occult thing [05:26] there is also torrents on the site too [05:27] the darchives [05:33] 150gb of FortuneCity uploaded. [05:41] Looks like 26 parts to Fortunecity - so about 1.3tb of data. [05:41] (With a few gb of stuff that didn't slip right in.) [06:29] i would think any data would slip right in... like a hot dog down a hallway [06:32] No, problem is that there are sometimes two grabs of the same filename - with different sizes. Easier to just pack it in a separate setup. [06:38] 6,000 artists added to IUMA today. [06:38] When the archive.org reindexer hits, it's gonna be crazy. [06:49] how the hell is it 3am? [06:50] Because 2am is a lazy bitch and cut out on us [06:50] I had a REALLY nice healing nap today [07:26] ok 360k drive found [07:30] ...or not :/ modifying this drive for double-index use is gonna be damn near impossible. drat. 
[07:31] will certainly work right-side up with disks though [07:32] i need to figure out if my 360k drive is easy to make double-index [07:42] drive is the epson-seiko made model md5201 from an epson equity I+ PC-XT clone [07:42] the system is actually in pretty good shape, i was gonna try to boot it to see if it still runs [07:43] the drive uses an unusual clasp mechanism [08:02] ok that clasp is WEIRD as HELL [08:03] you push the disk in, then push the button AFTER that [08:03] and it locks the disk in place [08:03] then push the button AGAIN andit pops back out and ejects the disk [08:03] never seen a drive mech anything remotely like that befoe [08:06] and here I would have expected it to auto-lock after the disk got pushed all the way in [08:06] yeah [08:07] egh eew, a thousand-legger just crawled across the floor [08:08] https://en.wikipedia.org/wiki/Scutigera_coleoptrata <- one of those suckers [08:12] that's how the clasp on mine works, it looks like a giant 3.5" drive but the button behaviour is opposite [08:34] hey philpem [08:35] hi [08:35] tested with flaky a2 disks and yes the 360k/48tpi drive definitely does read them better [08:36] but this epson/seiko drive owing to its odd clasp mechanism is nearly impossible to modify for flippy disks [08:36] i'm not sure if i own any other 360k drives [08:36] flippy-flappy-floppy [08:36] other than the 5150 ones and those have flaky track 0 sensors [08:42] i'll keep checking in the attic though [12:48] uh, good news, donbex found 2 GB (compressed) of lost Splinder data [13:24] Mornin'. [13:24] This should be over in #discferret [13:33] In other news, I just turned the IUMA up to full to finish it off. [18:12] "This stuff about demanding source [for your backup software] is crap. It's like demanding source for your accounting software because the data is vital, and you'll need that accounting software to read it again. Or only trusting accounting software that stores its information as plain text. It's silly." 
[18:12] * Wyatt sighs heavily. [18:13] Awww, who said that! [18:15] Some guy on a forum in a thread about backups and The Clown. [18:15] awwww [18:15] I had a "creative" who sells his work to companies go on my Sockington Not Selling out and call me pathetic for being so haughty about my virtual cat. [18:16] A TRUE artist would pimp tweets to pet food companies [18:17] trash bag. [18:17] It's a trash bag company. [18:17] If he understood it, he wouldn't be in that line of work, so I suppose he's at least well-suited to his chosen profession. [18:17] I didn't actually read the agreement, it was a trash bag company that was asking about this? [18:18] Yeah, trash bag company. [18:18] Wow. [18:18] They had some campaign called, like Glad WildLife or something. [18:19] I was going to ask why a trash bag company thought that cat tweets would be the PERFECT way to get sales. That's really lame. [18:20] Didn't some other internet pets take the offer up? [18:22] The general idea is that if you live with animals, you are living a wild life, and the messes that happen could be cleaned up with Glad. [18:22] shaqfu: Apparently - https://twitter.com/#!/BronxZoosCobra/statuses/202111513396379649 [18:33] Yeah, just not a thing. [18:33] I love people who lecture me on how to live my life. [18:33] it ALWAYS works [18:41] found something: http://sohraburp.wordpress.com/2008/01/09/a-huge-collection-of-free-ftp-sites/ [18:54] uploading full episodes of the screen savers from 2002 [18:55] there is only 4gb of that [18:55] then from there i will start uploading march to november of 2004 of the screen savers [22:41] ----------------------------------------- [22:41] http://www.tabblo.com [22:41] 10 days warning [22:41] Who wants to take a shot? 
[22:41] ----------------------------------------- [22:43] i just remember i have some videos for stage6 [22:43] its the satliteview videos [22:45] one of the videos is this: http://en.wikipedia.org/wiki/St.GIGA [22:48] Due to the rewritability of the cartridges, the fact that "SoundLink" broadcasts were not downloaded to the game cartridges but rather were streamed live during the noon-2AM [22:55] SketchCow: the Tabblo FAQ is from 2010 [23:02] yeah [23:02] that's what makes it great. [23:04] what the FAQ says isn't what they're doing [23:04] they're taking it down, like it or not :( [23:05] still, no surprise from hp [23:06] good news/bad news- you need an account to access some albums, but it can still be created [23:10] also, there's no confirmation email or link to click on to activate your account- it's live as soon as you sign up, with the password you selected [23:24] hi guys, what's wrong with this wget-warc commandline: ./wget-warc -U "Mozilla/5.0 (X11; Linux i686; rv:8.0) Gecko/20100101 Firefox/8.0 Iceweasel/8.0" -e "robots=off" -nv -o "Tabblo/wget-warc.log" --directory-prefix="Tabblo/files/" --warc-file="Tabblo/Tabblo-html" --warc-max-size=inf --warc-header="operator: Archive Team" --warc-header="Tabblo-site-download" -r -l inf --no-remove-listing --no-timestamping --trust-server-names --adjust-extension --span-hosts http://www.tabblo.com/studio/view/recent/ [23:26] Bring it to #sadlo [23:26] As per top thing [23:32] SketchCow: you got a mention here: http://arstechnica.com/information-technology/2012/05/digital-archivists-technological-custodians-of-human-history/ (no mention of Archive Team though)
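The log never says what was wrong with the wget-warc command line asked about at 23:24, so this sketch only shows one way to inspect it. It is written here as a single string, with --trust-server-names read as one flag. The code tokenizes the command the way a shell would and lists any --warc-header value that lacks the usual "name: value" shape. The helper names are hypothetical, and a flagged header is only something that stands out, not a confirmed cause.

```python
import shlex

# The command from the log, as one string.
COMMAND = (
    './wget-warc -U "Mozilla/5.0 (X11; Linux i686; rv:8.0) '
    'Gecko/20100101 Firefox/8.0 Iceweasel/8.0" -e "robots=off" -nv '
    '-o "Tabblo/wget-warc.log" --directory-prefix="Tabblo/files/" '
    '--warc-file="Tabblo/Tabblo-html" --warc-max-size=inf '
    '--warc-header="operator: Archive Team" '
    '--warc-header="Tabblo-site-download" -r -l inf --no-remove-listing '
    '--no-timestamping --trust-server-names --adjust-extension --span-hosts '
    'http://www.tabblo.com/studio/view/recent/'
)


def warc_header_values(argv):
    """Return the value of every --warc-header=... argument, in order."""
    prefix = "--warc-header="
    return [arg[len(prefix):] for arg in argv if arg.startswith(prefix)]


def headers_without_field_name(argv):
    """Return header values that are not in 'name: value' form."""
    return [value for value in warc_header_values(argv) if ":" not in value]


if __name__ == "__main__":
    argv = shlex.split(COMMAND)
    print(headers_without_field_name(argv))
```

Tokenizing with shlex.split also makes it easy to confirm that quoted values such as the User-Agent string arrive as a single argument.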