#archiveteam 2011-07-18,Mon

↑back Search

Time Nickname Message
00:14 πŸ”— underscor SketchCow: ping
00:14 πŸ”— underscor SketchCow: See PM please
00:14 πŸ”— underscor It's semi-urgent
00:15 πŸ”— SketchCow The thing, yes.
00:15 πŸ”— SketchCow How much data
00:17 πŸ”— underscor Like 100 gigs
00:18 πŸ”— underscor It's the starwars forum stuff
00:18 πŸ”— SketchCow Sent you the password
00:18 πŸ”— underscor Got it, thanks
00:18 πŸ”— SketchCow Put in subdirectory called STARWARS-FORUMS
00:18 πŸ”— underscor Still gives me an auth failed message
00:19 πŸ”— SketchCow Then you're doing it wrong.
00:19 πŸ”— SketchCow Paste in the channel and stop blowing it into this one
00:19 πŸ”— underscor @ERROR: auth failed on module gv_
00:19 πŸ”— underscor Password:
00:19 πŸ”— underscor alex@datalove:~$ rsync -avP starwars.tar.maybebad blindtiger.textfiles.com::gv_5
00:19 πŸ”— SketchCow Nobody needs to see this.
00:19 πŸ”— underscor Oh, sorry
00:19 πŸ”— SketchCow Excellent, that is precisely the opposite.
00:20 πŸ”— underscor ?
00:34 πŸ”— db48x howdy all
00:36 πŸ”— SketchCow Yo
00:40 πŸ”— db48x SketchCow: could you check to see which slot I'm assigned to?
00:44 πŸ”— SketchCow I need to stop uploads to those machines
00:44 πŸ”— SketchCow Give me a day or two
00:46 πŸ”— db48x ok
00:47 πŸ”— underscor SketchCow: Where are we moving?
00:54 πŸ”— SketchCow New machine, 36tb
00:54 πŸ”— SketchCow We were only using 20, but with everyone with something 20mb-2tb in random slots
00:54 πŸ”— SketchCow now it'll pool
00:56 πŸ”— underscor Oh, that's nifty
00:57 πŸ”— underscor Same thing, write-only rsync?
01:15 πŸ”— SketchCow Yes
01:45 πŸ”— underscor SketchCow: "That awesome time..." is amazing
01:45 πŸ”— underscor I just watched it in its entirety for the first time
02:20 πŸ”— dashcloud the friendster answer to what happened to my stuff is really ugly- they've could've at least dropped the "we had to" part and just said we removed instead
02:42 πŸ”— SketchCow I love the we had to.
04:08 πŸ”— Coderjoe friendster answer? where's that?
04:35 πŸ”— SketchCow Hey, gang.
04:35 πŸ”— SketchCow OK, so I am moving data around like crazy on the servers.
04:36 πŸ”— RedType did you find Secretz~
05:28 πŸ”— Nemo_bis SketchCow, Jeff wrote me to reupload that file, will it work now?
05:28 πŸ”— SketchCow He is the expert.
05:30 πŸ”— Nemo_bis Γ―ΒΏΒ½Just realized the file you mention is not there. if it is less than 2GB you can upload it to the item at http://ia600602.us.archive.org/edit.php?identifier=openwetware.org. if it's over 2GB there are options that include using the ias3 interface, an auto submit or a wget if it is on a server.Γ―ΒΏΒ½
05:31 πŸ”— * Nemo_bis doesn't understand if he means under another item
05:45 πŸ”— SketchCow He's saying the item needs to be uploaded with an s3 command
05:53 πŸ”— Nemo_bis I already did so
05:53 πŸ”— Nemo_bis There's no way you could upload a 17.5 GiB file otherwise
12:17 πŸ”— SketchCow root@teamarchive-0:/3/FRIENDSTER# cat friendster.00220xxxx.tar.bz2 | bunzip2 - | tar vtf - | tail
12:18 πŸ”— SketchCow That... that is going to take a while.
12:21 πŸ”— Spirit_ i found that using tar v can be much slower than not having it output stuff
12:29 πŸ”— SketchCow I'm just trying to verify if that file truly has 002200000-002299999
12:49 πŸ”— Wyatt Maybe it's just me, but it looks like tr.im finally bought the farm? I can't get it to resolve here...
12:51 πŸ”— SketchCow Same here.
13:40 πŸ”— ersi Spirit_: No wonder, since it has to buffer up the text
13:40 πŸ”— ersi Spirit_: Anything that outputs to a display takes lots of time more than if it wouldn't
13:55 πŸ”— Wyatt Ooooh, dear, this could get large. I'm currently spidering the moribund Soundshock forums (FM Synth music community).
13:56 πŸ”— Wyatt Predictably, it looks like there's going to contain a lot of mp3s, pdf datasheets, and images (schematics and the like).
14:20 πŸ”— ersi Awesome :)
14:54 πŸ”— Spirit_ wicked, http://geocities.yahoo.co.jp/
14:56 πŸ”— Wyatt Yup. I've been wondering about geogities.jp, actually...
15:07 πŸ”— Wyatt Sort of in the same vein as Angelfire and Tripod. I hit a site and express surprise to myself that they're still around.
15:08 πŸ”— Wyatt Has anyone done any sort of initial surveying of any of them to determine archivability?
15:08 πŸ”— SketchCow No
15:08 πŸ”— SketchCow If you want it, it's yours.
15:10 πŸ”— Wyatt Hmm, I may at that. Not /super/ sure how to approach it, but I don't believe the Geocities effort started with better info.
15:14 πŸ”— SketchCow It's the big reckoning, where I am funelling all the material into the archiveteam machine
15:14 πŸ”— SketchCow That's... it's just a lot of friendster.
15:19 πŸ”— Wyatt Friendster really was much larger than I expected for a network I'd barely heard of.
15:19 πŸ”— SketchCow It's not THAT large.
15:19 πŸ”— SketchCow It's the photos.
15:22 πŸ”— SketchCow The dumping will be going all night.
15:22 πŸ”— SketchCow Probably a few days, it'll be exciting.
15:23 πŸ”— SketchCow Something like a terabyte, just for one of these directories.
15:23 πŸ”— Spirit_ oh, i should totally run a local dns server for my robots.txt downloading
15:23 πŸ”— Spirit_ how stupid of me
15:24 πŸ”— DFJustin yahoo is a lot more popular in japan for whatever reason and is basically run separately iirc, I don't know that geocities.jp is in any special danger
15:25 πŸ”— DFJustin what blew my mind the other day is homestead.com is still up
15:25 πŸ”— Wyatt Well in any case, I'll probably look into one of those three once I've finished soundshock. Regardless of Yahoo Japan's health, losing it would be suboptimal.
15:25 πŸ”— DFJustin for sure
15:27 πŸ”— Wyatt In the mean time, since I'm apparently not going to be employed by a hosting company, I need to figure out a good way of moving a Friendsterball.
15:27 πŸ”— Coderjoe ugh.
15:28 πŸ”— Coderjoe I have 988GB (compressed
15:28 πŸ”— Coderjoe ) myself
15:28 πŸ”— Wyatt Oh bloody hell, man.
15:29 πŸ”— DFJustin what spooks me japanese internet-wise is pixiv, there is SO much stuff on there and it's all behind a login barrier so wayback will have 0% of it
15:29 πŸ”— Coderjoe I think the best way is going to be sneakernet via ups/fedex
15:30 πŸ”— SketchCow Yeah, we have to get cranking on the friendster.
15:30 πŸ”— SketchCow Shortly, I'll be rejiggering things so we can figure out next moves.
15:38 πŸ”— Wyatt DFJustin: Ooooh Pixiv. I'm going to declare right now that it's worse than you think, even.
15:38 πŸ”— Wyatt (Well, unless you've done any research on it, in which case it's probably only just as bad.)
17:52 πŸ”— underscor ndurner_o: How we doing with ggroups?
17:53 πŸ”— underscor I've been getting a lot of 501s
17:59 πŸ”— ndurner_o Discovery by topic is done
17:59 πŸ”— ndurner_o I'm currently preferring downloads over discovery
18:00 πŸ”— ndurner_o All in all, we're doing pretty good.
18:31 πŸ”— Coderjoe http://archiveteam.org/index.php?title=Special:Contributions/Alepa
18:31 πŸ”— Coderjoe O_o
18:46 πŸ”— Wyatt Is there a way to tell httrack to ignore a certain set of URI (like, say, profile.php or privmsg.php)?
19:00 πŸ”— Wyatt Okay, I think I got it. Needed to use minus to filter.
20:51 πŸ”— SketchCow 2.6T .
20:51 πŸ”— SketchCow root@teamarchive-0:/3/FRIENDSTER# du -sh .
20:56 πŸ”— ersi Jeez
20:57 πŸ”— SketchCow Long way to go.
21:08 πŸ”— Coderjoe SketchCow: is that compressed or raw?
22:10 πŸ”— Wyatt Ugh, a lot of the forum links here are for googlepages domains. Is there a simple format for rewriting all of those?
22:30 πŸ”— Wyatt They say they "migrated" everything, but... well, it looks like Adlib Underground is a lost cause.

irclogger-viewer