[00:07] Downloaded: 2421 files, 146G in 23h 52m 40s (1.74 MB/s) [00:07] Burp [00:44] http://t.co/DbnlBBFM [00:51] joey hess is fucking awesome [00:56] Yes indeedy [00:56] We'll see if my pathetic little mewing changes anything. [01:08] oh, did you mew again? [01:08] oh hell yes [01:13] SketchCow: already have new $250 backer and stuff, thanks [01:15] closure: let me give you money for working on git-annex [01:15] is there a way I can get a tshirt *and* a usb key? [01:16] chronomex: yes, see the last FAQ question [01:16] it's some special money combo [01:16] ok! [01:16] go for the big key. It might just turn out to be 32 gb instead of 16 [01:16] was intending to [01:17] <3 [01:17] SketchCow: something like $750 in last 10 minutes [01:17] shit [01:17] * chronomex gets on board [01:18] why did you limit the rewards? [01:18] various reasons.. some of them involve me talking to people.. some of them involve batch orders and I want to keep a handle on it. Limits are constantly being adjusted [01:18] aye [01:20] done [01:32] I put in my $10 a few weeks ago [01:32] err days [01:32] it only feels like weeks to me [01:49] alard: That is SO COOL [01:49] Like, really. Absolutely fantastic. [01:49] You should do a little write up on how it works, I'd love to learn more about it :) [02:12] VERY carefully deleting Fortunecity original files [02:13] why? [02:13] are you low on disk or something? [02:13] FOS shouldn't have files [02:13] It should be a waystation, it's not permanent. [02:13] I am carefully making sure the files are in the system, one by one. [02:15] alard's searchamajig would be pretty cool for filename search of cdbbsarchive [02:15] Archive.org wants it [02:15] When we have time, we can discuss making it a default thing for all collections on archive.org. [02:16] punch in "clown.gif" and get 400 hits [02:16] only 400? [02:18] that would be pretty awesome to check a box and have a "index all files in this collection" thing [02:19] http://ia601202.us.archive.org/3/items/test-memac-index-test/tabblo-zip.html the preview images are a nice touch [02:19] for a lot of collections it wouldn't really make sense [02:20] right, I mean, make it optional [02:20] yeah [02:20] now we need to implement it for the flv files in the youporn.com warcs [02:20] :D [02:20] :D [02:33] I wish I could find the art of pixelvision dvds [02:47] Fortunecity verified and up. [02:51] \o/ [02:53] And memac is slowing the fuck down. [03:01] So, when we do a kickstarter for archiveteam, donations that end up with archive.org are tax-deductible [03:01] So people who donate will get letters thanking them [03:02] so what will the be kickstarter be funding? [03:04] Good question. [03:07] seagate should donate, they'd get it back quickly enough [03:08] I found a 2TB hard drive on newegg for $120 [03:08] i think you can get an extra $10 off of that, not entirely sure though [03:10] woah, come to think of it, theres a few that're $120 on newegg... weird. did taiwan have an anti-hurricane? :P [03:16] http://edwardbetts.com/price_per_tb/internal_hdd/ best money is 3 tb now [03:16] otoh, if you're not filling up 3 tb in the next say, year, a 2 tb may be a better choice really.. [03:16] yeah, but.. WD Green.. [03:16] Green = power saver > performance [03:18] production is just slowly picking u [03:18] p [03:18] Still not where it was before hand [03:18] what was the prices beforehand [03:18] were they lower? [03:18] say, shouldn't we be talking in -bs .. [03:19] lol, lookat 1st item on http://edwardbetts.com/price_per_tb/external_hdd/ [03:19] ya i saw lol [03:20] external is still cheaper for real tho [03:20] that doesnt make sense [03:21] you pay more for a case, when obviously theres just a drive in there thats on the internal HDD list, lol [03:21] err, pay less with a case* [03:21] Pros: Opened it up and inside is a nice Seagate Barracuda ST3000DM001 3TB 7200 RPM 64MB Cache SATA 6.0Gb/s 3.5" Internal Hard Drive. [03:22] Cons: The external case clips will break when you open it. [03:22] ghaha [03:23] some externals have the USB/esata/firewire interface right on the drive [03:24] but i dont see the logic in why seagate made the external cheaper [03:25] more demand for externals [03:25] i believe that was a fluke of inventory [03:25] in before offtopic siren [03:31] [20:20:03] say, shouldn't we be talking in -bs .. [04:09] I couldn't take it. [04:09] Forgive me, whatever it is that you pray to that I'll pretend I respect [04:09] He reminds me so much of that kid that ruined a swath of geocities. [04:15] wait, someone did that? [04:17] Goodness, look who comes out of the darkness. [04:17] Hellloooooooooooooooo [04:18] Yes, a young kid got involved in the geocities grab, did a TON of grabs, and fucked up the timestamps, the parsing, and the capitalization. By the time we tried to make up for what he'd done wrong, he'd been banned from here eternally and had sniffed how HARD he had worked, of course ignoring all our information on the best way to grab things. [04:18] Archiveteam Warrior, of course, prevents this from happening again. [04:18] hmm i did some geocities mirroring independent of archiveteam back when [04:18] using wget -m -np -p [04:19] which is probably NOT to the proper standards [04:19] i still have the stuff [04:19] Well, it was a bit of amess, that project. [04:19] I'd love a copy. [04:19] How big? [04:19] lemme check [04:19] does tar properly preserve capitals and timestamps? [04:19] Yes [04:19] ok [04:19] It's the filesystem that's important [04:19] He was doing some crazy thing. [04:20] It was now nearly 3 years ago, I've forgotten. [04:20] Again, we engineered out of it. [04:20] onesec gotta sftp to my other box [04:21] ... ok i'm stupid, no i don't need to. well, i do to preserve timestamps but i can calc size from the backup [04:21] though i'll need to re-tar the original to preserve stuff [04:22] heh, I remember that kid.. [04:24] gah i'm afraid i already f***ed stuff up [04:24] but let me verify [04:24] damn, that sucks (about the batch of geocities) [04:24] and yeah, sorry been mia [04:25] insanely busy in my department [04:25] * undersco2 waits for su to notice and complain in #archiveteam-bs [04:26] I was under the impression you were leaving us after whatever project it was. [04:26] SketchCow: Was he like my age? [04:26] :D [04:26] nah [04:26] 3 external hard drives to check [04:26] didn't plan on leaving [04:26] 3 MORE i should say [04:26] just have to focus priorities (i know archiving is a high priority) [04:26] You made some amount of noise about something. [04:27] * SketchCow is now shoving 20gb of magazines in. [04:27] Then they go dark. [04:27] But I'm cleaning out FOS. [04:27] Since I just yanked in 2tb of other data. [04:29] that's a lot of magazines [04:30] SketchCow: do you plan on making a diy bookscanner for all your infocube magazines at some point? (like, waaaaay down the road. Just curious) [04:30] Why do you ask questions like this. [04:31] Let's see, there's two possible answers. [04:31] Yes. No. [04:31] If I say yes, what happens? And if I say no, what happens? [04:37] Oh yeah, DELICIOUS cranky [04:37] It's getting mentioned on boingboing [04:37] Fucking hate boingboing [04:38] And as on cue, the comments at the bottom are just a trash-rific snobfest. [04:38] "Poor speaker" "boring" "Oh god, video" [04:48] Anyway, tomorrow I find out details about the use of this building for ArchiveTeamCon II [05:32] 21:10:33 <@SketchCow> He reminds me so much of that kid that ruined a swath of geocities. [05:32] that was a disaster [05:32] don't remind me [05:48] SketchCow: I dunno, I was hoping you'd say yes or no and then elaborate on why [05:52] oh, jonas [05:53] He ended up on my skype list for some horrid reason [07:28] JONAS [07:29] * SketchCow screams and runs into the night [07:29] * ersi sounds the alarm [08:10] i dont get git annex [08:10] does it track changes of those files? [08:10] does it work like normal git for textfiles? [08:10] track changes as in let me see a diff, let me revert etc [08:23] it tracks the sha1 of big files, and makes it so not every clone has to have a copy of everything [08:23] most of what it does falls out of these two attributes [09:11] http://matkelly.com/warcreate/ http://www.cs.odu.edu/~mkelly/papers/2012_jcdl_warcreate.pdf [10:26] alard: awesome [10:29] I'm not sure how well it works. (There are several warc-writing functions in that extension.) [10:29] Some of them seem to be reusing the same hard-coded GUIDs, as well as the author's name and IP address. [10:45] Nah, the WARCreate extension: cool idea, but doesn't really work. [10:50] :( [10:59] alard: Send him a few pointers :) [13:51] http://arstechnica.com/apple/2012/06/three-things-to-back-up-before-mobileme-goes-dark-on-june-30/ [14:34] Good article. [15:05] SketchCow: ok i found ONE site i had archived from geocities but i know i have archives of much more than that [15:06] datestamps are a little screwed on this one though [15:06] lemme tar it up [15:08] yeah datestamps and username are destroyed, i don't think i wgetted this with -p, this site was done much earlier than the others [15:09] Tar it up, we set it aside in the collection [15:11] done and emailed [15:12] its possible someone else archived the same site and presrved timestamps [15:13] "please review our attachment guidelines" - wtf [15:14] one sec, lemme rename the file extension or something [15:15] resent [15:16] looks like it went thru that time, i think [15:19] YES found the big archive [15:19] tarring it up [15:20] its 480mb uncompressed [15:26] packing into a tar [15:35] hmm adding in some parts of geocities i had in different directories its 516mb [15:35] i can't email that or even upload it to sendspace; is there somewhere i can sftp that to? [15:53] Give me a tad. [15:53] Can you rsync? [15:59] Script is now combining all our Tabblo.com items into one big directory - then follows the bunching up into .tars and the uploads. [16:22] SketchCow: rsync? probably [16:22] may need some instructions though [16:22] since i've never done it befoe [16:49] LordNlptp: rsync is pretty easy; do you have the build for cygwin? [17:05] probably [17:47] Greets from the archive. [17:47] If any of you are in the SF Bay area, come to the archive at noon. Free lunch. [17:48] wish i was :( [17:59] i'm missing free food :( [18:24] tanstaafl [18:38] Someday! [18:38] Just made the initial discussions for ArchiveTeamCon II [18:54] LET ME SEE YOUR WARFACE [18:59] Who in here would come to SF in January for ArchiveTeamCon! [19:00] Fuck, that's gonna rip me a new one just starting planning it [19:02] aw, wish i could [19:04] I'd like to go [19:04] depends on exactly when in January [19:04] winr4r: What would stop you? [19:05] I bet the douchebags at the border would stop me [19:11] I would probably come [19:11] SketchCow: being a very long way from the US, me living on as little money as possible, probably not getting into the US again thanks to the last time i tried it [19:13] and the fact i get to meet you in september which would be my meeting-jason-IRL-fix forever ;D [19:18] I'd be more on it if ArchiveTeamCon would be held in Europe or somewhere else than the US, honestly [19:18] ersi: Where are you? [19:19] ArchiveTeamCon sounds amazing. I'd go if it turns out I'm able! [19:20] Sweden [19:33] Nemo_bis: whats the syntax for the wiki batch downloader? and will it seesaw? [20:07] I would hope that someone in Europe would be more interested in attending a European con than an american con. [22:47] I wouldn't say the $20,000 is assured, closure - but we're closer.