#archiveteam 2012-05-23,Wed

Time Nickname Message
00:36 πŸ”— SketchCow WHY HELLO
00:37 πŸ”— chronomex ahoy maytee
00:46 πŸ”— shaqfu ohai
00:47 πŸ”— SketchCow I wish I could have AT members hanging here
00:47 πŸ”— SketchCow I'm speaking on archive team stuff, they put me on a suite
00:47 πŸ”— shaqfu Is this Ann Arbor?
00:59 πŸ”— bsmith095 SketchCow: whens the talk going up?
01:28 πŸ”— SketchCow Oh who knows
01:45 πŸ”— SketchCow The fuck, they just dropped off a fruit and cheese plate
01:46 πŸ”— SketchCow I just walked downstairs and toured the ballroom I'm speaking at tomorrow.
01:49 πŸ”— shaqfu Were there mints on the pillows?
01:50 πŸ”— SketchCow Nah, it's marriott.
01:53 πŸ”— shaqfu Fruit and cheese, still classier than any Marriott I've been in
01:53 πŸ”— SketchCow Yeah, it's a weird marriott.
01:53 πŸ”— SketchCow And no porn, not that I'm checking.
01:54 πŸ”— SketchCow Because? two guesses.
01:54 πŸ”— SketchCow (guessing as to why there will be no porn)
01:54 πŸ”— shaqfu What city is this?
01:54 πŸ”— chronomex hmmmm
01:54 πŸ”— SketchCow Ann Arbor/Ypsilanti
01:54 πŸ”— shaqfu It's owned by evangelicals?
01:54 πŸ”— SketchCow Eagle Creek Resort, which is just a marriott with a conference center.
01:55 πŸ”— SketchCow No.
01:55 πŸ”— SketchCow I mean it is, but that's not the reason.
01:55 πŸ”— chronomex corporate purchasing card?
01:55 πŸ”— SketchCow No, this is at all marriotts.
01:55 πŸ”— SketchCow They're losing millions
01:55 πŸ”— chronomex Million Mom March or some similar bullshit?
01:55 πŸ”— dashcloud disputes over people disagreeing that they watched it?
01:56 πŸ”— shaqfu Cleaning issues?
01:56 πŸ”— SketchCow Nope!
01:56 πŸ”— SketchCow Much more insidious!
01:56 πŸ”— SketchCow MUCH more insidious
01:56 πŸ”— chronomex CONGRESS
01:56 πŸ”— shaqfu Janitors were watching?
01:56 πŸ”— dashcloud hd porn costs too much for too little return?
01:56 πŸ”— SketchCow No, and no and no.
01:56 πŸ”— SketchCow Here we go!
01:56 πŸ”— SketchCow Romney is running for president
01:56 πŸ”— chronomex ahhhhhh
01:56 πŸ”— SketchCow He's on the board of marriott
01:56 πŸ”— chronomex ahahhaa
01:56 πŸ”— shaqfu That'll do it
01:56 SketchCow They've yanked it while he's running
01:56 πŸ”— chronomex that'd do it it
01:57 πŸ”— chronomex sorry, I'm drinking at work
01:57 πŸ”— chronomex kind of intoxicated
01:57 πŸ”— SketchCow Bravo
01:58 πŸ”— chronomex my coworker's homebrew is excellent
01:59 πŸ”— dashcloud beer or liquor?
01:59 πŸ”— chronomex beer
01:59 πŸ”— chronomex speaking of, I should go stock up on liquor before they privatize all the liquor stores
01:59 πŸ”— shaqfu Pennsylvania?
01:59 πŸ”— chronomex washington
02:00 πŸ”— chronomex state stores close down at the end of the month
02:00 πŸ”— chronomex and I'm not sure if I trust grocery stores to carry applejack
02:00 πŸ”— shaqfu Ah, yeah - it'd take time for liquor stores to open up
02:00 πŸ”— chronomex sure as hell they won't sell it for cheaper
02:11 πŸ”— chronomex hahaha, I just drunkenly called the grocery store to ask whether they'll be carrying a particular liquor
02:12 πŸ”— shaqfu Do they?
02:12 πŸ”— chronomex the person who answered didn't know and said to call back tomorrow before 5 to catch the wine steward
02:13 πŸ”— chronomex D:
02:19 shaqfu Your supermarkets have sommeliers? Classy
02:19 πŸ”— chronomex hardly
02:19 πŸ”— chronomex more like "pimply 18 year old in charge of making sure hobos don't steal the wine"
02:20 πŸ”— shaqfu Rofl - a wine steward that can't drink
02:20 πŸ”— chronomex idk, I've never asked
02:20 πŸ”— chronomex they're probably old enough
02:53 πŸ”— Coderjoe s/insidious/stupid/
02:54 πŸ”— Coderjoe SketchCow in the stizzate
03:29 πŸ”— underscor Coderjoe: It is, but negligibly so.
03:29 πŸ”— underscor It's just 7z l, so it's not too expensive
03:38 πŸ”— Coderjoe eh?
03:38 πŸ”— Coderjoe oh
03:39 πŸ”— Coderjoe make me connect your seemingly-random utterance to something i said 6 hours ago, without saying anything else
03:40 πŸ”— Coderjoe 7z l on a tar file still requires parsing through the entire tar file. granted, it isn't too bad if not compressed.
03:42 πŸ”— Coderjoe but on a 100+GB tar file with a few thousand files, that can still take awhile
03:42 πŸ”— underscor sorry
03:43 πŸ”— underscor results are cached for 24h after last request too though
03:43 πŸ”— underscor so it's not as expensive if it gets popular
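The trade-off discussed above (a listing has to walk every header in the tar, but the result is kept for 24 hours after the last request) can be sketched roughly as follows. This is only an illustration using Python's standard `tarfile` module, not the actual archive.org implementation; `list_tar`, `_cache` and the `now` parameter are hypothetical names introduced here.

```python
import tarfile
import time

CACHE_TTL_SECONDS = 24 * 60 * 60  # "cached for 24h after last request"

# Hypothetical in-memory cache: path -> (time of last request, member names)
_cache = {}


def list_tar(path, now=None):
    """Return the member names of an uncompressed tar, using a sliding cache."""
    now = time.time() if now is None else now
    hit = _cache.get(path)
    if hit is not None and now - hit[0] < CACHE_TTL_SECONDS:
        names = hit[1]
    else:
        # Listing still walks every header in the archive, which is the
        # slow part on a 100+GB tar even without compression.
        with tarfile.open(path, "r:") as tar:  # "r:" = no compression
            names = tar.getnames()
    # Every request, hit or miss, restarts the 24h window.
    _cache[path] = (now, names)
    return names
```

Because the timestamp is refreshed on every request, a listing that keeps getting requested stays cached, which matches the "not as expensive if it gets popular" remark.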
03:43 πŸ”— * Coderjoe wanders off to do stuff like pack and try to get the company laptop ready
03:44 πŸ”— shaqfu Ah, phew. Makes me feel like less of a jerk for using it so much lately :P
03:44 πŸ”— underscor lol
03:44 πŸ”— underscor nah, it's fine
03:44 πŸ”— underscor we want it to be a first class citizen!
03:44 πŸ”— underscor It's one of my projects.
03:44 πŸ”— underscor one of the ones I can talk about anyway
03:44 πŸ”— underscor lol
03:47 πŸ”— shaqfu It's fantastic - it'd be impossible to item-level index without it
03:47 πŸ”— shaqfu (at least, on this collection)
03:47 πŸ”— nitro2k01 http://i.canvasugc.com/ugc/p/canvas_rytff.png
03:50 πŸ”— underscor nitro2k01: nice!
03:51 πŸ”— underscor haha
06:52 πŸ”— godane SketchCow: I just noticed the GoNintendo archive is not up to date
06:54 godane like it stops at episode 46 on archive.org but it's currently at 353
06:55 godane the worst part is the list only goes down to 329
06:55 godane but the path at least works for episodes 210 and up i think
08:43 ersi godane: good riddance
08:43 πŸ”— Coderjoe ugh
08:43 πŸ”— Coderjoe today is going to suck so bad
08:44 πŸ”— Coderjoe I didn't get any sleep, because i was tossing and turning and worrying about waking up on time
08:45 πŸ”— Coderjoe and I have a three hour drive to the other side of the state this morning, working over there for who knows how many hours, then possibly a break, then SketchCow's aadl presentation, and at some point a three hour drive home, arriving home no earlier than 2am
14:33 πŸ”— SketchCow HI GANG
14:33 πŸ”— SketchCow Working on the effort
14:33 πŸ”— SketchCow The slides.
14:42 πŸ”— ersi -!- the effort is now known as The Slides
15:20 πŸ”— alard Hi all, Tabblo needs your help!
15:20 πŸ”— alard There's a seesaw script if you want to help: http://archiveteam.org/index.php?title=Tabblo#How_to_help_archiving
15:21 πŸ”— alard For those on Windows (or unwilling to install things), try the ArchiveTeam Warrior appliance: http://archive.org/details/archiveteam-warrior
15:21 πŸ”— alard Tracker: http://tabb.heroku.com/
15:24 πŸ”— Schbirid alard: how does that stuff work? is it easy to setup?
15:25 πŸ”— Schbirid i mean the tracker and automatic assignment of task
15:25 πŸ”— alard Easy for whom?
15:25 πŸ”— Schbirid ah poop, maybe we should have used wget-warc for fileplanet
15:25 πŸ”— Schbirid me! :)
15:25 πŸ”— alard Ah, it's relatively simple, if you know what to do.
15:26 πŸ”— alard https://github.com/ArchiveTeam/universal-tracker
15:27 πŸ”— Schbirid cheers
15:53 πŸ”— SketchCow Hi everyone,
15:53 πŸ”— SketchCow I heard back from Lisa, and unfortunately Connecticut is not a popular location for SIGS. Our event, and the other events that day have very low registrations. We currently have 4 non-speaker registrants, one event was cancelled and the third event also has low numbers.
15:53 πŸ”— SketchCow hahahah
15:54 πŸ”— SketchCow So this person has been wanting me to come in and speak for, like, months
15:54 πŸ”— SketchCow They've had google voice chats to discuss how they will do each thing
15:54 πŸ”— SketchCow this, that the other thing
15:54 πŸ”— SketchCow Lot of work
15:54 πŸ”— SketchCow P.S. for free
15:54 πŸ”— SketchCow Anyway, as the note says, this conference currently has 4 attendees
15:55 πŸ”— balrog_ :'(
15:55 πŸ”— SketchCow Not the one I'm AT
15:55 πŸ”— SketchCow The one that wants me to talk in June
15:56 πŸ”— SketchCow I may have myself a free day
15:56 πŸ”— balrog_ ah ... yeah
15:59 πŸ”— SketchCow ------------------------------------------
15:59 πŸ”— SketchCow THE NEW TABBLO PROJECT IS IN EFFECT YOU PEOPLE
15:59 πŸ”— SketchCow GET THE CLIENTS RUNNING - WE ONLY HAVE A FEW DAYS
15:59 πŸ”— SketchCow -----------------------------------------
15:59 πŸ”— SketchCow http://archiveteam.org/index.php?title=Tabblo#How_to_help_archiving
16:01 πŸ”— mistym Need to get on that soon as I get home.
16:04 πŸ”— mistym Dear Mac OS X users, if you exist - brew tap mistydemeo/archiveteam; brew install wget-lua
16:14 πŸ”— pberry alard: something funky with the detection of lua on ubuntu
16:16 πŸ”— alard pberry: Oh?
16:16 πŸ”— pberry maybe I'm just doing it wrong
16:17 πŸ”— pberry but even after I was sure I had 5.1 it kept saying I needed Lua
16:17 πŸ”— pberry I'm trying the script with the exit on the check commented out
16:18 πŸ”— alard Do you also have the liblua5.1-0.dev ?
16:19 πŸ”— pberry aha
16:19 πŸ”— pberry that will probably fix it
16:20 πŸ”— pberry \o/
16:20 πŸ”— pberry alard: thanks
16:28 πŸ”— closure wow, check out all the new tabblos, all about the site going down
16:29 πŸ”— SketchCow OK, disappearing for a little - giving keynote!
16:29 πŸ”— closure one of them is 100% archiveteam http://www.tabblo.com/studio/stories/view/1850589/?nextnav=recent
16:30 πŸ”— mistym Hah!
16:30 πŸ”— mistym I like how Tabblo still has the "Want to make your own?" etc. header, like it's not about to disappear.
16:32 πŸ”— closure wget-warc needs lua now?
16:33 πŸ”— pberry how's the fortress looking?
16:38 πŸ”— closure hmm, I should be able to get quite close to tabblo.com on the network and zip thru the zip downloads
16:39 πŸ”— * closure powers up a machine..
16:40 πŸ”— alard closure: wget-warc-lua does.
16:41 closure what's the lua used for, out of curiosity?
16:43 πŸ”— alard That's where the logic is. This is very simple: https://github.com/ArchiveTeam/tabblo-grab/blob/master/dld-tabblo-user.sh
16:43 πŸ”— alard but this Lua script is doing the work: https://github.com/ArchiveTeam/tabblo-grab/blob/master/tabblo.lua
16:44 πŸ”— closure oh ok, filtering etc
16:45 πŸ”— closure btw, I hope that the embedded S3 creds are not abusable..
16:46 πŸ”— alard Shhhhhh.
16:48 πŸ”— closure ;)
16:49 πŸ”— closure is the lua support going into wget upstream?
16:56 πŸ”— beardicus every tabblo user has been 520k thus far... is this normal or an issue?
16:57 πŸ”— Aranje I've got one that was 3m, and one that was 9m
16:57 πŸ”— * Aranje fires up another downloader on another machine
16:58 πŸ”— beardicus ooh, there goes a bigger one. nm.
17:01 πŸ”— Aranje is the downloader sensible about not wiping out hd space?
17:01 πŸ”— Aranje which is to say, will it stop just before it fills a drive?
17:02 πŸ”— closure aww, wrong datacenter
17:03 πŸ”— closure darn, I *used* to be in tabblo's datacenter.
17:03 πŸ”— yipdw evicted
17:12 πŸ”— tomwsmf The VirtualBox appliance seems to be working nicely
17:21 πŸ”— closure it is pretty cool to start a download and get this 10 minutes later http://archive.org/details/archiveteam-tabblo-10
17:24 πŸ”— Aranje Can you run multiple warc's on the same box?
17:24 πŸ”— Aranje Or does that... piss off tabblo?
17:26 πŸ”— closure oh, this range stuff is confusing
17:27 πŸ”— closure so confusing
17:27 πŸ”— Aranje Sui, http://archiveteam.org/index.php?title=Tabblo#How_to_help_archiving If you have extra other boxes
17:27 πŸ”— * closure redoes from start
17:28 πŸ”— Sui yeah i was reading that
17:28 Sui i'm wondering if xenserver has a NAT network
17:28 πŸ”— Aranje closure: Do you know if we can run multiple seesaw's on the same box? Will tabblo you if that happens?
17:28 πŸ”— Aranje s/tabblo you/tabblo ban you/
17:29 πŸ”— closure haven't tried
17:31 πŸ”— Guest_ SketchCow: hahaha
17:31 πŸ”— Guest_ damnit
17:32 πŸ”— beardicus Aranje, I'm running 5 seesaws on one machine right now, no issues thus far but it hasn't been even an hour yet.
17:32 πŸ”— * Aranje nods
17:32 πŸ”— Aranje can you just run them on top of eachother?
17:32 πŸ”— Aranje same dir?
17:33 πŸ”— beardicus yes.
17:33 πŸ”— Aranje awesome
17:33 πŸ”— Sui Aranje: try running ten
17:33 πŸ”— Sui we have the bandwidth
17:33 πŸ”— kennethre better
17:34 πŸ”— Aranje lemme just spawn 10 new screen tabs
17:34 kennethre SketchCow - saws == seesaws?
17:35 πŸ”— Aranje okay that's...
17:35 πŸ”— Aranje I dunno how many
17:35 πŸ”— Aranje Enough that I've got 0-9
17:35 πŸ”— Aranje and maybe more
17:36 πŸ”— mistym btw, who admins the wiki? Registration seems to be kind of broken.
17:36 πŸ”— Sui Aranje: look at the tracker
17:36 πŸ”— winr4r has been for a while, mistym, i believe it was disabled because of spam
17:36 πŸ”— Aranje aww fucking yeeeee
17:37 πŸ”— winr4r good evening, archive team
17:37 πŸ”— mistym winr4r: It doesn't seem to be outright disabled, so much as it throws a php error
17:37 πŸ”— winr4r mistym: yes
17:39 πŸ”— DoubleJ Two questions: 1) What's lua? 2) Which version of it do I need to make the tabblo downloader work?
17:40 πŸ”— Aranje It's a programming language, and 5.1 is working just grand for me
17:40 πŸ”— closure lua is a embedded scripting language
17:40 πŸ”— DoubleJ So I need to download a language interpreter to make wget work? Do I even want to know?
17:42 πŸ”— mistym DoubleJ: You need a language interpreter to run the wget scripts.
17:44 πŸ”— yipdw DoubleJ: it's just another library
17:44 πŸ”— yipdw you don't need a Lua development environment to run wget
17:44 πŸ”— tomwsmf http://tabb.heroku.com/ Leaderboard style tracker for the tabblo project
17:45 πŸ”— yipdw codinghorror?
17:45 πŸ”— yipdw Jeff Atwood is doing AT now? :P
17:46 πŸ”— Aranje :D
17:46 πŸ”— closure lol
17:47 πŸ”— yipdw not that I'd mind, it'd be pretty cool if that actually was him
17:48 πŸ”— shaqfu Archiving with the Stars
17:49 πŸ”— yipdw next up, Joel Spolsky
17:54 πŸ”— Sui my two servers duking it out
17:55 πŸ”— Aranje lmao
17:55 πŸ”— Aranje lmao
17:55 πŸ”— Sui Aranje: your server is stuck on four instances, so it's going extra slow
17:57 πŸ”— Sui do I have to upload stuff later or is this inline
17:57 πŸ”— Aranje upload later
17:57 πŸ”— Sui just making sure
17:58 πŸ”— alard Sui/Aranje: The Tabblo seesaw script uploads as you go.
17:58 πŸ”— Aranje Oh, it does?!
17:58 πŸ”— Aranje Awesome :D
17:58 πŸ”— alard Yes, it's download - rsync - delete - repeat.
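The "download - rsync - delete - repeat" cycle described above could be sketched like this. It is a schematic only, not the real seesaw script: `claim_item`, `download`, `upload` and `delete` are hypothetical callables standing in for the tracker request, the wget run, the rsync upload and the local cleanup.

```python
def seesaw(claim_item, download, upload, delete, max_items=None):
    """Run the download / upload / delete cycle until no work is left.

    All four callables are placeholders supplied by the caller; nothing
    here talks to a real tracker or rsync server.
    """
    done = 0
    while max_items is None or done < max_items:
        item = claim_item()      # ask the tracker for the next user
        if item is None:         # tracker has nothing left to hand out
            break
        path = download(item)    # fetch the user's data to local disk
        upload(path)             # push it to the collection server
        delete(path)             # free the disk space before repeating
        done += 1
    return done
```

Since each item is deleted right after upload, disk usage stays bounded by roughly one item per running instance.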
18:22 πŸ”— Sui me and beard are duking it out while aranje is like "I'm on the moon!"
18:28 πŸ”— l-fy hello
18:28 l-fy is there any way i can ask archive.org to make a snapshot of a website?
18:29 πŸ”— shaqfu l-fy: Which site?
18:30 πŸ”— l-fy http://www.crimelecomunismului.ro
18:31 πŸ”— l-fy we will need that because the website will be changed, and for political reasons is important to have a full archive of the website today
18:32 πŸ”— shaqfu I don't know if it's possible to get a specific Wayback request, but we can grab it and put it up
18:32 πŸ”— l-fy shaqfu: that will be cool
18:33 πŸ”— shaqfu You said it'll be changed in a few hours?
18:34 πŸ”— l-fy maybe a day
18:34 πŸ”— l-fy and it's important to keep a recorded copy from a international organisation so that the backup will be believed
18:35 πŸ”— shaqfu I can't get it - no way I can grab the site in time on my line - but hold on...
18:36 πŸ”— shaqfu ===================================
18:36 πŸ”— shaqfu IF ANYONE WANTS TO BE A POLITICAL HERO
18:36 πŸ”— l-fy i can grab it
18:36 πŸ”— Sui lol
18:36 πŸ”— shaqfu WE NEED A BACKUP OF http://www.crimelecomunismului.ro/
18:36 πŸ”— Sui i already started wget --mirror
18:37 πŸ”— shaqfu FUTURE ROMANIANS WILL THANK YOU
18:37 l-fy but it has to be backed up by an international archive
18:37 πŸ”— shaqfu ========================
18:37 πŸ”— Sui that should be the right flag?
18:37 πŸ”— l-fy damn
18:37 πŸ”— shaqfu Sui: I think
18:37 πŸ”— Sui right on
18:37 πŸ”— Sui i'll let you know when it finishes
18:37 πŸ”— shaqfu Sui: Using wget-warc?
18:37 πŸ”— Sui no, should i?
18:37 πŸ”— shaqfu Yeah
18:38 πŸ”— Sui i'm new to all this
18:38 πŸ”— shaqfu It'll grab headers necessary for wayback, and it'll give added legitimacy
18:38 πŸ”— shaqfu Which seems to be an issue here
18:38 πŸ”— Sui oh
18:38 πŸ”— Sui oops
18:38 πŸ”— Sui well, should i stop it?
18:38 πŸ”— shaqfu Yeah
18:39 πŸ”— shaqfu We're time-limited here; gotta get it right the first time
18:40 πŸ”— l-fy shaqfu: how can i grab it to be ok?
18:40 πŸ”— Sui will wget-warc-lua work?
18:40 πŸ”— l-fy i can make a copy in Romania and transfer latter
18:40 πŸ”— shaqfu Sui: Hm, dunno; never used that one
18:40 πŸ”— shaqfu l-fy: Is it a different page based on location?
18:41 πŸ”— Sui it's working
18:41 πŸ”— l-fy no shaqfu
18:41 πŸ”— shaqfu l-fy: Should be fine either way, then
18:41 πŸ”— Sui and much faster on a 100mbit line than my house
18:41 πŸ”— l-fy yes, i have a 100mbit connection in .ro
18:41 πŸ”— l-fy and usually that should be faster than .us
18:41 πŸ”— Sui yeah
18:42 πŸ”— Sui grab from both, tar and diff?
18:43 πŸ”— shaqfu Hm?
18:43 πŸ”— l-fy Sui: how can i grab the website?
18:43 πŸ”— shaqfu l-fy: wget-warc
18:43 πŸ”— shaqfu Install it, point it at the site, set up options, go make a sandwich
18:44 πŸ”— shaqfu Does -m include --convert-links?
18:44 πŸ”— shaqfu And --page-requisites?
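On the question above: per the GNU Wget manual, `-m` / `--mirror` is shorthand for `-r -N -l inf --no-remove-listing`, so it includes neither `--convert-links` nor `--page-requisites`; both have to be added explicitly. A minimal sketch of building such a command line follows. The helper name `mirror_command` is made up here, and whether a given wget-warc build accepts `--warc-file` is an assumption (upstream wget gained it in 1.14).

```python
def mirror_command(url, warc_name, wget="wget"):
    """Build an argv list for a WARC-producing mirror of `url`.

    `wget` may be replaced with the path to a wget-warc binary; that the
    binary understands --warc-file is an assumption of this sketch.
    """
    return [
        wget,
        "--mirror",                   # -r -N -l inf --no-remove-listing
        "--page-requisites",          # not implied by --mirror
        "--convert-links",            # not implied by --mirror
        "--warc-file=" + warc_name,   # also write responses into a WARC
        url,
    ]
```

The list can be handed to `subprocess.run(...)` as is; keeping it as a list avoids shell quoting problems with unusual URLs.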
18:44 πŸ”— Sui Total wall clock time: 3m 36s
18:44 πŸ”— Sui Downloaded: 1466 files, 603M in 1m 53s (5.32 MB/s)
18:45 πŸ”— l-fy there seems to be just a git
18:46 πŸ”— DFJustin that site seems to have a bunch of flash links, you may want to follow those manually and make sure they get grabbed
18:47 πŸ”— Sui someone else needs to scrub, i'm terrible at noticing small links
18:47 πŸ”— l-fy :(
18:47 πŸ”— patrickg the flash items on the front page all seem to point to different hostnames. are those relevant, too?
18:48 πŸ”— l-fy which ones?
18:49 πŸ”— patrickg for example their historical photo archive: http://fototeca.iiccr.ro
18:49 πŸ”— l-fy there is just one to www.fenomenulpitesti.ro
18:49 πŸ”— l-fy yes
18:49 πŸ”— l-fy that has to be backup
18:49 πŸ”— l-fy is a part of iiccr.ro
18:49 πŸ”— Sui one moment
18:50 πŸ”— l-fy damn fototeca.iiccr.ro is one of the most important stuff :(
18:50 πŸ”— l-fy i didn't knew that is on a different website
18:50 πŸ”— l-fy i have a better link
18:51 πŸ”— l-fy iiccr.ro with any subdomain
18:51 πŸ”— l-fy www.crimelecomunismului.ro is the same as www.iiccr.ro
18:52 πŸ”— Sui what else to mirror then
18:53 πŸ”— l-fy anything that has a link from there and is in iiccr.ro domain
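The scope rule stated above ("iiccr.ro with any subdomain") amounts to a suffix check on the hostname. A small sketch of that check, with the hypothetical helper name `in_domain`; wget's own `--span-hosts` and `--domains` options express the same idea on the command line.

```python
from urllib.parse import urlsplit


def in_domain(url, domain="iiccr.ro"):
    """True if the URL's host is `domain` itself or any subdomain of it."""
    host = (urlsplit(url).hostname or "").lower()
    # Require a dot before the suffix so "eviliiccr.ro" does not match.
    return host == domain or host.endswith("." + domain)
```

Note that `www.crimelecomunismului.ro` is a different hostname and would not pass this check, even though it serves the same site, so it has to be crawled as its own entry point.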
18:53 πŸ”— Sui someone else find a use for this http://108.170.13.180/crimelecomunismului.tar.bz2
18:55 πŸ”— patrickg that site also generates its menu via javascript (see http://cdn.iiccr.ro/menu/ieiiccr_menu_ro.js)
18:55 πŸ”— l-fy Sui: i'm getting the backup from you
18:55 πŸ”— Sui ok
18:55 πŸ”— Sui i'm also grabbing fototeca
18:56 πŸ”— l-fy superb
18:56 πŸ”— l-fy thank you
18:57 πŸ”— Sui i'm probably gonna get in trouble for this, thank goodness i have a company in front of me
18:57 πŸ”— Sui note: my company
18:57 πŸ”— Sui shall i grab fenomenu
18:57 πŸ”— shaqfu l-fy: So what exactly is happening that puts the site at risk? Regime change?
18:58 πŸ”— l-fy shaqfu: yes
18:58 l-fy we got back a communist government
18:58 πŸ”— patrickg seems like they're merged with some other institute?
18:58 πŸ”— l-fy and they've just removed the president of the institution behind this website
18:59 πŸ”— l-fy and that happened today
18:59 πŸ”— shaqfu Gotcha; hence the rush
18:59 πŸ”— shaqfu Sui: Once you get everything, upload to archive.org
18:59 πŸ”— l-fy patrickg: that announcemnt doesn't have a date
18:59 πŸ”— l-fy however, since the most important job of this institution was the website :)
18:59 πŸ”— l-fy and promoting the website :)
19:00 πŸ”— Sui oh god i'm mirroring the .ro cia
19:00 πŸ”— Sui if i didn't love the internet any more than this
19:00 πŸ”— shaqfu l-fy: Just letting you know, AT isn't affiliated with archive.org, so if that's a major issue, let us know
19:00 πŸ”— l-fy Sui: what?
19:00 πŸ”— l-fy no
19:01 πŸ”— shaqfu So if it absolutely must be an international org, you might want to contact elsewhere - but I don't think they'll work fast enough
19:01 l-fy this is an institution that studied how communism destroyed Romania, and provides documents and reports
19:01 πŸ”— Sui ok
19:01 πŸ”— l-fy shaqfu: i was thinking that archive.org can do a great job
19:01 πŸ”— l-fy since they've mirrored some of the website
19:03 πŸ”— shaqfu l-fy: They'll do a fantastic job hosting it, but if legitimacy's an issue, we're not archive.org
19:03 πŸ”— l-fy right now i'm concerned just to not lose anything
19:04 πŸ”— Sui it's still important to have a mirror
19:05 πŸ”— patrickg Sui: does it help if I provide you the links of that js-menu as entry points to your crawler?
19:05 πŸ”— Sui i'm just using wget-warc, but sure
19:05 πŸ”— patrickg Sui: sorry, I don't know how smart that tool is
19:06 πŸ”— l-fy ok, so Sui i'm downloading your archive
19:07 πŸ”— l-fy for my personal archive
19:07 πŸ”— l-fy now, how can i get archive.org to archive this?
19:07 πŸ”— patrickg Sui: http://pastebin.com/ZrG79GvB - extracted from the english and romanian menu-js. the menus might point to places that aren't accessible otherwise (no idea)
19:08 πŸ”— shaqfu l-fy: Upload it
19:08 πŸ”— l-fy shaqfu: how can i do that?
19:08 πŸ”— Sui oh
19:08 πŸ”— Sui patrickg: i got those dirs
19:08 πŸ”— Sui filled with pdfs
19:09 πŸ”— shaqfu l-fy: Register here: http://archive.org/account/login.createaccount.php
19:10 πŸ”— shaqfu Then, in the top right, upload button
19:10 πŸ”— patrickg l-fy: what about http://www.youtube.com/user/iiccmer/?
19:11 πŸ”— l-fy o jeez
19:11 πŸ”— l-fy i will ask about backuping that
19:13 πŸ”— shaqfu https://github.com/rg3/youtube-dl
19:14 πŸ”— l-fy damn, how do i get a git?
19:14 πŸ”— l-fy i only know svn and cvs
19:15 l-fy and google doesn't work because my bandwidth is full
19:15 πŸ”— patrickg l-fy: git clone git://path (or whatever protocol you use)
19:15 πŸ”— mistym l-fy: You can download the source as a tarball from here if you don't want to clone w/ git: https://github.com/rg3/youtube-dl/downloads
19:15 πŸ”— Sui http://108.170.13.180/fenomenulpitesti.tar.bz2
19:16 πŸ”— Sui tiny 22M website
19:16 πŸ”— l-fy Sui: thank you dude
19:16 πŸ”— l-fy aha
19:16 πŸ”— l-fy clone
19:16 πŸ”— l-fy Sui: did you got fonoteca?
19:16 πŸ”— Sui i can tell you right now, fototeca is humongous
19:16 πŸ”— Sui it's still going
19:16 πŸ”— l-fy :(
19:16 πŸ”— l-fy o shit
19:19 πŸ”— Sui there's actually an archive of it from 2010 on archive.org
19:19 πŸ”— mistym Which may mean there's a more recent crawl that hasn't gone public yet.
19:19 πŸ”— Sui should i keep crawling or should i save these people bandwidth
19:20 πŸ”— shaqfu Sui: Keep going; might as well get the most recent possible version of the site
19:20 πŸ”— Sui 361M fototeca.iiccr.ro/
19:20 πŸ”— mistym Yeah, there might be a gap between whatever the last version the wayback machine has and what you're getting now.
19:20 πŸ”— Sui current status
19:20 πŸ”— Sui also
19:20 πŸ”— Sui we just rolled under 200k on tabblo
19:21 πŸ”— Sui me and Aranje launched seesaw on my two servers
19:23 πŸ”— l-fy thank you, thank you guys
19:24 πŸ”— * l-fy apreciate a lot all the effort
19:24 πŸ”— mistym l-fy: It's really awesome you're doing this.
19:24 πŸ”— l-fy mistym: no, just that i have a hard time to forget my first 10 years of life
19:24 πŸ”— mistym I understand.
19:25 πŸ”— chronomex l-fy: did you do something horrible then?
19:26 πŸ”— l-fy chronomex: no, just that .ro was under the communist regime
19:26 πŸ”— chronomex ahh
19:26 πŸ”— mistym l-fy: I mean, it's very good to get this done while there is time.
19:27 πŸ”— l-fy mistym: someone has to do it
19:28 πŸ”— l-fy and now i don't live there anymore
19:29 πŸ”— l-fy and i live in SF
19:29 πŸ”— l-fy i actually been yesterday at internet archive for a meeting
19:29 πŸ”— shaqfu l-fy: The PDA one?
19:29 πŸ”— chronomex neato
19:29 πŸ”— l-fy no
19:29 πŸ”— l-fy something with burning man
19:30 πŸ”— shaqfu Hm, is this the first time AT's dealt with regime change?
19:30 πŸ”— l-fy burning man it stuff meeting thing
19:31 πŸ”— chronomex actually I think the answer is no, shaqfu
19:31 πŸ”— l-fy AT's?
19:31 πŸ”— chronomex we did some archiving of egypt
19:31 πŸ”— patrickg l-fy: archiveteam
19:31 πŸ”— shaqfu chronomex: Ah, neat
19:33 πŸ”— l-fy is something different
19:33 πŸ”— l-fy because .ro is part of the UE
19:33 πŸ”— l-fy but
19:34 l-fy these traces may disappear
19:35 πŸ”— Sui 570M fototeca.iiccr.ro/
19:35 πŸ”— l-fy crap
19:36 πŸ”— l-fy ok, i will backup that latter
19:36 πŸ”— l-fy i have to go to work
19:37 πŸ”— Sui we're in the same timezone, so i shouldn't miss you coming back
19:37 πŸ”— l-fy Sui: can you give me the link please?
19:37 πŸ”— Sui foto is still running
19:38 πŸ”— Sui it's currently mirroring the letter M
19:38 πŸ”— Sui it looks like it's going in alphabetical order
19:39 πŸ”— shaqfu Sui: Do you know what to do to upload?
19:40 πŸ”— shaqfu And it might be best to wait for l-fy to get back to write a desc, since he's familiar with it
19:40 πŸ”— Sui i'll pm him the link
19:41 πŸ”— shaqfu Awesome
19:41 πŸ”— l-fy Sui: i'm a she :)
19:41 πŸ”— l-fy thank you
19:42 πŸ”— Sui no problem
19:42 πŸ”— l-fy the latest backup from archive.org for iiccr.ro is april 2011
19:45 πŸ”— Sui wow, we should be done with tabblo within the day
19:45 πŸ”— Sui we went from 200k to 198k in 24 minutes
19:46 πŸ”— Sui well, 199
19:48 πŸ”— mistym Nice!
19:49 πŸ”— mistym You guys may have this done before I can even start my computer on it when I get back from work. :V
19:49 πŸ”— Sui if we keep going at this speed, it should be done in 5 hours by my horrid math
19:49 πŸ”— Sui wait no
19:50 πŸ”— Sui erase that from the internet
19:50 πŸ”— Deewiant 24 minutes times 200 = 80 hours
19:50 πŸ”— Sui ^
19:50 πŸ”— Sui pardon my brain being fried
19:50 πŸ”— godane getting hakin9 magazine
19:51 πŸ”— l-fy good
19:51 πŸ”— l-fy i've grabbed the big one
19:51 πŸ”— Sui you should steel yourself for the image dump
19:51 πŸ”— Sui it's gonna be at least 1.5gb
19:52 πŸ”— l-fy this has 500MB
19:52 πŸ”— Sui 799M fototeca.iiccr.ro/ currently
19:54 πŸ”— Sui oh, I should have said five days, not five hours
19:55 πŸ”— Aranje Sui, how do I get my screen back
19:55 πŸ”— Aranje you're attached still
19:55 πŸ”— Sui -Udr
19:55 πŸ”— Aranje tks
19:56 πŸ”— l-fy i have to go now
19:56 πŸ”— closure alard: with dld-tabblo-zip.sh, can I delete data/ after running and rerunning? I assume it's been uploaded
19:56 πŸ”— l-fy i'll be back latter
19:56 πŸ”— Sui i'll get you that file linked
19:58 πŸ”— Aranje hmm
19:58 πŸ”— Aranje I want to know what makes them slow to a crawl after a while
19:58 πŸ”— l-fy bye
19:58 πŸ”— Aranje restarting the scripts fixes it, but it just stops downloading stuff after a bit
19:58 πŸ”— Sui it stops on certain people
19:58 πŸ”— Aranje hmm
19:59 πŸ”— Sui i've got 20 right now
19:59 πŸ”— Aranje I just restarted my 10
19:59 πŸ”— Sui they do it from the beginning too
19:59 πŸ”— alard closure: Yes, you can delete it. If you're running more than one instance you should check which directory you remove.
19:59 πŸ”— Sui i hope whoever owns that rsync server is ok
20:01 πŸ”— Aranje I wonder if there's a way to add in a bit of logic to have it move on to a new user if it sits there for more than x seconds
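One way to get the "move on after x seconds" behaviour asked about above is to run each per-user download as a child process with a wall-clock limit. This is a sketch under that assumption, not a change to the actual seesaw script; `run_with_timeout` is a hypothetical helper.

```python
import subprocess


def run_with_timeout(argv, seconds):
    """Run a command; return False if it had to be killed for taking too long.

    A caller could skip to the next user on False and retry the stalled
    one later.
    """
    try:
        subprocess.run(argv, timeout=seconds, check=False)
        return True
    except subprocess.TimeoutExpired:
        # subprocess.run kills the child before raising this exception.
        return False
```

A wall-clock limit is cruder than detecting a stalled transfer, since a large but healthy download would also be cut off, so the limit would need to be generous.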
20:01 πŸ”— Sui we're currently downloading at 1k per 20 minutes
20:01 πŸ”— Sui Total wall clock time: 1h 6m 13s
20:01 πŸ”— Sui Downloaded: 49167 files, 775M in 7m 31s (1.72 MB/s)
20:01 πŸ”— Sui fototeca mirrored
20:01 πŸ”— Aranje 198x20 is?
20:01 πŸ”— Aranje 4000?
20:03 πŸ”— Sui at this speed, 66.66 hours
20:03 πŸ”— Aranje so 10.5 days or so
20:03 πŸ”— Aranje err
20:03 πŸ”— Aranje 5.5
20:03 πŸ”— Aranje I can't math lol
20:03 πŸ”— Aranje which is under the 10 we have, hopes
20:03 πŸ”— Sui don't worry, i said five hours earlier
20:03 πŸ”— Aranje off by a zero!
20:03 πŸ”— Aranje :D
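The back-of-envelope estimates traded above all follow the same formula: remaining users (in thousands) times minutes per thousand, converted to hours. A tiny sketch, assuming the rate stays constant; the function name is invented here.

```python
def eta_hours(remaining_thousands, minutes_per_thousand):
    """Hours left at a constant rate of `minutes_per_thousand` per 1k users."""
    return remaining_thousands * minutes_per_thousand / 60
```

With the figures quoted in the channel, `eta_hours(200, 24)` gives 80 hours and `eta_hours(198, 20)` gives 66 hours, which is a little under three days.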
20:03 πŸ”— Aranje if we had more machines we could drop that a bit
20:04 πŸ”— Sui *more screen sessions
20:04 πŸ”— Aranje should I launch another?
20:04 πŸ”— Aranje :>
20:04 πŸ”— Sui i've got two full right now
20:05 πŸ”— Sui if you look at the graph, you can tell right where i launched the next ten about eleven minutes ago
20:07 πŸ”— Aranje Check it
20:07 πŸ”— Sui whoa
20:07 πŸ”— Sui look at the count
20:08 πŸ”— Aranje I know
20:08 πŸ”— Sui 1k in 10 minutes
20:08 πŸ”— Aranje Yeah I'm not even going to talk about how many screen windows are open
20:08 πŸ”— Sui it's glorious
20:08 πŸ”— Aranje mainly because I have no fucking idea
20:10 πŸ”— Sui Aranje: look now
20:11 πŸ”— Aranje I see that :D
20:11 πŸ”— Aranje We gonna have a war to see who can launch more seesaws?
20:11 πŸ”— Aranje :D
20:11 πŸ”— Sui just don't break the server
20:12 πŸ”— Sui two per second
20:14 πŸ”— Aranje so yeah, if you guys have the resources, you can run as many of these as you want
20:14 πŸ”— ersi Are you sure you're getting real responses and not just error pages?
20:14 πŸ”— ersi We've broken services before.
20:14 πŸ”— Pronoiac This might be useful for someone else:
20:14 πŸ”— Pronoiac Hey, the Tabblo version of wget-warc, with Lua, has a dependency that's not in the default Ubuntu install: liblua5.1-0-dev
20:15 πŸ”— ersi So, install it? :P
20:15 πŸ”— Sui what version of ubunu
20:15 πŸ”— Sui *ubuntu
20:15 πŸ”— Pronoiac Let me check.
20:16 πŸ”— ersi Pronoiac: do 'lsb_release -a'
20:16 πŸ”— Pronoiac I just looked at /etc/issue.
20:16 πŸ”— Pronoiac It's running 10.10.
20:17 πŸ”— Sui that's why
20:17 πŸ”— Sui too old
20:17 πŸ”— ersi that's almost two years old, mate
20:17 πŸ”— Pronoiac Yup.
20:17 πŸ”— Pronoiac I hadn't noticed, but yup.
20:18 πŸ”— Pronoiac My bad; I'd thought it might help someone else.
20:19 πŸ”— ersi Heh :)
20:19 πŸ”— Sui tabblo speed is now 6 minutes per 1k users
20:20 πŸ”— Sui less than a day left, 19 hours
20:20 πŸ”— Pronoiac Has anyone estimated a completion date for the MobileMe mirror lately?
20:25 πŸ”— ersi Yeah, 30 June.
20:25 πŸ”— mbeckler Hello, just started helping with the tabblo archival - Just to confirm, I can run multiple instances of seesaw.sh in the same directory without things breaking?
20:26 πŸ”— Deewiant Yes.
20:26 πŸ”— mbeckler well that is just excellent, thanks
20:28 πŸ”— Sui i've got something like 60
20:28 πŸ”— Sui i lost count
20:33 πŸ”— Sui i think we're at tabblo's limit
20:34 πŸ”— alard Well, we'd like to have useful data, not errors.
20:36 πŸ”— xarph welp, I tried to bid on this http://www.rrauction.com/bidtracker_detail.cfm?IN=309 for release to the public domain or at least archive.org, but I've been pushed out by The Old Money :/
20:37 πŸ”— xarph I know it's not digital but I thought it would be of interest
20:45 πŸ”— shaqfu xarph: Surprised that's not at SI/NARA
20:50 πŸ”— shaqfu And it's tempting - hell of a treasure at $2000
20:50 πŸ”— xarph be my guest, I bow out of an auction when it reaches how much I pay for rent
20:50 πŸ”— xarph except I don't know if you can register now, I believe the cutoff was a while ago
20:52 πŸ”— shaqfu He's not in the JSC oral history archives, hrm
20:54 πŸ”— shaqfu And neither is it in NASA's oral history holdings...
20:54 πŸ”— xarph well this was dictation for a for-profit book, so I doubt it fell under any rules for government archiving
20:54 πŸ”— shaqfu I wasn't sure if he donated it afterwards or not
20:55 πŸ”— shaqfu Ick, requires references?
20:56 πŸ”— chronomex ?
20:57 πŸ”— shaqfu Eh, too much trouble, esp. since it's about to close
21:07 πŸ”— mbeckler Not sure who set up the awesome virtual machine image, but I just downloaded it today and when I tried to do the tabblo, I'm getting errors like "./wget-warc-lua: error while loading shared libraries: liblua5.1.so.0: cannot open shared object file: No such file or directory\nERROR (127)"
21:08 πŸ”— mbeckler I'm running linux so I just installed the tabblo stuff outside the VM and it's working just fine, but thought I would report that something is weird with the warrior VM's tabblo setup
21:08 πŸ”— ersi Thanks for tellin'
21:09 πŸ”— alard mbeckler: Thanks for the report. It's supposed to download and install liblua5.1.so, but apparently it doesn't in your case.
21:09 πŸ”— mbeckler that's what I figured, but didn't know much about how to diagnose it
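For diagnosing this kind of "cannot open shared object file" error, one quick check is whether the dynamic linker can find the library at all. The sketch below uses Python's standard `ctypes.util.find_library`; the helper name `has_shared_library` is hypothetical, and the lookup behaviour depends on the platform.

```python
from ctypes.util import find_library


def has_shared_library(name):
    """True if a shared library called `name` can be located on this system.

    `name` is given without the "lib" prefix or the ".so" suffix, for
    example "lua5.1" for liblua5.1.so.0.
    """
    return find_library(name) is not None
```

If `has_shared_library("lua5.1")` is false on the machine running wget-warc-lua, the Lua 5.1 runtime library is missing and needs to be installed before the binary can start.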
21:12 πŸ”— Pronoiac mbeckler: Can you run apt-get? oh.
21:20 πŸ”— ersi Does anyone that watches have any idea how to grab video from justin.tv?
21:25 πŸ”— xarph okie doke, we're back in the game on the aldrin tapes auction (http://www.rrauction.com/bidtracker_detail.cfm?IN=309). it's currently at $1840. My max proxy bid is currently at $2244. Bids after that are $2716, $3287, $3978, and then it gets ridiculous. I'll keep the channel advised of the status. If we win, hopefully you can find me some good people in silicon valley that will be able to digitize these tapes correctly. :>
21:26 πŸ”— S[h]O[r]T ersi what OS are you on?
21:26 πŸ”— xarph Then comes the fun part, which is getting clearances to put them back into the public domain where they belong.
21:31 πŸ”— shaqfu xarph: I know archives in Philly/NYC that can handle them and want to, but that's the wrong side of the nation
21:32 πŸ”— ersi S[h]O[r]T: Linux, Ubuntu mostly
21:48 πŸ”— S[h]O[r]T there is rtmpdump, ive tried it before with some success but it can be complicated. also not sure if the streams are protected
21:48 πŸ”— S[h]O[r]T there is streamtransport for windows which works great for flash stuff if the stream isnt protected
21:48 πŸ”— l-fy hi
21:48 πŸ”— S[h]O[r]T http://code.google.com/p/get-flash-videos/ that too apparently
21:54 πŸ”— godane i'm uploading welcome to the scene videos
21:54 πŸ”— godane only version2.0 or season 2.0 right now
21:55 πŸ”— godane item id will be welcometothescene_version2.0_xvid
21:55 πŸ”— godane i also added the theme song Maylynne Catch Me
21:56 πŸ”— xarph I am now high bidder at $2716
22:04 πŸ”— S[h]O[r]T i remember that series hehee
22:16 πŸ”— godane only three episodes are on archive.org
22:17 πŸ”— godane so i'm putting up the full season 2 as one item to make things easier for me
22:17 πŸ”— shaqfu l-fy: Would you like to write up a desc of the .ro site?
22:17 πŸ”— l-fy shaqfu: sure
22:17 πŸ”— l-fy where can i do it?
22:18 πŸ”— shaqfu l-fy: Do you have everything you want to upload yet?
22:18 πŸ”— l-fy no
22:18 πŸ”— l-fy i don't have fototeca and fenomentul pitesti
22:19 πŸ”— shaqfu Once you start uploading stuff, you can add it to the desc section of the upload screen
22:19 πŸ”— shaqfu Or, write something collection-level
22:20 πŸ”— godane i also want to get hak5 season 1-3 uploaded to archive.org
22:21 πŸ”— godane since finding those is going to get hard as time goes by
22:57 πŸ”— xarph now bidding $3287
23:00 πŸ”— Sui l-fy: did you get the links in your email?
23:00 πŸ”— l-fy yes Sui
23:00 πŸ”— l-fy thank you
23:01 πŸ”— Sui no problem, you said you didn't get them earlier
23:01 πŸ”— Sui the photo site was smaller than i expected
23:03 πŸ”— l-fy more than 700MB?
23:03 πŸ”— Sui 770
23:03 πŸ”— Sui if compressed a bit better, it'd fit perfectly on a cd-r
23:05 πŸ”— xarph now at $3978 with a max at $5826. If it gets above that… I can't keep bidding.
23:05 πŸ”— chronomex <3
23:07 πŸ”— l-fy xarph: what you want to buy?
23:07 πŸ”— xarph http://www.rrauction.com/bidtracker_detail.cfm?CFGRIDKEY=309
23:08 πŸ”— xarph if I win (and it can be legally cleared), the contents will be made public domain
23:08 πŸ”— xarph but that's step… 40.
23:08 πŸ”— xarph currently on step 1
23:08 πŸ”— chronomex not 40, probably more like 6
23:08 πŸ”— xarph well you guys are the experts on these things. :>
23:09 πŸ”— xarph I'm just a space memorabilia nut
23:09 πŸ”— chronomex acquire, digitize, publish ... 3 steps, I'm probably missing something.
23:10 πŸ”— xarph make sure the tape contents aren't covered by the copyright for the published memoir they were used for, get permission from aldrin in writing, clear it legally in case there are any libel issues...
23:10 πŸ”— dashcloud well, as long as you get the tapes, nothing else really matters
23:10 πŸ”— chronomex the 2nd is the only one I'd really worry about
23:10 πŸ”— dashcloud they can be digitized, and then shoved into a dark archive if need be
23:11 πŸ”— xarph yah, but like I said… step one :>
23:11 πŸ”— chronomex please, for the sake of history, even if you can't get clearance, please do shove into archive.org and work with them, they can keep it dark for a long time
23:11 πŸ”— dashcloud so no one will see them until the end of time, but they'll be well cared for
23:11 πŸ”— chronomex but you probably already know this
23:11 πŸ”— xarph yep.
23:13 πŸ”— shaqfu Looks like it's getting out of range :(
23:13 πŸ”— xarph We'll see
23:15 πŸ”— shaqfu What time does it end?
23:15 πŸ”— xarph right now, 19 minutes away. If there's another bid, the clock is reset to 30 minutes.
23:16 πŸ”— LordNlptp i often get stuck at the acquire step
23:16 πŸ”— LordNlptp lot of stuff here needs digitizing and/or publishing
23:16 πŸ”— shaqfu I'll check back in 20 minutes, then
23:17 πŸ”— shaqfu If it goes through, you'll have the $ tonight
23:18 πŸ”— shaqfu And we'll start tracking down people in the SF/NY area to handle these
23:18 πŸ”— xarph another bid, $4814. One more bid and I'll have to drop out unless archive team can raise some more.
23:19 πŸ”— xarph I could completely empty my savings to chase this but...
23:19 πŸ”— shaqfu $5826
23:19 πŸ”— xarph that's me
23:19 πŸ”— shaqfu It's extraordinary but...I'm done after $1k :(
23:19 πŸ”— shaqfu Ah, okay
23:19 πŸ”— xarph next bid will end me :/
23:21 πŸ”— Sui somewhere else, another archivist is like damn big money
23:21 πŸ”— xarph ahahaha
23:21 πŸ”— xarph watch jason scott come in here LOOK WHAT I WON
23:21 πŸ”— chronomex >.>
23:21 πŸ”— shaqfu Kinda funny - he's away all day, and it's been a big day for AT
23:22 πŸ”— shaqfu Between this and the .ro site
23:22 πŸ”— Sui that was actually kinda fun
23:22 πŸ”— shaqfu While you were out: SAVED HISTORY, TWICE
23:22 πŸ”— Sui i enjoyed watching my screen scroll
23:23 πŸ”— Sui also how tabblo is going at hyperspeed
23:25 πŸ”— godane we need archive.org version at home
23:25 πŸ”— godane i hope we get 60tb disks in five years
23:26 πŸ”— shaqfu Sui: If you have a lot of bandwidth/storage, #fireplanet could use you \o/
23:26 πŸ”— godane shaqfu: I have all magazines of hakin9
23:26 πŸ”— Sui i'm starting a hosting company, so any extra bandwidth and storage are open for archival use
23:26 πŸ”— godane just under 500mb too
23:27 πŸ”— shaqfu godane: Awesome
23:27 πŸ”— Sui Aranje is in on it, he guided me here
23:27 πŸ”— shaqfu Sui: #fireplanet, then :)
23:28 πŸ”— Sui oh, fileplanet
23:28 πŸ”— Sui i thought that got a reprieve
23:29 πŸ”— shaqfu Not that I'm aware of
23:30 πŸ”— Sui when we're done with tabblo i'll get cracking on that
23:30 πŸ”— Sui i'm currently maxing out my hdd i/os
23:30 πŸ”— shaqfu Take your time - it's not super-critical
23:32 πŸ”— shaqfu We've been at it for almost a month now
23:32 πŸ”— shaqfu xarph: :(
23:34 πŸ”— xarph That's it, then :(
23:35 πŸ”— xarph At least we had a shot at it.
23:35 πŸ”— xarph And we know it exists.
23:35 πŸ”— shaqfu Yep
23:35 πŸ”— shaqfu Anyone else want in on this? Last chance
23:35 πŸ”— shaqfu The future will write sordid love letters to you
23:36 πŸ”— shaqfu I just hope it's going to SI/NASA/NARA/Purdue/APS/etc
23:36 πŸ”— xarph the next bid is $7050, plus there's an 18% commission. I'd need at least $2500 more to have a fighting chance. But who knows...
23:36 πŸ”— xarph Yeah no kidding
23:37 πŸ”— shaqfu If it's for someone to put into a vault or listen to a few times, well, RIP tapes
23:37 πŸ”— shaqfu Esp. if they listen to it w/o digitizing
23:37 πŸ”— godane got season 2 of welcome to the scene up: http://archive.org/details/welcometothescene_version2.0_xvid
23:38 πŸ”— DFJustin need to do filenames like 00, 01, 02 I think
23:38 πŸ”— godane also i'm getting season 1 of that
23:43 πŸ”— godane season 1 episodes are like going at 40kbytes a sec
23:44 πŸ”— godane i'll most likely call the item welcometothescene_version1.0_xvid
