[00:00] It's so much more polite when respectable organizations extend the middle finger. It's backed in politically correct prose and lawyers. [00:17] do these people think we somehow hacked into the servers and got files that were not already publicly accessable? [00:18] yes [00:18] probably [00:18] the world is full of people who don't understand things [00:18] let it be [00:32] mc [00:32] ups. sorry, wrong window [00:40] I wish we could tell people like that to fuck off [02:30] We're almost down to the end of 1.0 run of PicPlz!! [02:30] http://picplz.heroku.com/ [02:31] This is what the archiveteam warriors look like as they arrive at the picplz servers: http://www.youtube.com/watch?v=lvAG2D7SvCo&t=0m22s [02:34] $1300 to a year of git-annex! [03:11] i just noticed that the game shows what the camera sees up by the score [03:14] yes [03:14] SketchCow: btw, did you notice the Get Lamp shout out in my video? :) [03:14] No. I'll look again. [03:14] heh [03:16] Also: you should do an update now. [03:16] yes, coming soon [03:16] Because 4 days is enough for people to spread the news. [03:17] I was hoping to have the first milestone usable, but the code's not quite there yet [03:17] Do it right, not for show [03:18] Also: Best pronounciation of Box ever [03:18] I do not see the get lamp shoutout at all [03:18] I pronounced box? [03:18] well, the lamp's not golden :) [03:18] I see. [03:19] That's a shout out? [03:19] weak one :) [03:19] more like a nod? [03:19] yeah, nod [03:19] yeah, I'm doing it right: http://git-annex.branchable.com/design/assistant/blog/ [03:19] freaking race conditions [03:19] ha ha [03:19] I was going to see about maybe swinging by next week [03:20] Holy fuck, you are in a nowhere that makes nowhere go 'yeah, that's nowhere' [03:20] Backed [03:20] what, you want the $1500 reward I chickened out of? Weekend at the offgrid cabin? :) [03:20] Hurrah for clouds that don't suck [03:20] Why chicken out? [03:20] well, chance of random psycho with $1500 [03:21] I'd be glad to host you. Airport code TRI, then a 1.5 hour drive :P [03:21] Damn, I'd pay $1500 for that [03:21] heh [03:21] I love being nowhere <3 [03:21] I have a friend in the new mexico mountains [03:21] nearest town is like 8 miles away [03:22] it's AWESOME [03:22] :D [03:22] I bet it's wonderful out there at night [03:22] that sounds nice.. while I do live in possibly the most remote (and certianly the most econonically depressed) county in Tennesse, it sometimes feels a little close to civilazation [03:22] Aranje: freaking east coast skyglow [03:22] :( [03:22] Only ever met one person that went off the grid; she picked up one day, drove across the continent, did some archival research, then drove back [03:23] not as bad as some east coast, but I've seen real dark in Honduras, so I'm picky [03:23] Which is the kind of crazy awesome shit you can only do off-grid [03:23] I like montana's dark [03:23] Oh man, the Bronycon documentary is going to break $270k [03:23] on my todo list, for sure [03:24] thanks syhaqfu! [03:24] Aranje: It's great :D [03:24] He works at an observatory [03:24] never underestimate the POWER of the brony side [03:24] I spent a week there [03:24] It's great for, I dunno, centering yourself or something [03:24] Being a fire watcher in Montana is on my list of `summer jobs to do` [03:25] Man, look at that awesome fire [03:25] for me it's great for concentration [03:25] closure: Awesome, I got the super-secret "closure camgirl thanks in IRC" backer level :D [03:25] Like, it's very humbling to see SO MANY stars and know you're the only human for tens of miles [03:25] Heh. And awesome. [03:25] Because you're made up of those stars :D [03:25] :D [03:26] We should have the archiveteam con in the middle of the nm mountains [03:26] oh, also, get this. [03:26] there's a major fiber conduit that runs right nearby [03:26] I'm going to a con next year in a Swiss Chalet :) [03:26] haha, middle of fuckall with 10gbe [03:26] yeah... that doesn't sound like the setup for some horror movie at all... [03:26] so he gets 50/25 internet tapped off of it [03:26] he could get a full gigabit, but it's like $200 a month [03:27] ok, I'm adding fiber backbone map to my househunting checklist [03:27] a little dribble he's got there, but it'll do [03:27] and 50mbps is plenty to download enough porn to last you forever [03:27] I'd download old maps [03:27] Someday when I have money for more drives, I will be a source of old maps. [03:28] Aranje: Do you physically collect? [03:28] I haven't found any yet, but I haven't looked hard. If given the opportunity the answer would be yes. [03:29] I've recently checked out the garage here at my house and found hundreds of very old books too. Fun times. [03:30] Aranje: Either get them appraised or donate them out of the garage [03:30] http://goo.gl/maps/Gdli just north of weed, nm [03:30] Off-on-topic, I saw a conversation today that might pique #archiveteam [03:31] Unfortunately, I don't believe they're mine to give away. They're the landlords. His... aunt? died here 20 years ago and they chucked her shit in the garage and never looked at it again. [03:31] She has a fantastic collection of porcelain dolls as well [03:31] Aranje: Tell him to do something with them. Sell them to dealers and get a few (thousand) bucks, or donate like a boss [03:32] If I'm reading your hostmask right, OSU might be interested [03:32] Mmmk. I figured I'd get rid of (donate, what have you) the filthy romance and dirty old woman books, but there are a few here from the early 1800's I'd like to keep or scan. [03:32] haha it still says I'm in medford [03:32] Hold up - the trashy fic is the *good* stuff [03:32] fucking charter [03:33] I'm in san luis obispo, ca [03:33] (cal poly is the local uni) [03:33] Since it's underrepresented - nobody saved it, it's usually on shit paper, etc [03:33] huh. [03:33] I'll keep that in mind. They're all paperbacks [03:33] Wait, from when? [03:34] some were ravaged years ago by bugs, but I've kept and cleaned the usable ones (pretty low bar for usable) [03:34] 30s-50s for trashy books [03:34] Oh, hm. Not as exciting as I thought [03:34] dime and quarter books [03:34] I thought you meant they were *all* 19th century [03:34] nah [03:34] just the fancy old ones [03:34] hardbound, those [03:35] pretty damn good condition. They're fiction books, the lot of them. [03:35] At some point I'll catalogue them all so I have a list to hand out [03:35] Talk him into doing sometihng with them [03:36] If only for "you'd clean out your garage" :) [03:36] Heh, it needs to be torn down [03:36] If somebody inspected it, I'm sure it would be labeled unsound [03:36] I could kick it over if I tried hard [03:36] It's just enough to keep the rain mostly off what's in there [03:39] * Coderjoe clears his throat [03:39] -bs [03:40] shaqfu, #archiveteam-bs :) [03:43] YES PLEASE [03:44] Anyway - are there any webcomic-archiving projects out there? [03:44] I saw the question go out and the response was "no" [03:45] but indeed, donate. if IA got them, I would hope they would be treated a bit better than what I saw in that one news story [03:48] So, since we got free artists for archiveteam brands/logos, and they're working on the stuff now, I think the Archive Team kickstart will be 100% for hard drives [03:50] excellent.. hmm, rewards.. [03:51] Yeah, going to think that over. [03:52] speaking of drives... [03:53] make any headway on that drive of friendster data? :D [03:54] $1000+: Rent-an-archivist [03:54] I would totally rock an internet archive shirt and/or stickers [03:54] who, me? [03:54] I still have it here in my git annex [03:55] no, SketchCow. [03:55] Someone goes to your house, gets your shit in order [03:55] IA shirt is good [03:55] shaqfu: the opposite of one of those hoarder shows where they throw everything out? [03:56] Coderjoe: The codeword for that is "aggressive appraisal" ;) [03:57] hell, I'd do that for $100+ [03:57] I love cataloguing [04:07] ... [04:07] that bronycon doc total is without whatever has come in via paypal [04:08] hahahaha [04:08] Bronies are serious business, it seems [04:08] and the average pledge amount on kickstarter for that is $130 [04:08] Is that being pulled up by some insanely high amount? [04:09] woah [04:09] six backers at $10,000 or more [04:09] one at $5k [04:09] 11 at $3k [04:09] one at $2.5k [04:09] Wonder if one is an aggregate donation, like the brony one for HIBs [04:10] 8 at $1k [04:10] But it's still a fuckton of cash [04:11] 7+16 at $500, 231 at $250, 98 at $150, 94 at $100, 262 at $75, 189 at $50, 744 at $30, 292 at $10, 5 at $10 [04:12] The hell? Registration at the con itself is $65. Where the hell are bronies getting this kind of money? [04:12] lol [04:16] saving up? [04:17] Oh hi :D [04:17] The doc broke 270k tonight counting PayPal [04:18] Zebranky: Remind me again, are you director or con chair or something, right? [04:18] Con chair, aye [04:18] And consulting producer on the doc [04:18] aha [04:18] so that's another $7k from paypal [04:20] Zebranky: ddg sends "bronycon" to http://www.bronycon.org/index.html [04:20] might want to redirect that properl [04:20] y [04:20] Indeed [04:20] or not, whatever [04:21] I'm a fair bit tipsy, so I don't want to deal with that shit just this moment, but either I'll get someone to tackle it tonight, or we'll do it tomorrow :D [04:21] Thanks for the heads up [04:21] Zebranky: Andrew Brockert? [04:21] Aye [04:22] famenagged [04:22] So hard. [04:22] http://cincymugshots.com/inmate-permalink.php?jms=1446873 :D [04:22] lolnope [04:22] hahaha [04:22] https://twitter.com/#!/lastzebranky more likely [04:22] That'd be me [04:23] cool [04:23] https://twitter.com/#!/zebpony for the pony stuff [04:24] Bronies are serious business, it seems [04:24] You have no idea. [04:24] I mean, *Hasbro* is paying attention. Hasbro corporate. [04:28] < Zebranky> I'm a fair bit tipsy <-- what, you too? [04:28] :D [04:32] everybody's tipsy [04:32] except me [04:32] I should go get tipsy :O [04:33] Still in Rochester or wherever? [04:34] I'm going to be getting quite hammered with three of my senior staff in Albany tomorrow and Saturday nights [04:34] We call ourselves the War Room [04:34] I'm in DC [04:34] well [04:34] south of DC [04:34] Rochester is in the fall :) [04:37] Well, I'll be in the area for the foreseeable future [04:37] Unless one of the $10k donors is Gabe Newell and he offers me a job at Valve, in which case, PEACE, HEADING FOR THE LEFT COAST [04:38] (This is not entirely unrealistic. I'm told the first donor at that level is a software engineer from Seattle who wishes to remain anonymous.) [04:38] A certain W. Gates III? [04:38] he's not a software engineer [04:38] Used to be [04:39] hahah [04:39] Zebranky: move to seattle, best city. [04:39] I've heard many good things. [04:39] I do loves me some Boston. [04:39] Think it'll always be my city. [04:39] I've only heard of the city's drinking tolerances [04:40] (seattle, that is) [04:40] lol [04:40] Boston has a significant Irish population. [04:47] woop woop woop off-topic siren [04:48] Alcohol is always on topic. [04:48] only when you're drunk [04:48] and this is the internet, there are sober people online all the tmie [04:48] The world needs more alcohol archivists. [04:48] ... [04:48] lol [04:48] I archive alcohol with my liver [04:49] s/archive/recycle/ [04:49] I've seen wine libraries; never archives, though [04:50] lending library? [04:51] I am intriqued, how would that work, exactly? [04:51] I think it was just a library about wine [04:52] The whole "exactly one use ever" thing makes it tricky [04:54] It could be like Kopi Luwak [04:55] So if the internet archive had a large collection of music CDs that were going to be digitized, what cool things would you do with the results? [04:55] BRAINSTORM TIME [04:55] like pandora but with weirder music [04:56] yeah, that's one thought [04:56] What kind of music? [04:56] And how big? [04:56] But there's all sorts of gross red tape involved because it's like "changing of media" and stuff [04:56] Every genre imaginable [04:56] >250,000 unique releases [04:56] Pitch it as a stock library [04:57] changing of format of media* [04:57] stock library? [04:58] undersco2: Music you can use for stuff when you need music, but don't care enough to buy rights [04:58] We don't have the rights though [04:58] I refuse to believe the intergalactic criminal Zod uses mirc [04:58] These are commercially pressed albums [04:58] Oh :( Thought it might've been a big set of free/orphan works [04:58] nope, sorry [04:59] undersco2: sound like a large dark blotch [04:59] THen, uh, shop it around the DH community? [04:59] DH? [04:59] Coderjoe: I don't understand? [04:59] Digital humanities [04:59] oh [04:59] undersco2: a large swath of dark items [04:59] I'm sure there's some history-of-music geek that'd want to number-crunch against a large set [05:00] oh, yeah. they will be atm [05:00] just trying to brainstorm of possible things to do with them [05:00] (one of my lower priority projects) [05:00] It sounds like your hands are bound by rights [05:01] Yeah. [05:01] lefts [05:01] But we could do something like openlibrary for audio [05:01] Ensuring not more than one person has "checked out" the track at a time [05:01] and as long as we maintain a physical copy, then it's purportedly okay [05:01] um... [05:02] yeah... you know full well that if it can be played, it can be copied [05:02] (doing something nasty with DRM or something to "ensure" consistency) [05:02] oh, yeah. no different than recording pandora though [05:03] It just hass to be sufficient "drm" to make the RIAA satisified [05:03] (unfortunately) [05:03] I long for the day where we could just have these huge archives of older songs [05:03] nothing will ever satisfy them [05:03] except all the money [05:03] except everyone giving them all their monies [05:05] nope [05:05] nothing redeems [05:05] After they get all the moneys, they would want more :| [05:06] It's a totally broken system D: [05:06] this is why I love bandcamp, et al though [05:07] they're the beginning of the music revolution (tm) [05:07] must mirror remix.nin [05:07] no, not going away (that I am aware of) [05:39] TODAY I LEARNED IF I PLUG IN TWO 3TB DRIVES INTO THIS SATA DOCK IT BLUESCREENS [05:39] A fun lesson, useful to nobody [05:40] I bet you won't be doing that again [05:40] at least until next time, anyway [05:40] most controllers/port expanders don't like 3tb drives [05:40] two 3tb drives* [05:41] I've seen that with a LOT of them [05:41] :( [07:14] bluescreen = windows? [07:14] * SmileyG likes centos pink screen [07:14] its rather amusing to see that explode. [07:14] osx kernel panic in 7 languages is purty [07:15] Windows 8 has an emoticon BSoD [07:15] potentially useful: http://joepie91.cryto.net/trac/wiki/resolv [07:15] sideways frown with a brief error message [07:17] do we think in future versions of warrior it would be possible to specify the number of workers? [07:17] I was able to run 8 copies of seesaw.sh on a big VPS [07:18] i guess if you want to do that you have to have the initiative to grab it by the balls and do it yourself [07:19] couldn't you just bash out a script which takes a numeric input and spawn that many copies? [07:19] i did that [07:19] erm [07:19] that's what I did [07:19] want to see if i can find it? [07:20] for x in [1..$1] do; ; done [07:20] I had tmux tile it all out with nload in the bottom [07:20] BUT, what I'm saying, is that most of the time it seems like 2 workers inside a whole vm is not worth it [07:21] you have a whole OS for 2 measly workers that aren't even touching your total bandwidth [07:21] it'd be nice for warrior to ask how many you wanted when it started [07:22] yeah, I guess [07:22] on a *nix it makes more sense to just grab the code and run it natively I guess [07:23] the VM does have the advantage of being foolproof [07:25] 2 seems a good one to do while we work out bugs [07:25] Warrior has only been in wide release a week. [07:25] It's probably a good idea to, you know, see it in the field for a while. [07:27] agreed [07:28] word [07:29] There's also a memory issue: the warrior takes 512 MB, I believe, so you can't run an infinite number of workers. [07:30] it'd be cool to be able to boot spot instances of ec2 using something like fog that ran this shit [07:31] they're at like 2.7cents/hr right noww [07:31] whoa the micros are at .6 cents/hr [07:32] how about a boinc project o_O [07:32] $52/yr for a micro spot instance [07:32] but you could get bumped [07:33] it only happens a few times a month [07:33] boinc could be interesting, if you get academia support especially [07:33] Need a willing student to do the coding ;D [07:33] SoC? [07:34] Oooo maybe [07:34] summer of cider? [07:34] :D [07:34] with the TR piece jason could probably get funding [07:34] Does he need anymore stress? [07:34] or just use regular old archive.org funds [07:35] I like how we went from something relatively free to something costing money. [07:35] its a collection of scripts that any linux boincer could run [07:35] Let's try to make it cost more money [07:35] they love to have projects just for them [07:36] SketchCow: lol yeah exactly [07:36] boinc should be simple enough to do freely [07:36] let's hire guys out of google and get a suite at a hotel to run the servers [07:36] * SmileyG might actually look into it today [07:36] also, every time we download a user, we burn a dollar [07:36] Is that factual cost? [07:36] It'd be like chinese ghost/spirit money, but real money [07:36] or just making a point? [07:37] but we have to put the ashes in a greeting card and mail them to someone [07:37] "Look, I spent money and now you get nothing!" [07:38] YOUR DATA DID THIS! [07:38] Let's mail them to the CEO who got fired from yahoo [07:38] lol [07:38] yes! [07:39] hm [07:39] I guess I'm technically running my shit on non-free architecture [07:39] because we're just not using it right now [07:40] http://boinc.berkeley.edu/trac/wiki/VboxApps [07:40] DONE IT [07:40] \o/ [07:40] does it need to be headless? [07:40] i'll figure a way to pass the command to it. [07:41] does boinc set env vars somewhere? [07:41] swhitt: sure, the instance itself may be cheap, but you forgot the bandwidth cost [07:41] Coderjoe: we just need 100 newbs with free micro instances [07:41] then why are you talking spot instances? [07:41] and you only get so much bandwidth in the free tier [07:42] how dare you poke holes in my postulating! [07:42] swhitt's idea is unworkable, but it's always good to have people brainstorm [07:42] OK, bed, busy day tomorrow [07:42] :) [07:43] hehe [07:43] however boinc seems very very possible if I just think about this today while I should be working [07:43] Need to modify the startup script for the warrior, but... (is the code for that somewhere, am I ok to *play* with the vm-image? [07:44] boot.sh in warrior-code? [07:44] I guess you want the tool that made the ova [07:44] and I presume your using mingetty with autologin or something funky like that. [07:44] I just wonder why, when our servers already rape a service pretty soundly, why you'd want it to extend up to mega-ultra-rape [07:45] SketchCow: you inspire us to FUC KSHI T U P [07:45] does heroku hate archiveteam? [07:45] Because with the exception of something like myspace or flickr, we'll demolish most services. [07:45] o_O spaces. [07:45] I'd bet heroku doesn't like us [07:45] Heroku likes us. [07:45] SketchCow: new age of slashdot effect? [07:45] Where as slashdot effect kills stuff, we are saving it for everyone? [07:45] are we paying for heroku? [07:45] I'm just saying, I think you need to go after other things. [07:45] Better uses of your time. [07:46] I'm at work [07:46] I should be working in 10 minutes [07:46] Than turning the warriors into a massive bandwidth-plugging please-look-at-us flare parade [07:46] That'd be a far better use of my time :D [07:46] Well, that's true [07:46] But.... this is *fun* and I might actually be able to do it [07:46] That jizz isn't going to mop itself [07:46] whoa I should not leave my linode vps running the mobileme seesaw [07:47] swhitt: lol i accidently left my work system running warrior ¬_¬ [07:47] it's DESTROYING bandwidth [07:47] fortunately due to me being one of the only two IT people [07:47] I can go "oh erm backups?" [07:50] is picplz going to start up again? [07:50] or is it finished? [07:51] New thought (maybe we should put the warrior ideas on the wiki?): if you're looking at improving the warrior I think it would be good to make Python version of seesaw. [07:52] Make it so that it manages one or more dld-* scripts, captures the output, make it modular enough so you can use it from within Python. [07:53] seesasw is just a shell script now [07:53] why not ruby!? [07:53] no one actually knows ruby? [07:53] I grok ruby [07:53] Ruby is cool. [07:54] But python is everywhere. [07:54] So are dogs. [07:54] So in the interest of not forcing everyone to install Ruby, scripts like these are best done in Python, I think. [07:55] (I originally wrote a Bash seesaw because I then thought that bash + these utilities were everywhere, but it turns out everyone has a slightly different bash.) [07:55] don't most systems have ruby now? [07:55] hmmm I don't :P [07:55] and python is changing every point release [07:56] warrior crashes in my 2004 macbook pro [07:56] inside of virtual box, every time [07:57] From the little that I have seen so far, the different Pythons are at least reasonably predictable. [07:58] ruby 1.8.x is pretty much homogenous [07:58] as is 1.9.x [07:59] have we approached services like picplz to ask them to give us the data in nice formats? [08:00] instead of raping their bandwidth [08:00] bleh [08:00] i don't feel like learning ruby just to contribute [08:00] swhitt: (I probably shouldn't continue.) Look, I don't say that Ruby isn't predictable or homogenous. I prefer Ruby to Python, I use Ruby whenever I can, but I think have the impression that Python is pre-installed on more machines. That is worth something for making these scripts. [08:00] and i don't have ruby installed, afaik [08:01] alard: understood; I think most modern distros / os x have it installed by default now [08:01] but if you're on an older system I get it [08:01] When it comes to having people install things, I'd rather have then install wget-warc (+lua). [08:01] lua is much more obscure than ruby... [08:02] bash: ruby: command not found [08:02] ruby [08:02] this system being only a few months old [08:02] Coderjoe: what distro? [08:02] debian, and i skipped tasksel [08:03] swhitt: Agreed, you can however include Lua in Wget, so you can write scripts like these: https://github.com/ArchiveTeam/picplz-grab/blob/master/picplz-user.lua [08:03] alard: any idea why wget would be passing a nil file pointer? [08:03] yeah, that is nice [08:03] (a few people reported such a couple days ago) [08:04] if you have a warrior OTA then it doesn't matter [08:04] because it can have any shit you want in it [08:04] To Lua? Well, it's not very stable and robust yet. Sometimes Wget says 'Can't write the file', but then it still gives the filename to the wget.callbacks.get_urls Lua method, which tries to open the file. [08:05] (It's passing file names, not pointers, so it gives the name of a file that wasn't actually written.) [08:05] damn this mobileme backup is insanely sized [08:05] ~300 terabytes!? [08:06] You can also get a Wget to segfault if you return the wrong shape of url list. [08:06] why wouldn't apple let us bring ~200 drives to their DCs and copy it right off? [08:07] instead of raping them for 25 megabytes per second for months [08:08] because it's akamai? [08:08] and are you smoking something? [08:08] no... [08:09] I guess I just don't understand why they wouldn't let us come plug in instead of hitting them via http for months [08:09] akamai isn't free [08:09] i can't recall a single time a closing site has handed over data willingly [08:09] one was even playing whac-a-mole [08:09] do we know what the reason is behind those actions though? [08:10] Privacy, for one thing? ("Hello user, we've given a copy of all your data to this obscure group, hope you don't mind.") [08:10] electronic archiving has merit, all the way up to the national archives [08:10] I'd have thought if the govt thought it was somewhat important then private companies would think so too [08:10] or I'm just being naive [08:10] companies are loyal to one thing: money [08:11] but isn't it more expensive to pay for bandwidth costs of us essentially ddosing them for months than just giving us filled harddrives? [08:11] if it makes money, how can we get it to make more? [08:11] if it isn't making money, can we dump it? [08:12] Let me interrupt for a moment to paste the rest of my earlier argument: a Python seesaw module would be the way forward I think. It makes it easier to script, so on top of that seesaw module you could build a seesaw manager. This would make it possible to scale the number of workers up and down while the thing is running, for instance. [08:12] Plus (I don't know who said that, was it you, Coderjoe, or closure/chronomex?) a seesaw manager would be able to bundle uploads and throttle the upload bandwidth better. [08:12] That is useful if you want to make a 'auto-warrior', where you run the thing and let AT decide which project should be run at the moment. The seesaw module would also allow us to make, say, a web interface that you can use to configure your warrior. [08:12] (So, that's in the log.) [08:13] i suggested it [08:13] I find it hard to believe (perhaps naively) that major corps like IBM wouldn't like the PR of handing over data to be archived when they shut shit down [08:13] is it because we get it from the publicly-accessible web interface that they think it's cool? [08:15] I'm truly not trying to debate - I was just wondering how thoroughly these avenues have been explored [08:15] <@SketchCow> I herewith ask you to completely remove alle files, datas, photos, tabblos, texts etc. of tabblo user moorlerche from your website as well as your offline servers and back-ups: [08:15] <@SketchCow> When I call up my user name 83 files are shown and moreover downloadable by whoever clicks on download zip.files. This is not in my interest and does violate the author's right after European legislation, which was part of the user contract with Hewlett Packard. [08:15] that is why [08:16] That's a paste from a user email? [08:16] it would largely result in negative pr from a privacy perspective [08:16] Coderjoe: Ah, so it was you. So making a python seesaw would be really useful, I think, especially if it would be flexible enough to build on. [08:16] yes [08:16] hm [08:17] this makes me think [08:17] do you think that all publically available data on facebook should be forever available to anyone? [08:17] good lord the dashboard is taking forever to load [08:17] yeah, the first script takes like 15+ seconds [08:18] it's been a minute [08:18] memac or picplz? [08:18] handing over _user_ data willing, or just data? [08:18] picplz [08:19] SmileyG: well, that email was referencing user data [08:19] yah [08:19] I think companies fear inciting wrath if they hand over anything user related [08:19] how will that tabblo removal request be handled? new tarball without that data? [08:19] I honestly don't know [08:19] Do IA care? [08:20] if I had a tarball with all of facebook's current users' data and their public photos people would be pissed [08:20] The internet doesn't forget. [08:20] that user's data will probably be moved to a dark item [08:20] swhitt: theres a dating site with exactly that [08:20] well, photos.... [08:20] :D [08:20] it uses them on the dating site. [08:20] SmileyG: I know that shit like that exists and people HATE it [08:20] http://digitallife.today.msnbc.msn.com/_news/2012/04/30/11471856-moms-facebook-photo-pops-up-on-porn-dating-sites?lite << lol [08:20] whenever anyone discovers that their data (by default) on facebook is publically availbe they freak [08:21] swhitt: but is it illegal? [08:21] SmileyG: do you have any idea about copyright and such? [08:21] I'm not sure if it's illegal [08:21] I'm not sure if it's moral [08:21] I honestly don't really know [08:21] yeah [08:22] Schbirid: copyright is with the holder of the camera at hte time of takign the image [08:22] unless otherwise handed to another enity. [08:22] The internet and social media make all of this shit pretty ambiguous [08:22] Schbirid: If your not claiming to own the image, theres no legal recourse afaik. [08:22] what if the ToS of FaceSpace says that if you upload images, copyright transfers to them? [08:22] swhitt: and you agree? Then you've agreed. [08:23] who reads clickthrough ToS'? [08:23] Not the discussion :) [08:23] SmileyG: there is usually a license grant as part of uploading said picture to a site, though [08:23] Coderjoe: yup. [08:23] ... [08:23] So, Person X whos complaining, likely doesn't even OWN their own photo anymore [08:23] I bet 99.9% of facebook users have not read the ToS [08:23] dude, shut up and come back when you educated yourself [08:23] swhitt: Agree. [08:23] Schbirid: sorry? [08:24] Point out where I'm wrong? [08:24] not copyright as a whole, but a license to distribute the image as part of displaying the page [08:24] so is it legal or even moral for us to copy and distribute the data of a user if they ask us not to? [08:24] swhitt: I don't know either. [08:24] if a ToS grants a license to a site to display an image, that has nothing to do with the ownership or control of rights [08:24] Schbirid: right. [08:24] [09:23:43] < swhitt> what if the ToS of FaceSpace says that if you upload images, copyright transfers to them? [08:24] SmileyG: that's what I was getting at [08:24] Thats _NOT_ what he said though. [08:24] swhitt: no, it is not legal unless their data was released under a free license [08:25] Schbirid: i know that [08:25] Schbirid: but what if that's part of the facebook ToS? [08:25] uploading images implies that you own them and you're giving facebook unlimited license to them [08:25] or transferring copyright to them [08:25] what if what if, transfer of copyright is not part of any reasonable tos [08:25] have any courts addressed that? [08:26] I honestly have no idea [08:26] read the tos and checked what actually is written in them [08:26] to me, as a layperson, this shit is morally ambiguous [08:26] Schbirid: legal presidence? [08:26] swhitt: Me too. However discussion is good on any level [08:27] Hey guys, I don't know because I'm not on that channel, but would this qualify for #archiveteam-bs? This is becoming a very long discussion. [08:27] Yeah, Moving. [08:27] typically "by uploading this photo, you grant $site a non-exclusive license to redistribute the photo" (possibly with "in perpetuity" or something like "until you revoke such license") [08:27] i dont agree, this matter is very much about archiveteam and legislation [08:28] it does belong in -bs [08:28] my fault [08:28] sorry for starting it here, moving [08:28] AT's general philosophy is "grab now, worry about legality later" [08:28] what is the purpose of this channel then? [08:28] why not worry here :( [08:28] because once the house is burned down, the discussion is over [08:29] and the legal questions come up again every few weeks [08:29] ah, ok [09:25] looks like this morning the ovh server giveaway took over an hour to complete [13:56] i'm archiving G4 E3 videos [13:56] i also have some full months of x-play now [14:31] Don't know why you'd bother with X-Play. I watched that show a few years ago and it was pretty bad. [14:31] Wasn't helped that on subscription TV here, in Australia, it was a few weeks late so a lot of the industry news and stuff was irrelevant. [14:31] Fortunately a local show replaced it, on a non subscription TV channel :D [14:33] Well, different people have different tastes and it might have some relevance to someone in the future. It's not relevant to judge what's being saved, it's being saved (In my opinion). [14:33] Yeah I guess. But damn what a waste of space :P [14:34] Meh. [14:34] Granted this is coming from a person who persistently records endless hours of political crap on TV even though said political crap is available online (albeit in a really low resolution format) from the broadcaster. [14:35] Yeah, I do understand the humour and sarcasm :p [14:37] brayden must be moxtv :) [14:37] lol [14:37] I do actually get Fox News live here. I just need to figure out how to wire up the Foxtel box to the computer [14:38] thing is it might cause issues if someone actually wants to watch the subscription service that costs $100/month [14:38] eh, you never know when something might become interesting and yet be ephemeral -- here in Chicago, a radio station was using an ad campaign featuring images of the former governor Rod Blagojevich, and they had to cover his face with a smiley face because said governor felt insulted [14:38] I have no images of the pre-smiley face ads [14:38] and I wish I did [14:39] lol [14:39] I have endless hours of digital radio too. A lot of it with presenters stuffing up [14:39] One had a particularly bad day because he was moved to a new studio [14:39] A politican whos name begins "Blago" ? [14:39] yes [14:39] :D [14:39] Rod Blagojevich, former governor of Illinois [14:39] and all-around asshole [14:40] lloll [14:41] lol [14:42] oh looks like he got screwed [14:57] Oh my, things really get out of hand here lately. Please (Nice word, not angry word) use #archiveteam-bs for "off-topic" discussions/conversations. ie. things not related to the actual act of archiving/getting/grabbing/rounding up items into a collection or so [14:59] sry. [14:59] Well it is not mentio--.. oh [14:59] * brayden looks at topic [14:59] ;-) [14:59] http://archiveteam.org | GET YOUR INNER WARRIOR: http://archive.org/details/archivet] [14:59] All i normally see. [15:00] I see: [15:00] it's just that since we're more people, and there's a lot of projects - it's really nice if we keep non-issue stuff to some other channel - so that projects get the proper attention [15:00] oh damnit we're doing it again [15:00] Does IA have a games archive thing? [15:01] I've seen SketchCow in IA and tons of games. So I think so. [15:02] * SmileyG goes to figure out how to search the IA site properly [15:28] new fileplanet (#fireplanet) statistics: 76k files downloaded, 3.2 Terabytes [15:28] Yes, it has an archive. [15:31] You now what I love waking up to? [15:31] A letter from a lawyer saying you're getting sued for billions of dollars? [15:31] 37k IDs left to try [15:31] I love waking up to someone badgering my most productive applications coder with endless "do it completely different and I question every choice you've made" messages, making him justify over and over all his excellent choices. [15:31] I want that shit to scroll for DAYS. [15:31] Can I get ops, please? [15:32] He's not the real SketchCow! [15:33] I'm as real as fuck. [15:34] brayden: The Internet Archive has been recording dozens of tv for 10 years [15:34] nice [15:34] http://archive.org/details/tv enjoy [15:36] Jeezus. That syntax took far too long to google. [15:36] Thanks. [15:37] One does not simply ghost SketchCow [15:37] into #archiveteam [15:37] No prob. I just wish the commands/parameter order were a bit easier to remember. [15:38] Hm? /op works in most clients. Very standard alias :) [15:38] Ya wanna know where I found that on Google? Nowhere. [15:39] You went "/mode #archiveteam +o "? [15:39] But saying "/mode Sketchcow +o" makes a very unhelpful error message. [15:39] Eventually, yes. [15:39] indeed. Good stuff. [15:40] It's like trying to remember a funciton name in PHP. doThis? do_this? dothis? doth? [15:40] function, even. [15:41] Oops. Too many uparrows. [15:41] I think I'm going to wander off and do not-typy things for a while. [15:43] lol [15:43] use something like phpstorm then. it has autocomplete for most functions builtin to PHP [15:47] Anyway, as if I need to continue this overnight conversation, python is the way to go. [15:47] lua is an unfortunately more obscure thing, but the framework can do other things in the future. [15:49] I've used Lua in the past and it is very easy to learn! [15:49] Oh wow, this #-bs thing is getting complicated [15:49] lol [15:50] It exists, so now there are edge cases [15:50] What, you mean what stays and gos? [15:50] A lot of edge cases [15:51] I agree, it's here and there, but as annoying as it is to have to justify base premises, justifying base premises helps think through things we might have just lunged at because we were under a time crunch. [15:51] I wouldn't call where we are a "lull", but we're not in full-on crazy mode. [15:51] Yarr [15:52] I do think shwitt got a little reductionist there. [15:52] With bonus morality discussion, which is pretty dull and probably should have been quelled. [15:52] It was, it continued in #archiveteam-bs for three ages [15:53] Did it? [15:53] Yepp [15:53] and it's a-ok for #-bs talk, even if it's (IMO) a bit dull [15:54] It's not dissimilar to when people would show up with Windows XP and huge smiles wanting to "help" [15:55] Without understanding that, like Mastiffs, bringing Windows into the room rarely "helps" [15:55] We have since fixed that with this ArchiveTeam Warrior project, which has made Mac, Linux and Windows boxes all run homogenous, dependable downloaders with PRECISE tracking of downloading. [15:56] also, WARCs [15:56] And as the full-scale assaults on Tabblo and Picplz showed, they're very effective, and the resulting files are very complete and dependable. Even if something goes wrong, they all go wrong the same way. [15:56] I count that as homogenous. [15:57] WARC + Data [15:57] * ersi nods [15:57] So, let's let alard keep working on the warrior - he's been rewriting it a few times and adding features. [15:57] I'd like us to focus on other problems. [15:57] I'm working on the fundraising thing. [15:58] Got two artists doing stuff. [15:58] fundraising for? :o [15:58] My apologies for being out of the loop but fundrasin- [15:59] For hard drives. [15:59] dammit. [15:59] Mmmh, hard drives. It's what *drives* us. [15:59] Tax Deductable kickstarter for archive.org to buy hard drives to lessen the load [15:59] I was thinking "I might suggest Kickstarter but that would be stupid" [15:59] We were 25% of all incoming archive.org data that wasn't their own web crawls. [15:59] That'd be maybe the first KS I donate to :) [16:00] You'd want to try and get it on KS's frontpage if you do, and maybe state a specific project you would use said hard-drives for [16:01] (this is purely my opinion) [16:01] SketchCow: Got that one public yet, or is that what you're working on? Kickstarting IA disks? [16:03] Working on. [16:03] I don't need help with it. I need to see about thinking of good rewards. [16:04] Yeah, I was just interested. Will probably chip in (even though, it isn't tax deductable for me) [16:04] Hmmmm, does Kickstarter take out the usual fees for the project? Else, I'll just donate to regular IA instead [16:05] SketchCow, floppy disk badge, signed beard photo, specially designed flash drive with a full copy of Textfiles.com, yo-yos, me in a tutu... [16:11] http://piratepad.net/88kaSOEokp < a little idea list of mine (serious) [16:15] I'm not really sure what could be added as rewards since I don't know the resources that IA would have access to on the free/cheap [16:16] Yeah, I'll think it through. [16:17] 2000$ gets a sponsored tweet from sockington ;) [16:17] So that's on my side, the project I'm working on, as well as getting further details together for ArchiveteamCon II [16:17] SketchCow did one of my responses really get a laugh at one of your DEFCON presentations? I really must watch those [16:18] You need to get a TV set up and a Skype call so the rest of us can join in :P [16:18] When you went full-chicken out on poetry.com [16:18] And I had to send you to the playpen for a week [16:18] I have no memory of that, my memory is awful [16:19] http://www.youtube.com/watch?v=-2ZTmuX3cog&t=35m05s [16:20] Archiveteamcon *TWO*? [16:20] I fucking love your outfit [16:21] This was the name we came up with [16:21] ah, I see [16:21] "We lost all the records of ArchiveTeamCon I" [16:21] "Yahoo decided to shut that Con down" [16:21] "to cut PR-losses" [16:23] lol you sure you were talking about me SketchCow in that video? doesn't sound like something I would do [16:24] ersi: we had the get-together details on Geocities [16:24] well, you did chicken out when that nice lawyer suit came if I remember correctly [16:24] "Oh my god, oh my god" etc :) [16:25] ...um...shit, I actually remember some of that, crap I wanna see logs of me chickening out now [16:31] SketchCow: at least for me http://archive.org/details/tv and all of the stuff under it redirects to http://archive.org/details/911 which is cool but maybe not what you were intending to link to? [16:32] Oh, yeah, probably admin only right now. Apologies. [16:33] I know they want to to open it soon. [16:34] Nearly at $20k [16:34] For git-annex [16:34] omigaws amiwagod [16:34] haha the pressure builds [16:34] * chronomex squee [16:35] Let's see if my machine explodes. [16:35] just checked tracker stats, we made picplz our bitch didn't we. [16:35] Oh yeah, picplz had a copy party and didn't know it was invited [16:37] we're the mc double def DP, the pic proectors for you and the posse. [16:38] thats your archivists, anarchists, haxors, and pro-grammers, they download the images for history and grammas that lets you learn, but also lets you play, with the pics you came here for today [16:39] Yes, that's what archiveteam was needing, freestyle. [16:40] i'm just stealing MC Hart's flow. instead of "Don't copy that floppy," it's "Don't forget to mirror that site" [16:41] also archive team only being 25% of non-wayback IA uploads, that is an insane amount of non-AT content [16:42] so i've been kicking around a site idea for a while, wondering if something like it exists. Basically a site with end-user guides on getting your shit OUT of the cloud [16:43] across various services, along with tools, etc [16:43] would i be duplicating any efforts anyone is aware of? [16:43] I saw something like that [16:44] any idea where? [16:44] http://www.dataliberation.org/ [16:45] limited to google I guess [16:45] yeah that's google centric [16:45] thats their official thing [16:45] i'd be pulling heavily from that for google items [16:45] other then that, i haven't seen any good central resource, it's piecemeal all over the place [16:46] hell, would probably just mirror that as a backup and point people to it [16:46] (the current one) [16:46] DrainLib: Put it on the archiveteam wiki [16:46] Speaking of which. IS THIS THE DAY I FINALLY FIX THE WIKI??? [16:46] Sounds like a plan [16:47] I took some old notes, will have to dig them up and then I'll get wikifying [16:48] but the title blacklist looks so happy [16:48] I need us to let people on, we need to do more wikicleanup [16:49] dataliberation.org waybacks like a year old. got any pull to crawl it again soon? [16:50] SketchCow, I cleaned up that wiki for you ages ago, DO NOT BREAK IT. [16:51] SketchCow: Let me know when all's good, I'll get on there and start writing .... will also populate some articles along the way on the List of social networks with some descriptors of what info is in there [16:52] even if i have no idea how to get stuff out [16:52] fyi everything on wayback is a year old right now, they're still crawling but way behind on indexing/putting-up for some reason [16:52] see e.g. http://wayback.archive.org/web/*/http://www.cnn.com/ [16:53] got it, will drop that line of questioning :) [16:54] Google's Data Liberation Front team has been silent since January... oh yheah... nothing to worry about with that piece of information, move along now Googlers ... [17:02] man...that gay construction worker man story on the video really warmed my heart. [17:15] holy god an archivememe is on there [17:16] hahaha [17:18] I didn't know Away From Keyboard existed [17:18] Just [17:18] Holy crap [17:18] SketchCow, undersco2, chronomex, dammit I love you all [17:18] :D [17:19] I'm actually close to crying here [17:28] What's the easiest way to grab a copy of textfiles.com? [17:31] http://archive.org/details/textfiles-dot-com-2011 [17:32] I really should have thought of going through IA [17:32] Thanks DFJustin [17:58] I had an idea for a form of "video series" [17:58] Where a member of AT dives into the archives of Geocities/Textfiles/Usenet and finds interesting stories to bring to the forefront [17:58] In order to say "this is what we're backing up" [18:58] http://www.bbc.co.uk/sport/0/football/18356761 [18:58] woops wrong window [19:03] Whenever the wiki comes up, I'll put these in... my rough idea/format/improvements on getting data out of cloud services [19:03] https://docs.google.com/document/d/1xG_9eVTxox-ARdLDcF2evqWKxLTHbBL33MlMFHU-vT8/edit [19:03] More to come, with better links/steps, etc [19:04] i had a whole list of services i wrote up somewhere, could easily just do it again [19:07] Pages will be less archiveteam-internal centric, more I'm Joe Smith and want to know how to get my shit out of XYZ service when it's suddenly shutting down, or not shutting down and I'm just sick of using it [19:07] focusing on easy to use tools/methodologies and detailed steps [19:10] will still keep the archivist-AT-team focused material, just split each page into the basics up top, technicals in depth below. I like the Facebook page format and would likely follow similar [19:16] also just stumbled on data compression algorithims and tools wiki page. is there something like "What the hell is this file?" that if someone finds old media they know how to get info off it? [19:17] Basically, file extensions and some explanations on what they are, what one might want to do with them [19:19] "i wrote my college thesis in 1988 and saved it in this WP4 format, now I don't know how to open it!" [19:19] for example http://www.fileinfo.com/extension/uc2? (one of the first matches when searching for that rather obscure file format) [19:20] Yup. [19:20] Starting a bit more niche perhaps, focusing on Document, Graphic formats :) [19:20] even linking into this [19:20] cause yeah this is awesome [19:20] ok nevermind that idea [19:21] done and done [19:29] Car Talk show to end. 35 years of programming, new shows to air remixed from archived material - http://www.wbur.org/2012/06/08/car-talk-retirement [19:31] podcasts back to 87 - http://www.cartalk.com/content/browse-shows [19:32] sadly, all paywalled [19:33] the actual files are on itunes/audiable [19:33] so they won't go, will they? [19:34] i wouldn't rely on itunes/audible for long term [19:34] Yeah true. [19:34] Someone got deep pockets want to buy them all? :D [19:34] i see torrents on a less then legal one eyed shipping fan website [19:35] yheah erm I found a torrent too :S [20:21] Back [20:26] welcome [20:28] All documentaries shipped out. [20:28] That part of my life (backorder fulfillment) is increasingly becoming a drag. [20:30] my new colocated server is in and im happy :) 16 cores and 32gb ram of goodness [20:30] :) [20:31] SketchCow: Do you have a pretty continuous stream? [20:32] i'd say his life has a continuous stream [20:32] maybe ask him tomorrow for confirmation [23:37] http://thomas.loc.gov/home/thomas.php [23:38] I think if you close your eyes, you'll get a pretty continuous stream