[07:39] ersi: Me too
[07:40] Are you from Sweden? I think I recall something like that...
[07:40] Because if you are, den hemliga traedgaarden arranges for a bus (which I'm going to join)
[07:40] #s23c
[07:41] #s23k
[07:42] This is important
[07:42] https://thepiratebay.org/torrent/6554331
[07:42] Please help seeding this
[07:42] 32 GB of scientific articles in the public domain
[07:42] Previously behind a paywall at JSTOR
[07:43] A guy used his university license to download all of it, now faces charges for copyright infringement
[07:43] Is that from Aaron Swartz's "attack" on JSTOR?
[07:44] Yes.
[07:44] Or rather, it's the same articles
[07:45] I don't know where the data comes from
[07:45] But please seed.
[07:45] And since they're in the PD, one could justify them being hosted by archive.org? (nudge SketchCow)
[07:48] jch: Yeah, you remember correctly. :) I'm flying there, though
[07:48] jch: And besides, I bet the bus goes from like Malmö or something else really down south
[07:48] Nope
[07:48] sthlm -> gbg -> malmo
[07:48] oh dang :)
[07:49] flight is already booked and paid for though
[07:49] I'm going to live in Stockholm in a couple of weeks
[07:49] OK
[07:49] See you down there though
[07:49] You are? Neat!
[07:49] poke me on IRC and we'll have lunch or a beer or something :P
[07:50] sure thing
[07:50] I need a place to live though
[07:50] You know any people with a vacant room
[07:50] or maybe a collective
[07:50] Can pay from aug 1.
[07:51] Will move in the 5th, then going to hacknight.se and CCC, coming back the 15th, living there 4real :)
[07:51] Studying at KTH next semester
[07:51] unfortunately not, but I'll poke and see if anyone I know has
[07:51] cool, thanks
[07:52] http://kollektiv24.forum24.se/kollektiv24-about989.html
[07:52] I'll pass it on :)
[07:53] Speaking of JSTOR...
http://www.google.com/search?q=%22JSTOR+transmission+must+contain+the+same+copyright+notice%22+filetype:pdf
[07:54] I wonder if there's a good way to download all of those (while deduping).
[07:55] Wow, now that I look, yesterday there were under 200k results for that query. Today it says 476k.
[08:13] ersi: thanks.
[08:13] I am now at work
[10:22] ersi: up for splitting a kryoflux?
[10:22] we could do some public good at the CCC
[10:23] i'm completely certain that there's going to be lots of people there with old disks that need saving
[11:35] Or, maybe we should announce a promotion for that.
[11:35] The KryoFlux?
[11:35] Yeah, an ArchiveTeam appearance for CCC
[11:35] I wouldn't mind
[11:36] And there's still time
[11:36] what does it mean to announce a promotion? (sorry...)
[11:36] announce an appearance, or some kind of competition or what?
[11:38] announce an appearance most likely
[11:41] SketchCow: would you come?
[11:47] No.
[11:47] But I'd promote it.
[11:48] Cool.
[11:53] ersi: should we do it?
[12:01] 5362.9 / 33259.3 MB Rate: 2135.5 / 4801.4 KB Uploaded: 3189.5 MB [15%] 0d 1:39 [ R: 0.59]
[12:01] JSTOR_01_PhilTrans
[12:01] So I'm on that.
[12:01] Cool
[12:01] Is it by any chance ending up on archive.org?
[12:01] This one I'm actually going to check with them first for an opinion.
[12:02] As I understand it it's Public Domain data
[12:02] Activism is cute, but Maxwell is right, there - he's punching some mighty clouds.
[12:02] Oh, you read it Public Domain from the description on a torrent on the Pirate Bay, did you?
[12:03] yes, and from Aaron Swartz' (sp?) breakdown
[12:03] Aaron is cute as all heck but his knowledge of law is shaky.
[12:05] Guy's young!
[12:05] Maxwell
[12:05] Anyway, since the entire purpose of this is to punch JSTOR in the eye, I'm just going to check with my folks about their opinion.
[12:05] Other sources say the works are copyrighted
[12:05] ok
[12:05] There is a good chance they are.
[12:05] They however "should be" in the public domain, for the better of the world (presumably)
[12:05] This is a cute fight but it's not archiveteam's fight.
[12:06] We do like them datas though
[12:06] this isn't user generated data being destroyed by forces outside of people's control
[12:06] Oh, we LOVE them datas
[12:06] But this information has lots of advocates
[12:06] And maybe I can put it up, of course.
[12:06] But I will likely check
[12:06] Certainly spot on
[12:06] Cool
[12:07] ersi: Are you up for splitting a kryoflux?
[12:07] splitting?
[12:07] I've already downloaded 20% of this thing.
[12:07] ersi: the costs
[12:07] I will buy you free of it when I return to Copenhagen (january)
[12:07] I just don't have the money to pay for it all right now
[12:07] What does it go for? (I have no clue)
[12:08] It's for backing up old floppies
[12:08] I believe $80
[12:08] I was thinking we could bring it to the camp, and make a statement in English and German telling people to bring their archaic stuff for archival
[12:08] I misunderstood your question, it seems ;)
[12:08] €100 if you want it with cables and all
[12:09] http://webstore.kryoflux.com/catalog/product_info.php?cPath=1&products_id=29
[12:09] Then we'll need a 3.5in and a 5.25in drive
[12:09] There is an old arcade game village there. They probably have a lot of games.
[12:10] If you guys want to do that, we could talk to the people running it to make it a promoted/known event
[12:10] unless it's all arcade machines
[12:10] I have connections in that regard
[12:10] And I can have stuff translated to German
[12:14] (probably)
[12:17] Damn.
[12:17] So I'm over here downloading archive.org shareware CDs from a collection uploaded about 5-8 years ago.
[12:17] And of course preparing to upload hundreds of mine in ISO form.
[12:18] Some of these, man... these are some lame ass CD-ROMs
[12:18] And they did a horrible job describing them.
[12:18] IMPORTANT HISTORICAL DATA.
[12:18] ;)
[12:23] http://www.archive.org/details/BetweenHeavenHell
[12:23] Oh, it's all going in, make no mistake.
[12:36] I bet that if you used something like deduplication on the shareware collection it would end up being less than 10GB :P
[13:09] Some of these are definite crap.
[13:09] Or do you mean the whole site? no way.
[13:10] SketchCow: Do you have the magazine Atari Life at hand?
[13:11] Atari Age*
[13:12] This video was kinda interesting: http://www.youtube.com/watch?v=IqR6fnxAnN0
[13:13] Oh, they're already archived here
[13:13] http://www.atariage.com/magazines/atariage.html
[13:15] I spend way too much of life staring into the face of people like that atari age guy
[13:16] hah
[13:16] he's made some pretty solid ragtime arrangements
[13:19] http://ia600509.us.archive.org/5/items/SWAVALANCHE/Avalanche.jpg
[13:19] Woah, $159.00 shareware CD
[13:19] Haha!
[13:22] Look at this man sight-read video game music: http://www.youtube.com/watch?v=9LrelS09BMc
[13:24] Discovered Tom Brier, did you
[13:24] http://www.youtube.com/watch?v=AcZroW6fGcA
[13:25] That's the one to watch
[13:25] He plays it twice
[13:25] I've been with him from the start ;)
[13:25] I was suckered into Keeper1st's videos by this one: http://www.youtube.com/watch?v=f4PvHmqvQko
[13:25] Not Brier, still good ;)
[13:26] This is improv
[13:26] What's critical is 1:54
[13:26] in the animaniacs one?
[13:26] Yeah
[13:26] When he on the fly rearranges it
[13:27] I don't really have any emotional attachment to Animaniacs
[13:27] Ok, that is pretty awesome.
[13:27] Because he's pissed the guy added optional cues for a flute
[13:27] so he integrates the flute
[13:27] the guy being the atari guy I linked earlier
[13:29] I can't wait to see what's in the $159 shareware avalanche. Almost done downloading, then I'll mount and extract.
[13:33] Surprise, Surprise, it's more of the same
[13:37] Fucking damnit, another corrupt one.
[13:37] What a shitty, shitty job this was.
[13:38] I've now sent out a message asking for the original CDs, hopefully buried somewhere in a storage locker at archive.org's locations.
[13:44] I don't like how many of the 77 CDs I have to make dark.
[13:44] It's going to be, like, 20.
[13:50] Can't take it. Going to upload 118gb of CDs to the site to make up for these lame-ass dead ones.
[13:55] how seriously does archive.org take copyright?
[13:55] I know you're entitled to pass shareware around to all of your friends, so that's not really a concern
[13:55] but otherwise...
[14:00] I wish Ron would upload more of Tom's sight-reading, but he's been pretty mum lately.
[14:07] To be honest, I've personally never found the reliability of optical media in general particularly stellar, either.
[14:12] That's pretty much true. I saw something recently about old Playstation discs deteriorating, even. (Not sure on the veracity of that, but I'd not be surprised in the slightest.)
[14:13] Though my DVD±Rs still seem to hold up okay once I managed to find a halfway decent burner (I definitely keep at least some DVDisaster ECC images around just in case, though)
[14:14] And even when the media itself is reliable, they seem to get too scratched up way too easily with frequent use no matter how careful you try to be (at least, in my case) because of the ridiculously soft clear plastic layer. :P
[14:37] JSTOR torrent: 1.8MB/s down, 6.4MB/s up
[14:37] So far I
[14:37] So far I've pushed 55GB
[14:37] I won't discuss archive.org like I'm representative of them.
[14:38] Who are you talking to?
[14:38] Everyone
[14:38] ok, i guess
[14:38] JSTOR_01_PhilTrans
[14:38] done 33259.3 MB Rate: 3231.2 / 0.0 KB Uploaded: 25374.2 MB [ R: 0.76]
[14:38] Hooray
[14:39] Wow, your ratio is <1 on a piratebay torrent
[14:40] sketchcow: "I just work here" http://www.flickr.com/photos/textfiles/5659199357/
[14:41] :P
[14:42] I must be losing my mind.. what is the key/menu option to save entire webpage as jpg in firefox?
[14:42] there isn't one, some third-party screenshot tools can do that though
[14:42] there is an addon called abduction (I think)
[14:43] ah.. i must have had one before i installed ff5
[14:44] had to install 'lightshot' as it was the only ff5 compat one on launch day
[14:44] and it can't do whole pages, just rectangles on screen
[14:50] thanks Cameron_D, abduction works ok
[15:09] JF - Proceedings of the Royal Society of London (1854-1905)
[15:09] SP - 356
[15:09] T1 - Thermo-Electric Properties of Salt Solutions
[15:09] TY - JOUR
[15:09] VL - 55
[15:09] EP - 373
[15:09] PY - 1894/01/01/
[15:09] UR - http://dx.doi.org/10.1098/rspl.1894.0053
[15:09] M3 - doi:10.1098/rspl.1894.0053
[15:09] AU - Emery, G.
[15:10] or
[15:10] JF - Proceedings of the Royal Society of London. Series A, Containing Papers of a Mathematical and Physical Character (1905-1934)
[15:10] T1 - On the Electricity of Rain and Its Origin in Thunderstorms
[15:10] TY - JOUR
[15:10] VL - 82
[15:10] IS - 553
[15:10] SP - 169
[15:10] EP - 172
[15:10] PY - 1909/05/06/
[15:10] UR - http://dx.doi.org/10.1098/rspa.1909.0020
[15:10] M3 - doi:10.1098/rspa.1909.0020
[15:10] AU - Simpson, G.
[15:10] ER -
[15:10] So that's the format.
[15:26] I'm trying to prepare http://thepiratebay.org/torrent/6554331/Papers_from_Philosophical_Transactions_of_the_Royal_Society__fro for upload to IA
[15:26] Don't.
[15:26] I'm on that
[15:26] ah ok
[15:26] Are you removing watermark?
[15:27] We're stewing on the thing
[15:27] looks like greg wrote a script for that
[15:27] Did he, where is it?
[15:27] no idea, he said so on #wikimedia at FreeNode some hours ago
[15:28] Wait a minute.
[15:28] ok
[15:28] Greg Maxwell......
[15:29] Oh yes, now I remember.
[15:29] One of the worst of the worst with wikipedia
[15:29] OK, anyway.
[15:29] ...
[15:30] If someone is interested in doing that, they should strip the JSTOR watermarking and djvu them, it roughly halves their size.
I've got scripts to do this, I don't remember how well they work.
[15:30] so I've unpacked the papers and stored them temporarily
[15:30] http://cdmirror.textfiles.com/JSTOR_01_PhilTrans
[15:30] So anyone who wants to see, and is curious.
[15:31] What fields are you using?
[15:31] I thought something like http://p.defau.lt/?bH1gxEERT0OowEBqoST0Gg
[15:34] I'm going downstairs, BUT
[15:34] - I am unpacking the collection
[15:34] - I have suggested this to my bosses
[15:35] - If I end up getting no pushback, I'll work with a few of you to make a thing
[15:35] - I have direct admin s3 access. if I do it, it'll all be up within a day.
[15:35] All gabillion papers
[15:35] yep
[15:35] So let me let them wake up and see what happens
[15:35] Meanwhile I am making a staging area for hundreds of CD-ROM ISOs
[15:35] And THOSE are going up
[15:37] This is about 18,000 papers, by the way.
[15:37] heh heh.. got an archive of all AOL disks/cds?
[15:37] I probably do
[15:38] all the way back to QuantumLink
[15:41] why does archive.org even bother showing things like this : http://web.archive.org/web/20090303154356/http://quantumlink.tk/ ? It just gets your hopes up that something useful was saved
[16:03] Hah, I thought gmaxwell looked familiar
[16:10] ersi: who is he
[16:18] jch: the guy who made the torrent you linked
[16:18] ah, I knew that
[17:16] i came here to post this https://thepiratebay.org/torrent/6554331/ but jason has it on his galactic-server
[17:26] http://news.slashdot.org/story/11/07/21/1632244/Wolfram-Launches-Computational-Document-Format
[17:28] IN THE NEWS: PDF FORMAT IS DEAD.
[17:33] SketchCow, Nemo_bis: fair warning, some of the metadata has some HTML crap in it. I didn't catch it previously.
[17:38] awesome torrent emijrp
[17:50] hi MarkDille
[17:51] urlteam torrent is slow, trapped in 48%
[17:54] gmaxwell is the wikipedia guy Nemo_bis ?
[17:56] http://googleblog.blogspot.com/2011/07/more-wood-behind-fewer-arrows.html
[17:57] [Jul 21 11 11:28] Greg Maxwell......
[17:57] [Jul 21 11 11:29] Oh yes, now I remember.
[17:57] [Jul 21 11 11:29] One of the worst of the worst with wikipedia
[17:57] Hello emijrp
[17:58] HAHAHAHAHAHA WIKIPEDIANS ARE INVADING ARCHIVE TEAM. I GUESS JASON SCOTT IS HAPPY ABOUT THAT.
[17:58] MarkDille is a wiki guy too
[17:58] and Nemo_bis
[17:58] and me
[17:59] lulz
[18:03] greg is more than a wikipedian
[18:04] he does a lot of things in Wikimedia
[18:05] Who is Greg?
[18:05] http://toolserver.org/~pathoschild/stalktoy/?target=gmaxwell
[18:06] > IIRC, it Greg Maxwell who had (some of?) the images that the
[18:06] John wrote:
[18:06] > Foundation lost when a bug was rolled into production.
[18:06] Yes. He has a partial copy of the images.
[18:06] And I wouldn't call me a Wikipedian - I watched Wikipedia and Wikimedia grow up. :-) I am a plain ole WikiGuy :-)
[18:07] eww. I like that toolserver link
[18:07] thanks Nemo_bis
[18:07] thank pathos :-p
[18:07] thanks pathos
[18:08] :-D
[18:08] :-D
[18:10] wikiindex migration finished?
[18:15] There's Wikipedians and there's people who work with Wikipedia
[18:15] Difference.
[18:16] And there's Wikimedians.
[18:16] I put them under wikipedians
[18:16] noooooooooooo
[18:16] Ha ha, do not care about self-identification.
[18:16] It's plainly wrong.
[18:17] Nope, care not coming
[18:17] nggggggghhhhhh
[18:17] I'm out, can't find the care
[18:17] Anyway, I have to go run a panel right now, I just was stopping in for a specific statement.
[18:17] http://meta.wikimedia.org/wiki/Wikimedian btw
[18:18] Whoever twittered the URL to my private unpacking of the Gmaxwell Rip: You were stupid. Don't do that again.
[18:18] I was putting it somewhere for archive team members to mull/plan from without them needing to get a 33gb torrent to investigate themselves.
[18:18] I know it was twittered privately, but then your little friends started twittering it publicly
[18:18] So now it is gone
[18:21] ..
[18:22] Now, maybe this is a bit subtle, but as for the thing about wikipedians and wikipedia, my dislike is how the environment encourages a certain type of personality to flourish that is normally shot and left in some closet in a flophouse.
[18:22] But they are not the majority, the brilliance is that it sneaks people with actual good skills and outlook and abilities to contribute.
[18:24] So maybe that's subtle, but whatever. So there's a set of personalities from the last, what, 8 years I've been on Wikipedia, and GMaxwell is one of them.
[18:24] But let me be clear: The skillset that emijrp and others have brought to the archiveteam wiki has really made that shine, and I really appreciate it.
[18:26] The HTML is trivial to edit out, by the way
[18:29] Also, shit's getting real with Friendster.
[18:29] Lots of packs of data coming in.
[18:32] Ersi's is now transferring over.
[18:32] We're well past a few terabytes
[18:33] Yay
[18:36] sorry SketchCow, my fault :-(
[18:36] * Nemo_bis slaps himself
[18:41] any news from shoutwikis Nemo_bis ?
[18:43] hm, no
[18:43] We should start downloading them, I suppose
[18:43] How many for each person?
[18:43] All. For you.
[18:43] Ah ok. :-D
[18:44] lol
[18:44] How many in parallel, then?
[18:45] 1.
[18:45] They are tiny.
[18:46] I did some stats (100 random wikis) and most of them were almost empty.
[18:46] aww, only 1 is boring
[18:46] list is here http://code.google.com/p/wikiteam/source/browse/trunk/listsofwikis/shoutwiki.com
[18:46] be nice with servers, not emergency by now
[18:47] not until we kill them :-p
[18:47] we may offer single 7z for every dump and a global dump with the entire shoutwiki
[18:48] yes, 7z and then tar
[18:48] A single item on archive.org
[18:48] then access single 7z with tar.php
[18:49] not sure, to IA you can upload trivially (it is automatic)
[18:49] but do what you want
[18:49] I know, why the tar then?
[18:49] i mean a bunch of files in the same item
[18:49] It depends how big they are
[18:49] ok, we will see
[18:49] But these are going to be tiny
[18:52] man, shoutwiki is slow
[18:53] you DoSed them?
[18:53] : P
[18:53] SPAM MOMENT COURTESY BY EMIJRP: #WIKITEAM
[18:58] Is gmaxell's stash on IA yet?
[18:58] *gmaxwell
[19:02] was just wondering the same thing
[19:03] lul
[19:04] gmaxwell has a rsync of all wikipedia images too
[19:04] that guy has some interesting stuff
[19:05] only Wikipedia?
[19:05] wikimedia commons, i wrote that for non-wikipedia-geeks
[19:07] wikiteam is going to develop a similar script, but to pack images using date
[19:07] uploaded images in 2005-01-01, and so on...
[19:07] yes
[19:07] currently, about 6 GB are uploaded every day
[19:08] so, we catch from old dates, that is 'stable' content
[19:08] newer ones may contain copyvios
[19:08] ah, so you'll never have to update your torrents, just issue new ones
[19:09] db48x: yep, it is better
[19:09] although not sure if torrent or only IA items
[19:10] i saw a website which generates webseed using IA, i think
[19:16] cool
[19:22] emijrp, will you do it from Toolserver?
[19:38] upload to IA? i dunno
[19:38] toolserver has only 1-2tb free
[19:40] 365 days * 6 GB = 2 TB
[20:27] hahaha, read this http://wikitech.wikimedia.org/view/Disaster_Recovery
[21:21] Back
[21:21] Ok, so, after some informal discussion, I'm not putting that torrent on IA.
[21:21] However, obviously one of you fuckers can, if you're so inclined.
[21:22] I can certainly give education to someone about ways it could happen
[21:25] heh
[21:27] SketchCow: Doesn't get stuff done to get stuff done.
[21:30] Is that even english
[21:30] the words are all individually well-formed
[21:33] works if you remove the :
[21:40] SketchCow, can you help me to create a well-formed metadata.csv for the bulk uploader?
[21:40] http://p.defau.lt/?On1cdi8BZpJPb8HVvRvoewp
[21:40] (This is what I thought of as a format.)
[21:52] I didn't see it.
[21:52] There's basically two ways to do this.
[21:52] I look at the metadata, and I feel it can be combined into a description, in most ways.
[23:34] 0.5% done on the JSTOR torrent
[23:34] downloading at 100kilobytes/s, uploading at 1.5megabytes/s
[23:34] oh those zany torrents
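
Editor's sketch, not part of the log: the records pasted at 15:09-15:10 look like RIS-style tag/value lines, and the 21:52 remark suggests folding that metadata into a single description. A minimal Python sketch of that idea follows. The names `parse_records`, `describe` and `SAMPLE` are hypothetical, made up for illustration; this is not gmaxwell's script and not the archive.org bulk uploader's metadata.csv format, neither of which appears in the log. It assumes each record ends with an `ER -` line, as the second pasted record does.

```python
import re

# Matches tag lines such as "TY - JOUR" or "ER -": a two-character tag, a dash, an optional value.
TAG_LINE = re.compile(r"^([A-Z][A-Z0-9])\s+-\s?(.*)$")

# The second record from the log, with the IRC timestamps removed.
SAMPLE = """\
JF - Proceedings of the Royal Society of London. Series A, Containing Papers of a Mathematical and Physical Character (1905-1934)
T1 - On the Electricity of Rain and Its Origin in Thunderstorms
TY - JOUR
VL - 82
IS - 553
SP - 169
EP - 172
PY - 1909/05/06/
UR - http://dx.doi.org/10.1098/rspa.1909.0020
M3 - doi:10.1098/rspa.1909.0020
AU - Simpson, G.
ER -
"""


def parse_records(text):
    """Split RIS-style text into a list of dicts mapping tag -> list of values."""
    records, current = [], {}
    for raw in text.splitlines():
        match = TAG_LINE.match(raw.strip())
        if not match:
            continue  # skip blank lines and anything that is not a tag line
        tag, value = match.group(1), match.group(2).strip()
        if tag == "ER":  # end-of-record marker
            if current:
                records.append(current)
            current = {}
        else:
            current.setdefault(tag, []).append(value)
    if current:  # tolerate a final record that has no ER line
        records.append(current)
    return records


def describe(record):
    """Combine the main fields of one record into a multi-line description."""
    def first(tag):
        return record.get(tag, [""])[0]

    fields = [
        ("Title", first("T1")),
        ("Author(s)", "; ".join(record.get("AU", []))),
        ("Journal", first("JF")),
        ("Volume", first("VL")),
        ("Pages", "-".join(p for p in (first("SP"), first("EP")) if p)),
        ("Identifier", first("M3")),
    ]
    # Leave out fields the record does not carry.
    return "\n".join(f"{label}: {value}" for label, value in fields if value)


records = parse_records(SAMPLE)
print(describe(records[0]))
```

Values are kept as lists because a tag such as `AU` can repeat within one record. Any HTML left in the metadata (mentioned at 17:33) would still need cleaning before upload; that step is not shown here.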