[07:07] I'm back!
[07:07] (My machine had a hard drive going slightly bad, enough that once every 5-7 days it would crash the machine but SMART didn't pick it up.
[07:08] SketchCow, FOS is full and causing us problems
[07:08] So I finally bit the bullet, bought a SSD drive, did the fragginatin' and the cloninatin' and here I am with a machine that boots in, like, 12 seconds.
[07:09] It is not full.
[07:09] It's bloated to be sure but not full.
[07:10] xmc, was the disk full error returned to the tracker?
[07:11] Which tracker.
[07:11] He didn't specify
[07:26] I just looked at the graph and saw that the disk had filled
[07:26] well strictly speaking we're about 350M out from disk-full
[07:27] I've seen creeping disk fill on the tracker several times, so it's probably something gone slightly off the rails
[07:37] underscor: ping when alive plz :)
[07:40] http://zeppelin.xrtc.net/corp.xrtc.net/shilling.corp.xrtc.net/df.html
[08:46] Yeah, see, none of these are my machine.
[08:48] Is that the url tracker then?
[08:49] That's the tracker machine, yes.
[08:50] GLaDOS: what link
[09:01] SketchCow: in january of 2012 you scraped some fanfiction.net stories using the fanfictiondownloader project from googlecode. for the last few months ive been doing the same thing. I have from id 1 to 3 million and have most of the intervening numbers between 3-5 and 5-7 million, running 2 paralell downloads. its still going and currently its at about 80gb of text files, is there somewhere i could rsync this? im almost out of sp
[09:02] incidentally im also the guy who grabbed all of ao3
[09:02] bsmith094: you can upload it to IA....
[09:02] not via rsync, but.... ia3uploader script?
[09:02] Or just the form on the website, just log in first
[09:05] what does 80gb of text compress to cause i ive only got 24gb of space left
[09:08] SketchCow: was not saying your box is full. said it elsewhere and omf_ seems to have gone off on his own tangent
[09:15] SmileyG: whats the link to the ia3uploader script
[09:17] off 2 bed back in ~10 hrs
[09:17] https://github.com/kngenie/ias3upload
[09:17] bsmith094: https://github.com/kngenie/ias3upload
[09:17] thanks
[10:49] GLaDOS, and I have been working on the docs for our servers. We have 6 servers, 5 of which are up and running different services for us
[13:00] I guess everyone here has seen http://arstechnica.com/tech-policy/2013/08/changing-ip-address-to-access-public-website-ruled-violation-of-us-law/
[13:24] oh dear, im going to gitmo.
[13:25] we all are :D
[13:30] Groklaw is stoping posting new things, no sign on an actual shutdown or not http://www.groklaw.net/article.php?story=20130818120421175
[14:01] does anyone have any ideas on how to get all comments from groklaw.net?
[14:13] i have another problem
[16:21] We should grab groklaw
[16:27] godane: Set the view mode to "nested" instead of "threaded" (or "flat" or "printable" would work too, I guess); then all comments show on the article page directly (it seems to set a cookie)
[16:38] Just this morning PJ, the maintainer of Groklaw.net, basically announced the end of the site. Groklaw has covered many important cases and events in software patent/copyright law for many years and is a really unique source of that history. While it wouldn't be in her nature to just shut down the site without explicit warning or making a backup available, due to the nature of her last message I don't know if we can be sure. There are alre
[16:38] all recent articles and other sections of the site are not backed up as far as I can tell. I am worried that if Groklaw is lost, a lot of important history will be lost along with it. I remembered hearing about archive team so I came here. Apologies for the essay.
[16:48] anon42: we're well aware...
[16:54] balrog: oops. no reason to get dramatic then. what's your guys' take on it then?
[17:11] anon42: our take doesn't matter
[17:12] our take is
[17:12] lets back that shit up
[17:13] If you want to hear different opinions about how we feel ask in #archiveteam-bs. That is the channel where we have those discussions.
[17:17] Good to know. I just realized the first message I saw in here is actually refers to this and I missed it. So, how can I help?
[17:19] Firstly, does anyone want to call themselves out as doing it?
[17:20] SketchCow: your normally on stuff like this damn fast?
[17:20] anon42: we have a wiki which documents how you can take your own archive of the site at: http://www.archiveteam.org/index.php?title=Wget#Creating_WARC_with_wget
[17:24] I have one grab going for the pages and plan a follow up for the pdfs
[17:24] https://law.resource.org/pub/us/code/ga/ needs backup
[17:46] it does.
[17:57] I create a simple network diagram of the warriors. http://picpaste.com/6uO20RMg.png Is anything missing? Is there a better format to use?
[17:59] I am going to lay it out different in the next version so none of the labels are obscured by connection lines
[17:59] looks right
[18:16] Deewlant: my problem is i don't know how do it in the url
[18:16] give me the url that works then i can do it
[18:17] godane: Look into the cookie that it sets and just send that as part of the request, I don't think you can set it in the URL
[18:21] Here is version 2 of the warrior network - http://picpaste.com/pics/Pz81z7Mx.1377022875.png
[19:20] Deewiant: its not working
[19:21] i only got 2 lines in cookies with growlaw.net
[19:21] .groklaw.net TRUE / FALSE 1408547640 LastVisit 1377026061
[19:21] .groklaw.net TRUE / FALSE 1377012240 LastVisitTemp 1377022575
[19:22] I am already 500mb deep into my groklaw grab
[19:22] are you grabbing all comments?
[19:22] everything
[19:23] what are you using for commands?
[19:23] godane: Ah sorry it's evidently just a post request, use &mode=nested in the url
[19:23] Didn't really look into it properly earlier, sorry again for the trouble
[19:24] fuck yes
[19:24] this well work
[19:37] also good news is that if a get error on byte with a list of urls i only stop going to down that one page and goes to the next url in the list
[19:37] so it doesn't just fail and calls quits on me
[19:38] :/
[19:38] good, can yoiu log the error too?
[19:41] i log everything these days
[19:41] to make sure we know what is missing or just 500 error on me
[20:54] https://vine.co/v/hMLVA1emhej
[20:54] Someone save that
[20:54] That's going to disappear.
[20:56] got it
[20:56] o.o
[20:56] óò
[20:58] http://www.buzzfeed.com/mikehayes/this-terrifying-vine-shows-the-exact-moment-a-truck-flies-ov
[20:58] (He survived)
[22:18] One of the newly uploaded Prelinger films is basically a mid-century modern animated version of the IPV4 --> IPV6 upgrades: https://archive.org/details/6317_Mr_Digit_and_the_Battle_of_Bubbling_Brook_01_15_17_02
[22:18] Really cute, and definitely quotable. Letters! They're hemming us in!