[00:01] alard: sweet, thanks
[00:12] chronomex: Isn't this rad?
[00:12] hahaha
[00:12] You win the game by idling
[00:28] oh boy. another IRC channel in my already-overcrowded channel bar
[00:28] Coderjoe: Come on, you know you want to play the game
[00:29] what is the "class_name" field?
[00:29] Whatever you want
[00:29] Mine is "FUCKING ARCHIVIST!" and chronomex's is "cardboard box collector"
[00:29] chronomex, the cardboard box collector, has attained level 2! Next level in 0 days, 00:13:27.
[00:29] dude this is awesome
[00:30] chronomex: hahaha
[00:30] I was wondering what was highlighting me
[00:30] now i know
[00:30] chronomex: eventually you'll get to days and weeks of idle between levels
[00:31] I know, I've played this before
[00:31] oh awesome
[00:31] I was just working and kept getting highlighted
[00:31] I was like "hmmmm"
[00:31] cos most of the shit my boss tells me to do, he does by irc
[00:33] ha
[00:33] oh damn
[00:33] Cannot write to `listserv.aol.com/cgi-bin/wa?A2=ind0709&L=class8731&D=0&F=P&T=0&P=1717&F=' (No space left on device).
[00:33] ;_;
[00:33] doh
[00:40] most of the volume is being used up in the mobileme grab
[00:55] - Discovering urls (XML)... ERROR (1). │
[00:55] Anyone else get errors like this?
[00:55] Downloading public.me.com/sw1975 │
[00:55] Error downloading 'sw1975'.
[00:55] Error downloading from public.me.com. │
[00:55] Then it craps out
[01:09] alard: Need help when you get a chance
[01:11] dammit. s3cmd spits out upload updates way too rapidly
[01:13] oh, heh
[01:13] cut: data/j/ju/jun/junko_tuki/web.me.com/wget-discovery.log: Illegal byte sequence
[01:13] so many fun and wonderful things on mobileme
[01:14] owO
[01:14] I didn't know youtube even allowed stuff like this http://www.youtube.com/watch?v=Nw8_bOPgwPw&feature=related
[01:14] oh ugh, wtf
[01:14] (nsfw)
[01:14] I'm at work, what is it?
[01:15] https://gist.github.com/6a38c8e6a896f34de00f <-- said wget-discovery.log
[01:15] german television program part about collecting dog semen
[01:15] jeepers creepers
[01:15] Semen collection from a dog
[01:15] ah
[01:16] Payment required lol
[01:16] yeah, I dunno if that shows up in every wget-discovery.log just yet
[01:16] i've seen it at least once before
[01:24] hmm
[01:25] a quick grep is showing 1444 instances
[01:56] hmm.
[01:56] I wonder if apple is saying that any companies with web spiders are supposed to pay them
[01:58] appears to always be robots.txt requests
[01:58] er
[01:59] strike that
[02:03] https://s3-us-west-1.amazonaws.com/archiveteam-cojo/mobileme/payment_required.txt
[02:04] lol first time I've actually seen that error code in use
[02:32] http://evangelicaltextualcriticism.blogspot.com/2011/11/robots-and-archives.html
[02:40] anyone have an archive of the computing lives podcast? the rss feeder link is dead
[02:46] I likely have all of them somewhere
[02:51] is that podcast archive (the huge one) online yet?
[03:08] No
[04:31] /j #2,000
[12:42] oh stupid efnet. k-lines my current ip and the sites the servers tell me where i can request a revoke are either timing out or parked...
[12:43] heh
[12:48] if someone wants to grab 3-4 months of daily robots.txt downloading for alexa's top 10k domains, here you go: http://www.quaddicted.com:27500/ (the files*.7z )
[12:48] i have not looked at it for ages so i am not sure what might hide inside
[12:48] the cronjob is just happily running in the early morning hours ;)
[15:44] underscor: Still need mobileme help? Or did a re-run fix it?
[15:46] as java is not so friendly to me, has someone compiled this? https://github.com/openplanets/wap
[15:46] it is a gui to inspect the content of a warc file
[15:46] alard: Hmm, looks like it might be working now
[15:46] one sec
[15:47] I tried to download the public.me.com data of the user you mentioned, but didn't have a problem. (There was no content, though.)
[15:48] In general, the error numbers you get from the script are the error numbers returned by wget. 1 is a 'general error', so that isn't very helpful.
[15:55] Hmm, same problem
[15:55] - Discovering urls (XML)... ERROR (1). │
[15:55] Error downloading 'kmisak'.
[15:55] Error downloading 'kmisak'. │
[15:55] Error downloading from public.me.com. │
[15:55] (that's from after it does the wget -m, and starts public
[15:58] Is there anything in data/k/km/kmi/kmisak/public.me.com/webdav-feed.xml ?
[15:59] No
[15:59] Oh
[15:59] Return code of the script was 6 btw
[16:00] Ah, yes, that's for 'can't download this user'. The wget exit code is in ERROR (1).
[16:00] Aha
[16:00] Or the curl exit code, to be more precise.
[16:00] This is done beautifully
[16:01] Does this give any interesting information:
[16:01] …opfind xmlns:DAV="DAV:">' --user-agent 'Mozilla/5.0 (Windows; U; Windows NT 6.1; en-US) AppleWebKit/533.20.25 (KHTML, like Gecko) Version/5.0.4 Safari/533.20.27'
[16:01] (Sorry, should be one line at the ...)
[16:01] curl https://public.me.com/ix/kmisak/ -v --request PROPFIND --header 'Content-Type: text/xml; charset=utf-8' --header 'Depth: infinity' --data '' --user-agent 'Mozilla/5.0 (Windows; U; Windows NT 6.1; en-US) AppleWebKit/533.20.25 (KHTML, like Gecko) Version/5.0.4 Safari/533.20.27'
[16:02] ha
[16:02] ha
[16:02] * Protocol https not supported or disabled in libcurl │62
[16:02] curl: (1) Protocol https not supported or disabled in libcurl
[16:02] oops
[16:02] Ah, that's a pity.
[16:02] Thanks for helping find it though :)
[16:02] I thought you had run the scripts successfully before? Did you change anything?
[16:51] alard: Different machine
[17:47] underscor: Ah, I see. (I've added a test for curl+ssl to the script, by the way.)
[17:47] alard: just to clarify: is the mobileme grab project actually officially going, or will there be a big reset-and-then-go action soon?
[17:47] I've hit a few inconsistencies on OS X that I think probably make what I've collected quite questionable
[17:48] yipdw: No, it's going, slowly. If you've found a few inconsistencies that's something to look into.
[17:48] ok
[17:48] But I've got to go now, hopefully speak to you later.
[17:48] sure, no problem
[17:48] I'll write up what I've hit so far
[17:53] alard: Is there a status page for the tracker?
[20:09] underscor: Status page: no, though there is a list: http://memac.heroku.com/
[20:11] Those are the "last done"?
[20:11] Yes.
[20:11] Oh, it's live too!
[20:11] Awesome
[20:12] That's great
[20:12] Apparently you've been quite active.
[20:12] Looks like it, heh
[20:18] 28GB in 2 hours
[20:18] Not shabby
[20:49] Hah, you can see where I started the mobileme grab
[20:49] http://tracker.archive.org/tracker.png
[20:49] http://tracker.archive.org/batcave.png too, for those that are interested
[22:10] GREETINGS FROM CALIFORNIA
[22:18] hey SketchCow
[22:18] where in california are you?
[23:00] San Francisco.
[23:14] underscor: http://memac.heroku.com/
[23:44] alard: hot damn
[23:44] That's AWESOME
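
Note on the curl+ssl check mentioned at 17:47: the log does not show how alard's script performs that test, so what follows is only a rough sketch of one way such a pre-flight check might look. It assumes the usual `Protocols:` line printed by `curl --version`; the function names `protocols_from_version_output` and `curl_supports_https` are hypothetical and do not come from the mobileme scripts.

```python
import subprocess


def protocols_from_version_output(version_output):
    """Return the set of protocol names on the 'Protocols:' line of `curl --version`."""
    for line in version_output.splitlines():
        if line.startswith("Protocols:"):
            # Everything after the first colon is a space-separated protocol list.
            return set(line.split(":", 1)[1].split())
    return set()


def curl_supports_https(curl_binary="curl"):
    """Hypothetical pre-flight check: does this curl build list https support?"""
    try:
        result = subprocess.run(
            [curl_binary, "--version"],
            capture_output=True,
            text=True,
            check=False,
        )
    except OSError:
        # curl is missing or cannot be executed, so treat it as unsupported.
        return False
    return "https" in protocols_from_version_output(result.stdout)
```

Calling `curl_supports_https()` before starting a download would flag a build like the one at 16:02, which failed with "Protocol https not supported or disabled in libcurl", up front rather than partway through a user.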