[09:30] Coderjoe, lemonkey: spread the @ please
[09:38] give love five lof
[12:07] OPS please.
[12:21] Hi SketchCow
[15:23] Hi, I am here from a phone but I have a request for a panic download
[15:23] AndroUser: yes, go ahead
[15:23] also /nick username
[15:25] I realize this is pretty geographically specific, but journalstar.com posted last night saying they are putting up a $10/mo paywall
[15:25] http://journalstar.com/news/local/journalstar-com-building-for-the-future/article_f9a4c22f-9572-52f0-b5b6-ad673a34d7f4.html
[15:25] Was supposed to take effect this morning but right now it is broken
[15:25] huh, 10 views per month
[15:25] I wonder how they detect that
[15:25] And tens of thousands of news stories are in immediate jeopardy
[15:25] they will delete them?
[15:25] this will probably backfire on them though
[15:26] "JournalStar.com building for the future" is a pretty crappy headline for this.
[15:28] I wish I had the power to work on this but I have no internet access... it seems like they are in big trouble though. I imagine the whole site could be grabbed in an afternoon. Hopefully before they fix the paywall. No idea whether they plan on deleting historical content but with this I wouldn't put it past them.
[15:28] Anyway I just wanted to relay this, I hope you guys can help
[15:28] Thank you either way
[15:28] is it http://journalstar.com/news/local/ ?
[15:28] restricted to that?
[15:28] Yed
[15:28] Yes
[15:28] Should be
[15:28] There may also be pics under a different subdir
[15:28] They have various photo galleries too
[15:29] Comments sections are on most articles as well
[15:30] Right now the paywall is JS and bypassable, they could fix it at any time
[15:30] They have had a whole host of technical difficulties lately but say they are working to remedy them
[15:32] Thanks guys, I hope you can help
[15:33] i am throwing "wget -a journalstar.com_news_local_20120730.log -e robots=off -nv --adjust-extension --convert-links --page-requisites --span-hosts -D journalstar.com,townnews.com -m -np --user-agent="Googlebot hurr durr" --warc-file=journalstar.com_news_local_20120730 http://journalstar.com/news/local/" at it
[15:35] :D
[15:35] love the user agent ;D
[15:36] doing the same for http://journalstar.com/sports/local/
[15:39] seems to run fairly well
[15:43] * SmileyG doesn't understand exactly what's going on.
[15:43] does wget-warc take the same options as wget?
[15:44] I'm just thinking of firing up and starting to test stuff; but I don't really know where to start other than copying someone else?
[15:46] Yup, it takes the same options except, obviously, it supports --warc-file=
[15:46] i use an up-to-date wget
[15:46] it supports warc
[15:46] Hopefully an actual point release with warc will come by one of these days.
[15:48] * SmileyG ponders some of the options and how you realise you need them
[15:49] Such as convert links, and others..
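The options in the 15:33 wget command can be read one at a time. The sketch below is a dry run: it only prints the argument list, one argument per line, and never contacts the site. The helper name mirror_args is hypothetical and not from the log; the flags, domains, and URL are the ones quoted at 15:33, and the per-flag comments are short paraphrases of the wget manual.

```shell
#!/bin/sh
# Dry run: print the wget argument list one per line instead of fetching.
# mirror_args is a hypothetical helper name, not something from the log.
mirror_args() {
  name=$1   # base name shared by the log file and the WARC file
  url=$2    # start URL of the crawl
  set -- wget
  set -- "$@" -a "$name.log"                    # append messages to a log file
  set -- "$@" -e robots=off                     # ignore robots.txt
  set -- "$@" -nv                               # terse, non-verbose messages
  set -- "$@" --adjust-extension                # add .html/.css suffixes where missing
  set -- "$@" --convert-links                   # rewrite links for local viewing
  set -- "$@" --page-requisites                 # fetch images, CSS, scripts too
  set -- "$@" --span-hosts                      # allow other hosts ...
  set -- "$@" -D journalstar.com,townnews.com   # ... limited to these domains
  set -- "$@" -m                                # mirror: recursion plus timestamping
  set -- "$@" -np                               # never ascend to the parent directory
  set -- "$@" --user-agent="Googlebot hurr durr"
  set -- "$@" --warc-file="$name"               # also record the crawl as a WARC
  set -- "$@" "$url"
  printf '%s\n' "$@"
}

mirror_args journalstar.com_news_local_20120730 http://journalstar.com/news/local/
```

To turn the dry run into a real crawl, the final printf line would be replaced by "$@", assuming a wget build with WARC support as discussed at 15:46.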
[15:50] crap, now i am downloading things like "http://local.journalstar.com/malone+manor+bus+office.9.105559913p.home.html" and "http://www2.journalstar.com/admarket/business_services/alterations_sewing/"
[15:50] o_O
[15:52] adding --exclude-domains=www2.journalstar.com,local.journalstar.com,my.journalstar.com,local.journalstar.com
[15:52] wtf, I threw it at the forums and it downloaded just the theme o_O
[15:53] Registered users: djsmiley2k, Google [Bot], shinymcshine <<< HAHAHAH
[15:54] So, how do I make it actually follow the links :S
[15:55] ../mobileme-grab/wget-warc -v -a gamestm.com_30072012.log -e robots=off --adjust-extension --convert-links --page-requisites -nv --user-agent="Googlebot hurr durr" --warc-file=gamestm_30072012 http://www.gamestm.co.uk/forum
[15:55] -m
[15:55] what's -m do then?
[15:55] mirror
[15:55] -nv is non-verbose btw
[15:55] :D
[15:55] even without -nv and with -v it's still silent :|
[15:56] oh eek
[15:56] because of -a
[15:56] it's straying outside of /forum/
[15:56] -np
[15:57] so remove -a? add -np?!
[15:58] man wget and see what those options are
[15:58] sorry :P
[15:58] yeah I went to do that and accidentally closed my terminal XD
[15:59] Oh you can't append to log file AND view it :/
[15:59] you could use tee
[15:59] i always use another screen and tail -f the log
[15:59] --adjust-extension isn't in the help file ;)
[15:59] Schbirid: ctrl+Z; bg; tail
[16:00] or convert-links :(
[16:02] * SmileyG figured them out he thinks
[16:03] wtf
[16:03] with -np on it STILL went to a different dir instead of /forum/
[16:08] It has been confirmed. I am giving a talk in September about big data and archive.org and AT are main topics
[16:08] getting the word out
[16:17] should have excluded /news/local/records/
[18:14] Ops, please.
[18:14] Everyone, I'm still doing stuff here in Vegas, but by Wednesday, I will be DESTROYING my backlog.
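On the 15:59 question about appending to a log file and viewing it at the same time: a minimal sketch of the tee suggestion follows. The helper name run_logged is hypothetical. It relies on wget writing its messages to stderr when -a or -o is not given, so stderr is merged into stdout before the pipe. The demo at the end uses echo rather than wget, so nothing is downloaded.

```shell
#!/bin/sh
# Hypothetical helper: run a command, show its output live, and append
# the same output to a log file.
run_logged() {
  log=$1
  shift
  # Merge stderr into stdout so tee sees wget's messages as well.
  "$@" 2>&1 | tee -a "$log"
}

# Demo with a harmless command instead of a real crawl.
demo_log=$(mktemp)
run_logged "$demo_log" echo "demo line"
rm -f "$demo_log"

# With wget, drop -a and let tee write the log instead, for example:
#   run_logged gamestm_30072012.log ../mobileme-grab/wget-warc -m -np \
#     --warc-file=gamestm_30072012 http://www.gamestm.co.uk/forum/
# The other suggestion from the log keeps -a and follows the file from
# a second terminal or screen window:
#   tail -f gamestm.com_30072012.log
```

The commented example ends the URL with a trailing slash. Without it, wget may treat "forum" as a file name and take the site root as the parent directory, which is one possible explanation for -np not keeping the crawl inside /forum/ at 16:03; the log does not confirm the cause.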
[18:19] :)
[18:19] I finally started doing something \o/
[18:19] figuring out when to do what is hard; I'm trying to do kernel testing for gentoo too
[18:35] "stuff" being "cocaine"
[18:45] SketchCow: any idea when underscor will be back?
[19:18] This stuff is DELICIOUS
[19:31] balrog_: spread the @
[19:33] Thank youuuuuuu
[19:34] \o/
[19:34] ORDER HAS BEEN RESTORED.
[19:34] SketchCow: still busy as ****?
[19:36] SketchCow: How many Tera of video did you collect at Defcon?
[19:38] http://www.sexfilmler.com free hardcore movies!
[19:38] grrrrr
[19:39] that was the answer
[19:41] over 1, twitter reports
[19:41] SketchCow: can you delete http://archive.org/details/Wiki-BibliotecaWikimedia ? it's a test item
[19:42] I was wondering if a better tally showed up between closing ceremonies and now.
[19:47] Nemo_bis: Darked out
[19:48] thanks
[20:55] can haz ops?
[23:54] how the hell did i get a torrent for one item: http://archive.org/details/GBTV_01_25_2012
[23:54] i want them for all :-D