#archiveteam 2012-07-30,Mon


Time Nickname Message
09:30 🔗 chronomex Coderjoe, lemonkey: spread the @ please
09:38 🔗 SmileyG give love five lof
12:07 🔗 SketchCow OPS please.
12:21 🔗 BlueMaxim Hi SketchCow
15:23 🔗 AndroUser Hi, I am here from a phone but I have a request for a panic download
15:23 🔗 balrog_ AndroUser: yes, go ahead
15:23 🔗 balrog_ also /nick username
15:25 🔗 AndroUser I realize this is pretty geographically specific, but journalstar.com posted last night saying they are putting up a $10/mo paywall
15:25 🔗 AndroUser http://journalstar.com/news/local/journalstar-com-building-for-the-future/article_f9a4c22f-9572-52f0-b5b6-ad673a34d7f4.html
15:25 🔗 AndroUser Was supposed to take effect this morning but right now it is broken
15:25 🔗 balrog_ huh, 10 view per month
15:25 🔗 balrog_ I wonder how they detect that
15:25 🔗 AndroUser And tens of thousands of news stories are in immediate jeopardy
15:25 🔗 balrog_ they will delete them?
15:25 🔗 balrog_ this will probably backfire on them though
15:26 🔗 mistym "JournalStar.com building for the future" is a pretty crappy headline for this.
15:28 🔗 AndroUser I wish I had the power to work on this but I have no internet access... it seems like they are in big trouble though. I imagine the whole site could be grabbed in an afternoon. Hopefully before they fix the paywall. No idea whether they plan on deleting historical content but with this I wouldn't put it past them.
15:28 🔗 AndroUser Anyway I just wanted to relay this, I hope you guys can help
15:28 🔗 AndroUser Thank you either way
15:28 🔗 Schbirid is it http://journalstar.com/news/local/ ?
15:28 🔗 Schbirid restricted to that?
15:28 🔗 AndroUser Yed
15:28 🔗 AndroUser Yes
15:28 🔗 AndroUser Should be
15:28 🔗 AndroUser There may also be pics under a different subdir
15:28 🔗 AndroUser They have various photo galleries too
15:29 🔗 AndroUser Comments sections are on most articles as well
15:30 🔗 AndroUser Right now the paywall is JS and bypassable, they could fix it at any time
15:30 🔗 AndroUser They have had a whole host of technical difficulties lately but say they are working to remedy them
15:32 🔗 AndroUser Thanks guys, I hope you can help
15:33 🔗 Schbirid i am throwing "wget -a journalstar.com_news_local_20120730.log -e robots=off -nv --adjust-extension --convert-links --page-requisites --span-hosts -D journalstar.com,townnews.com -m -np --user-agent="Googlebot hurr durr" --warc-file=journalstar.com_news_local_20120730 http://journalstar.com/news/local/" at it
15:35 🔗 SmileyG :D
15:35 🔗 SmileyG love the user agent ;D
15:36 🔗 Schbirid doing the same for http://journalstar.com/sports/local/
15:39 🔗 Schbirid seems to run fairly well
15:43 🔗 * SmileyG doesn't understand exactly whats going on.
15:43 🔗 SmileyG does wget-warc take the same options as wget?
15:44 🔗 SmileyG I'm just thinking of firing up and starting to test stuff; but I don't really know where to start other than copying someone else?
15:46 🔗 mistym Yup, it takes the same options except, obviously, it supports --warc-file=
15:46 🔗 Schbirid i use an up-to-date wget
15:46 🔗 Schbirid it supports warc
15:46 🔗 mistym Hopefully an actual point release with warc will come by one of these days.
15:48 🔗 * SmileyG ponders some of the options and how you realise you need them
15:49 🔗 SmileyG Such as convert links, and others..
15:50 🔗 Schbirid crap, now i am downloading things like "http://local.journalstar.com/malone+manor+bus+office.9.105559913p.home.html" and "http://www2.journalstar.com/admarket/business_services/alterations_sewing/"
15:50 🔗 SmileyG o_O
15:52 🔗 Schbirid adding --exclude-domains=www2.journalstar.com,local.journalstar.com,my.journalstar.com,local.journalstar.com
15:52 🔗 SmileyG wtf, I threw it at the forums and it downloaded just the theme o_O
15:53 🔗 SmileyG Registered users: djsmiley2k, Google [Bot], shinymcshine <<< HAHAHAH
15:54 🔗 SmileyG So, how do I make it actually follow the links :S
15:55 🔗 SmileyG ../mobileme-grab/wget-warc -v -a gamestm.com_30072012.log -e robots=off --adjust-extension --convert-links --page-requisites -nv --user-agent="Googlebot hurr durr" --warc-file=gamestm_30072012 http://www.gamestm.co.uk/forum
15:55 🔗 Schbirid -m
15:55 🔗 SmileyG whats -m do then?
15:55 🔗 Schbirid mirror
15:55 🔗 Schbirid -nv is non-verbose btw
15:55 🔗 SmileyG :D
15:55 🔗 SmileyG even without -nv and with -v its still silent :|
15:56 🔗 SmileyG oh eek
15:56 🔗 Schbirid because of -a
15:56 🔗 SmileyG its straying outside of /forum/
15:56 🔗 Schbirid -np
15:57 🔗 SmileyG so remove -a? add -np?!
15:58 🔗 Schbirid man wget and see what those options are
15:58 🔗 Schbirid sorry :P
15:58 🔗 SmileyG yeah I went to do that and accidentally closed my terminal XD
15:59 🔗 SmileyG Oh you can't append to log file AND view it :/
15:59 🔗 Schbirid you could use tee
15:59 🔗 Schbirid i always use another screen and tail -f the log
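Editor's sketch of the two approaches mentioned here, for readers following along. With -a, wget appends its messages to the log file instead of printing them, which is why the run looked silent. The `fake_crawl` function and the `demo_crawl.log` filename are hypothetical stand-ins so the example runs without touching the network; they are not from the chat.

```shell
#!/usr/bin/env bash
# Sketch: two ways to keep a log file AND see progress.
# "fake_crawl" is a hypothetical stand-in for `wget -a "$log" ...`.
log="demo_crawl.log"
: > "$log"   # start with an empty log for the demo

fake_crawl() {
  # Appends lines to the log, the way wget's -a option appends its messages.
  for i in 1 2 3; do
    echo "fetched page $i" >> "$log"
  done
}

fake_crawl

# Option 1 (Schbirid's habit): leave -a in place and follow the log
# from another terminal or screen window:
#   tail -f "$log"

# Option 2: drop -a and use tee. wget writes its messages to stderr,
# so a real invocation would need 2>&1 before the pipe:
#   wget -nv ... 2>&1 | tee -a "$log"
echo "fetched page 4" 2>&1 | tee -a "$log"

count=$(grep -c "fetched" "$log")
echo "log now has $count lines"
```

Option 1 keeps the crawl command unchanged; option 2 trades that for seeing output in the same terminal.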
15:59 🔗 SmileyG --adjust-extension isn't in the help file ;)
15:59 🔗 SmileyG Schbirid: ctrl+Z; bg; tail
16:00 🔗 SmileyG or convert-links :(
16:02 🔗 * SmileyG figured them out he thinks
16:03 🔗 SmileyG wtf
16:03 🔗 SmileyG with -np on it STILL went to a different dir instead of /forum/
16:08 🔗 omf_ It has been confirmed. I am giving a talk in September about big data and archive.org and AT are main topics
16:08 🔗 omf_ getting the word out
16:17 🔗 Schbirid should have excluded /news/local/records/
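Editor's sketch of how the crawl command from 15:33 might look with the later corrections folded in. It assumes the /news/local/records/ exclusion would be done with wget's -X (--exclude-directories) option, which the chat does not confirm, and it drops the duplicate local.journalstar.com entry from the 15:52 list. The script only assembles and prints the command; it does not start a crawl.

```shell
#!/usr/bin/env bash
# Sketch only: build the revised command as an array and print it.
stamp="20120730"
name="journalstar.com_news_local_${stamp}"

cmd=(
  wget
  -a "${name}.log"        # append wget's messages to a log file
  -e robots=off           # ignore robots.txt
  -nv                     # non-verbose output
  --adjust-extension --convert-links --page-requisites
  --span-hosts
  -D journalstar.com,townnews.com
  --exclude-domains=www2.journalstar.com,local.journalstar.com,my.journalstar.com
  -X /news/local/records  # assumption: -X is how the directory gets excluded
  -m -np                  # mirror, and never ascend to the parent directory
  --user-agent="Googlebot hurr durr"
  --warc-file="${name}"
  http://journalstar.com/news/local/
)

# Print the command in a shell-quoted form for review.
printf '%q ' "${cmd[@]}"; echo

# To actually run the crawl, uncomment the next line:
# "${cmd[@]}"
```

Keeping the arguments in an array preserves the space inside the user-agent string when the command is eventually executed.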
18:14 🔗 SketchCow Ops, please.
18:14 🔗 SketchCow Everyone, I'm still doing stuff here in Vegas, but by Wednesday, I will be DESTROYING my backlog.
18:19 🔗 SmileyG :)
18:19 🔗 SmileyG I finally started doing something \o/
18:19 🔗 SmileyG figuring out when to do what is hard; I'm trying to do kernel testing for gentoo too
18:35 🔗 DFJustin "stuff" being "cocaine"
18:45 🔗 Schbirid SketchCow: any idea when underscor will be back?
19:18 🔗 SketchCow This stuff is DELICIOUS
19:31 🔗 chronomex balrog_: spread the @
19:33 🔗 SketchCow Thank youuuuuuu
19:34 🔗 chronomex \o/
19:34 🔗 chronomex ORDER HAS BEEN RESTORED.
19:34 🔗 balrog_ SketchCow: still busy as ****?
19:36 🔗 goekesmi SketchCow: How many Tera of video did you collect at Defcon?
19:38 🔗 sexfilmle http://www.sexfilmler.com free hardcore movies!
19:38 🔗 chronomex grrrrr
19:39 🔗 Schbirid that was the answer
19:41 🔗 chronomex over 1, twitter reports
19:41 🔗 Nemo_bis SketchCow: can you delete http://archive.org/details/Wiki-BibliotecaWikimedia ? it's a test item
19:42 🔗 goekesmi I was wondering if a better tally showed up between closing ceremonies and now.
19:47 🔗 SketchCow Nemo_bis: Darked out
19:48 🔗 Nemo_bis thanks
20:55 🔗 SmileyG can haz ops?
23:54 🔗 godane how the hell did i get a torrent for one item: http://archive.org/details/GBTV_01_25_2012
23:54 🔗 godane i want them for all :-D
