Time |
Nickname |
Message |
09:30
|
chronomex |
Coderjoe, lemonkey: spread the @ please |
09:38
|
SmileyG |
give love five lof |
12:07
|
SketchCow |
OPS please. |
12:21
|
BlueMaxim |
Hi SketchCow |
15:23
|
AndroUser |
Hi, I am here from a phone but I have a request for a panic download |
15:23
|
balrog_ |
AndroUser: yes, go ahead |
15:23
|
balrog_ |
also /nick username |
15:25
|
AndroUser |
I realize this is pretty geographically specific, but journalstar.com posted last night saying they are putting up a $10/mo paywall |
15:25
|
AndroUser |
http://journalstar.com/news/local/journalstar-com-building-for-the-future/article_f9a4c22f-9572-52f0-b5b6-ad673a34d7f4.html |
15:25
|
AndroUser |
Was supposed to take effect this morning but right now it is broken |
15:25
|
balrog_ |
huh, 10 views per month |
15:25
|
balrog_ |
I wonder how they detect that |
15:25
|
AndroUser |
And tens of thousands of news stories are in immediate jeopardy |
15:25
|
balrog_ |
they will delete them? |
15:25
|
balrog_ |
this will probably backfire on them though |
15:26
|
mistym |
"JournalStar.com building for the future" is a pretty crappy headline for this. |
15:28
|
AndroUser |
I wish I had the power to work on this but I have no internet access... it seems like they are in big trouble though. I imagine the whole site could be grabbed in an afternoon. Hopefully before they fix the paywall. No idea whether they plan on deleting historical content but with this I wouldn't put it past them. |
15:28
|
AndroUser |
Anyway I just wanted to relay this, I hope you guys can help |
15:28
|
AndroUser |
Thank you either way |
15:28
|
Schbirid |
is it http://journalstar.com/news/local/ ? |
15:28
|
Schbirid |
restricted to that? |
15:28
|
AndroUser |
Yed |
15:28
|
AndroUser |
Yes |
15:28
|
AndroUser |
Should be |
15:28
|
AndroUser |
There may also be pics under a different subdir |
15:28
|
AndroUser |
They have various photo galleries too |
15:29
|
AndroUser |
Comments sections are on most articles as well |
15:30
|
AndroUser |
Right now the paywall is JS and bypassable, they could fix it at any time |
15:30
|
AndroUser |
They have had a whole host of technical difficulties lately but say they are working to remedy them |
15:32
|
AndroUser |
Thanks guys, I hope you can help |
15:33
|
Schbirid |
i am throwing "wget -a journalstar.com_news_local_20120730.log -e robots=off -nv --adjust-extension --convert-links --page-requisites --span-hosts -D journalstar.com,townnews.com -m -np --user-agent="Googlebot hurr durr" --warc-file=journalstar.com_news_local_20120730 http://journalstar.com/news/local/" at it |
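For readers following along, the command above can be read flag by flag. The sketch below only assembles and prints the argument list; it does not run wget. The flag descriptions are paraphrased from the wget manual, and the helper name `build_wget_command` is hypothetical.

```python
import shlex
from typing import List


def build_wget_command(name: str, start_url: str) -> List[str]:
    """Assemble the mirror command quoted in the log (hypothetical helper)."""
    return [
        "wget",
        "-a", f"{name}.log",       # append log messages to this file
        "-e", "robots=off",        # run a wgetrc-style command: ignore robots.txt
        "-nv",                     # non-verbose: errors and basic info only
        "--adjust-extension",      # add .html/.css suffixes to local files when needed
        "--convert-links",         # rewrite links for local browsing after download
        "--page-requisites",       # also fetch images, CSS, etc. needed to show a page
        "--span-hosts",            # allow following links onto other hosts...
        "-D", "journalstar.com,townnews.com",  # ...but only within these domains
        "-m",                      # mirror: recursive, timestamped, unlimited depth
        "-np",                     # no-parent: never climb above the start directory
        "--user-agent=Googlebot hurr durr",
        f"--warc-file={name}",     # also write a WARC archive with this base name
        start_url,
    ]


cmd = build_wget_command(
    "journalstar.com_news_local_20120730",
    "http://journalstar.com/news/local/",
)
print(shlex.join(cmd))  # shell-quoted form, ready to paste into a terminal
```

Keeping the arguments in a list avoids quoting mistakes around the user-agent string, which contains spaces.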
15:35
|
SmileyG |
:D |
15:35
|
SmileyG |
love the user agent ;D |
15:36
|
Schbirid |
doing the same for http://journalstar.com/sports/local/ |
15:39
|
Schbirid |
seems to run fairly well |
15:43
|
* |
SmileyG doesn't understand exactly whats going on. |
15:43
|
SmileyG |
does wget-warc take the same options as wget? |
15:44
|
SmileyG |
I'm just thinking of firing up and starting to test stuff; but I don't really know where to start other than copying someone else? |
15:46
|
mistym |
Yup, it takes the same options except, obviously, it supports --warc-file= |
15:46
|
Schbirid |
i use an up-to-date wget |
15:46
|
Schbirid |
it supports warc |
15:46
|
mistym |
Hopefully an actual point release with warc will come by one of these days. |
15:48
|
* |
SmileyG ponders some of the options and how you realise you need them |
15:49
|
SmileyG |
Such as convert links, and others.. |
15:50
|
Schbirid |
crap, now i am downloading things like "http://local.journalstar.com/malone+manor+bus+office.9.105559913p.home.html" and "http://www2.journalstar.com/admarket/business_services/alterations_sewing/" |
15:50
|
SmileyG |
o_O |
15:52
|
Schbirid |
adding --exclude-domains=www2.journalstar.com,local.journalstar.com,my.journalstar.com,local.journalstar.com |
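The effect of combining `-D` with `--exclude-domains` can be sketched as a host filter. This is a simplified model under the assumption that exclusions win over the accept list and that matching is by domain suffix; it is not wget's actual code, and the function names are hypothetical.

```python
from typing import Iterable


def _matches(host: str, domain: str) -> bool:
    """True if host is the domain itself or one of its subdomains."""
    host, domain = host.lower(), domain.lower()
    return host == domain or host.endswith("." + domain)


def host_allowed(host: str, accept: Iterable[str], exclude: Iterable[str]) -> bool:
    """Model of the host filter: exclusions first, then the accept list."""
    if any(_matches(host, d) for d in exclude):
        return False
    return any(_matches(host, d) for d in accept)


# Domain lists taken from the commands quoted in the log.
ACCEPT = ["journalstar.com", "townnews.com"]
EXCLUDE = ["www2.journalstar.com", "local.journalstar.com", "my.journalstar.com"]

for host in ["journalstar.com", "www2.journalstar.com", "local.journalstar.com"]:
    print(host, host_allowed(host, ACCEPT, EXCLUDE))
```

Under this model the classifieds and directory subdomains are skipped while the main site and its asset host stay in scope.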
15:52
|
SmileyG |
wtf, I threw it at the forums and it downloaded just the theme o_O |
15:53
|
SmileyG |
Registered users: djsmiley2k, Google [Bot], shinymcshine <<< HAHAHAH |
15:54
|
SmileyG |
So, how do I make it actually follow the links :S |
15:55
|
SmileyG |
../mobileme-grab/wget-warc -v -a gamestm.com_30072012.log -e robots=off --adjust-extension --convert-links --page-requisites -nv --user-agent="Googlebot hurr durr" --warc-file=gamestm_30072012 http://www.gamestm.co.uk/forum |
15:55
|
Schbirid |
-m |
15:55
|
SmileyG |
whats -m do then? |
15:55
|
Schbirid |
mirror |
15:55
|
Schbirid |
-nv is non-verbose btw |
15:55
|
SmileyG |
:D |
15:55
|
SmileyG |
even without -nv and with -v its still silent :| |
15:56
|
SmileyG |
oh eek |
15:56
|
Schbirid |
because of -a |
15:56
|
SmileyG |
its straying outside of /forum/ |
15:56
|
Schbirid |
-np |
15:57
|
SmileyG |
so remove -a? add -np?! |
15:58
|
Schbirid |
man wget and see what those options are |
15:58
|
Schbirid |
sorry :P |
15:58
|
SmileyG |
yeah I went to do that and accidentally closed my terminal XD |
15:59
|
SmileyG |
Oh you can't append to log file AND view it :/ |
15:59
|
Schbirid |
you could use tee |
15:59
|
Schbirid |
i always use another screen and tail -f the log |
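The `tee` idea amounts to copying each log line to two places at once. wget normally writes its log to stderr, and `-a` redirects it to a file only, which is why nothing showed on screen. The sketch below models what `tee` does with an already-captured stream of lines; `tee_lines` is a hypothetical helper, not part of wget or any library.

```python
import sys
from typing import Iterable, TextIO


def tee_lines(lines: Iterable[str], log: TextIO, screen: TextIO = sys.stdout) -> int:
    """Write every line to both the log and the screen; return the line count."""
    count = 0
    for line in lines:
        log.write(line)      # keep a permanent copy
        screen.write(line)   # and show it live
        screen.flush()       # avoid buffering delays on the visible copy
        count += 1
    return count


# Usage idea (not executed here): open the log in append mode and feed it
# the lines read from a running process's stderr pipe.
#   with open("crawl.log", "a") as log:
#       tee_lines(process.stderr, log)
```

The `tail -f` approach mentioned above gets the same result from the other direction: the process logs to the file only, and a second terminal follows the file.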
15:59
|
SmileyG |
--adjust-extension isn't in the help file ;) |
15:59
|
SmileyG |
Schbirid: ctrl+Z; bg; tail |
16:00
|
SmileyG |
or convert-links :( |
16:02
|
* |
SmileyG figured them out he thinks |
16:03
|
SmileyG |
wtf |
16:03
|
SmileyG |
with -np on it STILL went to a different dir instead of /forum/ |
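One likely cause, assuming wget's documented trailing-slash behaviour applies here: the start URL in the command quoted earlier was `http://www.gamestm.co.uk/forum`, with no trailing slash. The wget manual notes that the trailing slash matters for `--no-parent`, because without it the last path segment is treated as a file and its parent directory becomes the limit. The sketch below is a simplified model of that rule, not wget's code; the function names and the `/news/` URL are hypothetical.

```python
import posixpath
from urllib.parse import urlsplit


def crawl_root(start_url: str) -> str:
    """Directory a no-parent crawl is confined to, given the start URL."""
    path = urlsplit(start_url).path or "/"
    if path.endswith("/"):
        return path  # the URL already names a directory
    # No trailing slash: treat the last segment as a file, use its directory.
    parent = posixpath.dirname(path)
    return parent if parent.endswith("/") else parent + "/"


def within_no_parent(start_url: str, candidate_url: str) -> bool:
    """True if the candidate's path sits at or below the crawl root."""
    candidate_path = urlsplit(candidate_url).path or "/"
    return candidate_path.startswith(crawl_root(start_url))


other = "http://www.gamestm.co.uk/news/index.html"  # hypothetical off-forum page
print(within_no_parent("http://www.gamestm.co.uk/forum", other))
print(within_no_parent("http://www.gamestm.co.uk/forum/", other))
```

If this is what happened, starting from `http://www.gamestm.co.uk/forum/` (with the slash) would keep `-np` confined to the forum.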
16:08
|
omf_ |
It has been confirmed. I am giving a talk in September about big data, and archive.org and AT are main topics |
16:08
|
omf_ |
getting the word out |
16:17
|
Schbirid |
should have excluded /news/local/records/ |
18:14
|
SketchCow |
Ops, please. |
18:14
|
SketchCow |
Everyone, I'm still doing stuff here in Vegas, but by Wednesday, I will be DESTROYING my backlog. |
18:19
|
SmileyG |
:) |
18:19
|
SmileyG |
I finally started doing something \o/ |
18:19
|
SmileyG |
figuring out when to do what is hard; I'm trying to do kernel testing for gentoo too |
18:35
|
DFJustin |
"stuff" being "cocaine" |
18:45
|
Schbirid |
SketchCow: any idea when underscor will be back? |
19:18
|
SketchCow |
This stuff is DELICIOUS |
19:31
|
chronomex |
balrog_: spread the @ |
19:33
|
SketchCow |
Thank youuuuuuu |
19:34
|
chronomex |
\o/ |
19:34
|
chronomex |
ORDER HAS BEEN RESTORED. |
19:34
|
balrog_ |
SketchCow: still busy as ****? |
19:36
|
goekesmi |
SketchCow: How many Tera of video did you collect at Defcon? |
19:38
|
sexfilmle |
http://www.sexfilmler.com free hardcore movies! |
19:38
|
chronomex |
grrrrr |
19:39
|
Schbirid |
that was the answer |
19:41
|
chronomex |
over 1, twitter reports |
19:41
|
Nemo_bis |
SketchCow: can you delete http://archive.org/details/Wiki-BibliotecaWikimedia ? it's a test item |
19:42
|
goekesmi |
I was wondering if a better tally showed up between closing ceremonies and now. |
19:47
|
SketchCow |
Nemo_bis: Darked out |
19:48
|
Nemo_bis |
thanks |
20:55
|
SmileyG |
can haz ops? |
23:54
|
godane |
how the hell did i get a torrent for one item: http://archive.org/details/GBTV_01_25_2012 |
23:54
|
godane |
i want them for all :-D |