Time |
Nickname |
Message |
00:01
🔗
|
Schbirid |
godane: --span-hosts -D hackadaycom.files.wordpress.com,hackaday.com |
00:01
🔗
|
Schbirid |
they do that as load balancing |
00:02
🔗
|
Schbirid |
serving static images from another domain = another server plus browsers can handle it better |
00:03
🔗
|
godane |
i been downloading hackaday for 2 hours |
00:03
🔗
|
godane |
so i'm not going to do that now |
00:05
🔗
|
godane |
i'm getting the images thur a script i wrote |
01:19
🔗
|
godane |
maybe Jason Scott should get this: http://www.ebay.com/itm/MTV-Presents-Spyder-Games-Soap-Opera-VHS-Video-Complete-Series-Sunset-Beach-OOP-/290705456005?pt=VHS&hash=item43af653b85 |
01:42
🔗
|
godane |
how do you stop wget-warc from downloading anything with "?replytocom=" and "?shared=" |
02:20
🔗
|
godane |
just got the youtube embed list |
02:21
🔗
|
godane |
from hackaday.com |
02:23
🔗
|
godane |
fucking A |
02:24
🔗
|
godane |
got youtube list to not have ' in from http |
02:24
🔗
|
godane |
*front |
02:36
🔗
|
godane |
good news everyone |
02:37
🔗
|
godane |
i can maybe download all youtube embeds videos off of hackaday.com |
04:30
🔗
|
shaqfu |
Do the A/V Geeks have a shirt design up? I'd get the $50 if it's a nice shirt |
08:58
🔗
|
Schbirid |
are SketchCow and underscor at HOPE or something? |
08:58
🔗
|
godane |
i think SketchCow is at defcon |
08:59
🔗
|
Schbirid |
or that |
08:59
🔗
|
godane |
anyways, I changed my attack on archiving hackaday.com |
09:00
🔗
|
godane |
i do hackaday.com/2004/12/ and just get all hackaday data from dec 2004 |
09:02
🔗
|
godane |
this way it shouldn't take forever |
09:02
🔗
|
godane |
i downloads a lot of ?shared=service crap and i don't kow how to make that stop |
09:04
🔗
|
Schbirid |
--reject-regex= maybe |
09:05
🔗
|
Schbirid |
AT really could use a blog or something |
09:05
🔗
|
Schbirid |
or tutorials, howtos, howwedid |
09:07
🔗
|
godane |
there is --reject option |
09:07
🔗
|
godane |
doesn't work for me |
09:08
🔗
|
godane |
and when i reject does work it will still download it |
09:08
🔗
|
godane |
it just will not store it |
09:08
🔗
|
Schbirid |
yeah, that one is weird. i think the regexp one works with files like that |
09:08
🔗
|
Schbirid |
oh :( |
09:10
🔗
|
godane |
i get stuff like index.html?share=*.html crap |
09:10
🔗
|
godane |
i hate it now |
09:11
🔗
|
godane |
its meant to share stuff on facebook and digg and email |
09:54
🔗
|
godane |
i still can't get reject to work that way i want it too |
09:54
🔗
|
godane |
Schbirid: can you help me with this? |
09:55
🔗
|
godane |
i want wget to only download index.html files |
09:55
🔗
|
Schbirid |
i could try with no garantuee (how do you spell that...) |
09:55
🔗
|
godane |
wget-warc "http://$1/2004/12/" --mirror --warc-file="$1-2004-12" --warc-cdx -np -A "index.html" -R "*\?" --html-extension --domains hackaday.com -p |
09:55
🔗
|
godane |
thats my code for my script |
09:56
🔗
|
Schbirid |
--reject-regex='/.*?' maybe? |
09:57
🔗
|
godane |
example url will be stuff like this: http://hackaday.com/2004/12/31/treo-650-hack-how-tos/?shared=email&msg=fail |
09:58
🔗
|
godane |
--reject-regex doesn't work |
09:58
🔗
|
godane |
it doesn't even exist |
09:58
🔗
|
Schbirid |
oh yuck, i just realised. i am using a very recent wget and thatoption is not even on the manpage :( |
09:59
🔗
|
Schbirid |
that works well though |
09:59
🔗
|
Schbirid |
wget -m -np --reject-regex='/.*' "http://hackaday.com/2004/12/31/treo-650-hack-how-tos/" |
09:59
🔗
|
Schbirid |
only gets the root (index.html) for me |
09:59
🔗
|
Schbirid |
GNU Wget 1.13.4-2608 |
09:59
🔗
|
godane |
oh |
09:59
🔗
|
godane |
i'm using 2582 |
10:01
🔗
|
Schbirid |
hm, not sure how to make it greedy though. wget -m -np --reject-regex='/.*?' "http://hackaday.com/2004/12/31/" does not traverse into the "subdirs" |
10:01
🔗
|
Schbirid |
lunch time |
10:10
🔗
|
Schbirid |
maybe --reject-regex='/.*\?.*' |
10:10
🔗
|
Schbirid |
as in reject anything with a ? |
10:11
🔗
|
Schbirid |
oh, '\?' would work then :D |
10:11
🔗
|
Schbirid |
2012-07-28 12:10:02 URL:http://hackaday.com/2004/12/31/ [43557] -> "hackaday.com/2004/12/31/index.html" [1] |
10:11
🔗
|
Schbirid |
2012-07-28 12:10:03 URL:http://hackaday.com/2004/12/31/treo-650-hack-how-tos/ [74383] -> "hackaday.com/2004/12/31/treo-650-hack-how-tos/index.html" [1] |
10:11
🔗
|
Schbirid |
2012-07-28 12:10:04 URL:http://hackaday.com/2004/12/31/treo-650-hack-how-tos/feed/ [15930] -> "hackaday.com/2004/12/31/treo-650-hack-how-tos/feed/index.html" [1] |
10:11
🔗
|
Schbirid |
2012-07-28 12:10:05 URL:http://hackaday.com/2004/12/31/treo-650-hack-how-tos/ [74383] -> "hackaday.com/2004/12/31/treo-650-hack-how-tos/comment-page-1/index.html" [1] |
10:12
🔗
|
Schbirid |
looks good |
10:14
🔗
|
godane |
cool |
10:14
🔗
|
godane |
thanks |
10:16
🔗
|
Schbirid |
np |
10:29
🔗
|
godane |
looks like wget-warc can't be complied without downloading something |
10:29
🔗
|
godane |
:-D |
10:29
🔗
|
godane |
:-( |
10:29
🔗
|
godane |
the last one i meant |
10:30
🔗
|
godane |
i'm been trying to make a source dvd that can recompile a os fully offline |
11:31
🔗
|
godane |
Schbind: i'm starting to hate wget-warc |
11:31
🔗
|
godane |
i can't compile it like the normal tarballs |
11:33
🔗
|
Schbirid |
godane: i am using a bazaar checkout.wget has warc now |
11:34
🔗
|
godane |
i hate it cause i want to compile it offline |
11:35
🔗
|
godane |
no download crap in order to compile |
11:35
🔗
|
godane |
i don't want gnulib being some depend for it |
12:03
🔗
|
godane |
i got error even when doing it right |
12:03
🔗
|
godane |
damn wget-warc |
12:08
🔗
|
godane |
i can't build the newer wget-warc |
12:09
🔗
|
godane |
Schbirld: can you help me? |
12:10
🔗
|
godane |
there is no css.c file wget |
12:10
🔗
|
godane |
*in wget source |
12:10
🔗
|
godane |
thats my error |
12:33
🔗
|
godane |
looks like i maybe able to just pack the old wget-warc with regex support |
12:33
🔗
|
godane |
nothing got broken out side of changelogs |
12:56
🔗
|
godane |
its working now |
12:56
🔗
|
godane |
only download stuff thats need now |
14:40
🔗
|
Schbirid |
no idea, sorry |
19:09
🔗
|
Soojin |
http://vimeo.com/43530099 |
21:52
🔗
|
SmileyG |
it will sound bonkers to you winr4r as it sounds bonkers to me evne now |
21:52
🔗
|
SmileyG |
but I overcame my depression by willing myself better :< |
21:54
🔗
|
winr4r |
SmileyG: i solve it by being insanely productive |
21:55
🔗
|
winr4r |
SmileyG: among many other things i'm doing today, i'm writing a piece explaining all the cultural and historical references in "alice's restaurant" |
21:57
🔗
|
winr4r |
SmileyG: i think that a lot of kids in the west wouldn't get most of the jokes *today*, let alone someone from another culture who heard it |
22:00
🔗
|
SmileyG |
I have no clue what you just said. |
22:00
🔗
|
winr4r |
sorry to hear that |
22:01
🔗
|
SmileyG |
What is Alice's Resturant? |
22:01
🔗
|
SmileyG |
err spelling :D |
22:02
🔗
|
winr4r |
SmileyG: http://www.youtube.com/watch?v=LjKF7aQthcQ |
22:02
🔗
|
winr4r |
it is actually a 20-minute song btw |
22:02
🔗
|
winr4r |
which is another reason it is so funny, most of arlo's songs were of normal length |
22:03
🔗
|
SmileyG |
this reminds me of tribute |
22:03
🔗
|
SmileyG |
This isn't the best song in the world, its merely a tribute... |
22:07
🔗
|
winr4r |
good song too |
22:07
🔗
|
* |
SmileyG still listening... |
22:11
🔗
|
winr4r |
SmileyG: it's a later version of the song, played live, so there's quite a few self-references in there |
22:14
🔗
|
* |
SmileyG doesn't understand |
22:14
🔗
|
winr4r |
you'll see towards the end |
22:20
🔗
|
SmileyG |
o_O |
22:23
🔗
|
winr4r |
the thing about 18 minutes and 20 seconds, by the way, might justify having a whole new appendix for alternative endings |
22:25
🔗
|
SmileyG |
hmmm |
22:25
🔗
|
SmileyG |
Ok, I didn't understand any of that :/ |
22:26
🔗
|
winr4r |
SmileyG: didn't understand the accent, or understand the cultural references? |
22:27
🔗
|
SmileyG |
I didn't even notice any cultural references |
22:28
🔗
|
winr4r |
there are lots of them |
22:28
🔗
|
winr4r |
8x10 pictures? |
22:28
🔗
|
winr4r |
the significance of that is quite important |
22:28
🔗
|
SmileyG |
...? |
22:29
🔗
|
* |
SmileyG is at a loss. |
22:29
🔗
|
winr4r |
8x10 film cameras are what you shot back then when you wanted ultimate technical quality |
22:29
🔗
|
SmileyG |
Right... |
22:29
🔗
|
winr4r |
that would be like the police photographing littering with £20,000 hasselblads |
22:30
🔗
|
SmileyG |
hmmm ok? |
22:30
🔗
|
SmileyG |
XD |
22:30
🔗
|
* |
SmileyG never understood people reading into songs. :/ |
22:30
🔗
|
SmileyG |
Paintings yes, I can get that |
22:30
🔗
|
SmileyG |
Songs.... nope, they are "just songs" to me :S |
22:30
🔗
|
SmileyG |
Then again some of the stuff people read into paintings I think is pure BS too :/ |
22:31
🔗
|
winr4r |
SmileyG: i'm not reading into it, it's how people would have understood the song 45 years ago |
22:31
🔗
|
SmileyG |
artistic licence because it rhymes? |
22:33
🔗
|
winr4r |
nothing in that song rhymes :P |
22:33
🔗
|
SmileyG |
you know what I mean :P |
22:33
🔗
|
* |
SmileyG just doesn't understand |
22:34
🔗
|
SmileyG |
:S |
22:34
🔗
|
winr4r |
"8x10 colour glossy photographs" isn't because it rhymes, it's because "8x10 colour glossy photographs" had significance back then |
22:34
🔗
|
SmileyG |
I've tried to explain to people how i experience the world before and I fail to |
22:34
🔗
|
winr4r |
1) it was 8x10 which was overkill for photographing a crime scene |
22:34
🔗
|
SmileyG |
or they fail to comprehend what I experience. |
22:34
🔗
|
winr4r |
2) it was in colour, at a time when colour processing was expensive |
22:34
🔗
|
winr4r |
which the cops used to photograph littering in a very very small town |
22:35
🔗
|
SmileyG |
So the song is deliberately absurd? |
22:35
🔗
|
winr4r |
SmileyG: yes, but apparently a true story |
22:36
🔗
|
winr4r |
SmileyG: you only get the full feel of the absurdity once you understand all the historical/cultural references |
22:36
🔗
|
SmileyG |
winr4r: now I understand |
22:36
🔗
|
SmileyG |
to a present day listener its just a song |
22:37
🔗
|
SmileyG |
but when you know the cirucmstance you understand how absurd it is. |
22:37
🔗
|
winr4r |
yup, which is the whole reason i am writing |
22:45
🔗
|
winr4r |
hi mistym |
22:45
🔗
|
mistym |
Hey winr4r |
23:08
🔗
|
godane |
hey winr4r |
23:08
🔗
|
godane |
i'm backing up hackaday.com |
23:08
🔗
|
winr4r |
godane: yes, i saw that :) |
23:08
🔗
|
godane |
i doing it by year |
23:09
🔗
|
godane |
i have 2004 and 2005 backed up |
23:09
🔗
|
godane |
also got data dumps of images |
23:10
🔗
|
godane |
also uploading gbtv episode 2012-01-16 |
23:10
🔗
|
winr4r |
you the man :) |
23:11
🔗
|
godane |
learned about grep -o option |
23:12
🔗
|
godane |
makes alot easier to grep all www.weblogsinc.com/common/images/[0-9]*.JPG |