#archiveteam-bs 2012-07-28,Sat

↑back Search

Time Nickname Message
00:01 🔗 Schbirid godane: --span-hosts -D hackadaycom.files.wordpress.com,hackaday.com
00:01 🔗 Schbirid they do that as load balancing
00:02 🔗 Schbirid serving static images from another domain = another server plus browsers can handle it better
00:03 🔗 godane i been downloading hackaday for 2 hours
00:03 🔗 godane so i'm not going to do that now
00:05 🔗 godane i'm getting the images thur a script i wrote
01:19 🔗 godane maybe Jason Scott should get this: http://www.ebay.com/itm/MTV-Presents-Spyder-Games-Soap-Opera-VHS-Video-Complete-Series-Sunset-Beach-OOP-/290705456005?pt=VHS&hash=item43af653b85
01:42 🔗 godane how do you stop wget-warc from downloading anything with "?replytocom=" and "?shared="
02:20 🔗 godane just got the youtube embed list
02:21 🔗 godane from hackaday.com
02:23 🔗 godane fucking A
02:24 🔗 godane got youtube list to not have ' in from http
02:24 🔗 godane *front
02:36 🔗 godane good news everyone
02:37 🔗 godane i can maybe download all youtube embeds videos off of hackaday.com
04:30 🔗 shaqfu Do the A/V Geeks have a shirt design up? I'd get the $50 if it's a nice shirt
08:58 🔗 Schbirid are SketchCow and underscor at HOPE or something?
08:58 🔗 godane i think SketchCow is at defcon
08:59 🔗 Schbirid or that
08:59 🔗 godane anyways, I changed my attack on archiving hackaday.com
09:00 🔗 godane i do hackaday.com/2004/12/ and just get all hackaday data from dec 2004
09:02 🔗 godane this way it shouldn't take forever
09:02 🔗 godane i downloads a lot of ?shared=service crap and i don't kow how to make that stop
09:04 🔗 Schbirid --reject-regex= maybe
09:05 🔗 Schbirid AT really could use a blog or something
09:05 🔗 Schbirid or tutorials, howtos, howwedid
09:07 🔗 godane there is --reject option
09:07 🔗 godane doesn't work for me
09:08 🔗 godane and when i reject does work it will still download it
09:08 🔗 godane it just will not store it
09:08 🔗 Schbirid yeah, that one is weird. i think the regexp one works with files like that
09:08 🔗 Schbirid oh :(
09:10 🔗 godane i get stuff like index.html?share=*.html crap
09:10 🔗 godane i hate it now
09:11 🔗 godane its meant to share stuff on facebook and digg and email
09:54 🔗 godane i still can't get reject to work that way i want it too
09:54 🔗 godane Schbirid: can you help me with this?
09:55 🔗 godane i want wget to only download index.html files
09:55 🔗 Schbirid i could try with no garantuee (how do you spell that...)
09:55 🔗 godane wget-warc "http://$1/2004/12/" --mirror --warc-file="$1-2004-12" --warc-cdx -np -A "index.html" -R "*\?" --html-extension --domains hackaday.com -p
09:55 🔗 godane thats my code for my script
09:56 🔗 Schbirid --reject-regex='/.*?' maybe?
09:57 🔗 godane example url will be stuff like this: http://hackaday.com/2004/12/31/treo-650-hack-how-tos/?shared=email&msg=fail
09:58 🔗 godane --reject-regex doesn't work
09:58 🔗 godane it doesn't even exist
09:58 🔗 Schbirid oh yuck, i just realised. i am using a very recent wget and thatoption is not even on the manpage :(
09:59 🔗 Schbirid that works well though
09:59 🔗 Schbirid wget -m -np --reject-regex='/.*' "http://hackaday.com/2004/12/31/treo-650-hack-how-tos/"
09:59 🔗 Schbirid only gets the root (index.html) for me
09:59 🔗 Schbirid GNU Wget 1.13.4-2608
09:59 🔗 godane oh
09:59 🔗 godane i'm using 2582
10:01 🔗 Schbirid hm, not sure how to make it greedy though. wget -m -np --reject-regex='/.*?' "http://hackaday.com/2004/12/31/" does not traverse into the "subdirs"
10:01 🔗 Schbirid lunch time
10:10 🔗 Schbirid maybe --reject-regex='/.*\?.*'
10:10 🔗 Schbirid as in reject anything with a ?
10:11 🔗 Schbirid oh, '\?' would work then :D
10:11 🔗 Schbirid 2012-07-28 12:10:02 URL:http://hackaday.com/2004/12/31/ [43557] -> "hackaday.com/2004/12/31/index.html" [1]
10:11 🔗 Schbirid 2012-07-28 12:10:03 URL:http://hackaday.com/2004/12/31/treo-650-hack-how-tos/ [74383] -> "hackaday.com/2004/12/31/treo-650-hack-how-tos/index.html" [1]
10:11 🔗 Schbirid 2012-07-28 12:10:04 URL:http://hackaday.com/2004/12/31/treo-650-hack-how-tos/feed/ [15930] -> "hackaday.com/2004/12/31/treo-650-hack-how-tos/feed/index.html" [1]
10:11 🔗 Schbirid 2012-07-28 12:10:05 URL:http://hackaday.com/2004/12/31/treo-650-hack-how-tos/ [74383] -> "hackaday.com/2004/12/31/treo-650-hack-how-tos/comment-page-1/index.html" [1]
10:12 🔗 Schbirid looks good
10:14 🔗 godane cool
10:14 🔗 godane thanks
10:16 🔗 Schbirid np
10:29 🔗 godane looks like wget-warc can't be complied without downloading something
10:29 🔗 godane :-D
10:29 🔗 godane :-(
10:29 🔗 godane the last one i meant
10:30 🔗 godane i'm been trying to make a source dvd that can recompile a os fully offline
11:31 🔗 godane Schbind: i'm starting to hate wget-warc
11:31 🔗 godane i can't compile it like the normal tarballs
11:33 🔗 Schbirid godane: i am using a bazaar checkout.wget has warc now
11:34 🔗 godane i hate it cause i want to compile it offline
11:35 🔗 godane no download crap in order to compile
11:35 🔗 godane i don't want gnulib being some depend for it
12:03 🔗 godane i got error even when doing it right
12:03 🔗 godane damn wget-warc
12:08 🔗 godane i can't build the newer wget-warc
12:09 🔗 godane Schbirld: can you help me?
12:10 🔗 godane there is no css.c file wget
12:10 🔗 godane *in wget source
12:10 🔗 godane thats my error
12:33 🔗 godane looks like i maybe able to just pack the old wget-warc with regex support
12:33 🔗 godane nothing got broken out side of changelogs
12:56 🔗 godane its working now
12:56 🔗 godane only download stuff thats need now
14:40 🔗 Schbirid no idea, sorry
19:09 🔗 Soojin http://vimeo.com/43530099
21:52 🔗 SmileyG it will sound bonkers to you winr4r as it sounds bonkers to me evne now
21:52 🔗 SmileyG but I overcame my depression by willing myself better :<
21:54 🔗 winr4r SmileyG: i solve it by being insanely productive
21:55 🔗 winr4r SmileyG: among many other things i'm doing today, i'm writing a piece explaining all the cultural and historical references in "alice's restaurant"
21:57 🔗 winr4r SmileyG: i think that a lot of kids in the west wouldn't get most of the jokes *today*, let alone someone from another culture who heard it
22:00 🔗 SmileyG I have no clue what you just said.
22:00 🔗 winr4r sorry to hear that
22:01 🔗 SmileyG What is Alice's Resturant?
22:01 🔗 SmileyG err spelling :D
22:02 🔗 winr4r SmileyG: http://www.youtube.com/watch?v=LjKF7aQthcQ
22:02 🔗 winr4r it is actually a 20-minute song btw
22:02 🔗 winr4r which is another reason it is so funny, most of arlo's songs were of normal length
22:03 🔗 SmileyG this reminds me of tribute
22:03 🔗 SmileyG This isn't the best song in the world, its merely a tribute...
22:07 🔗 winr4r good song too
22:07 🔗 * SmileyG still listening...
22:11 🔗 winr4r SmileyG: it's a later version of the song, played live, so there's quite a few self-references in there
22:14 🔗 * SmileyG doesn't understand
22:14 🔗 winr4r you'll see towards the end
22:20 🔗 SmileyG o_O
22:23 🔗 winr4r the thing about 18 minutes and 20 seconds, by the way, might justify having a whole new appendix for alternative endings
22:25 🔗 SmileyG hmmm
22:25 🔗 SmileyG Ok, I didn't understand any of that :/
22:26 🔗 winr4r SmileyG: didn't understand the accent, or understand the cultural references?
22:27 🔗 SmileyG I didn't even notice any cultural references
22:28 🔗 winr4r there are lots of them
22:28 🔗 winr4r 8x10 pictures?
22:28 🔗 winr4r the significance of that is quite important
22:28 🔗 SmileyG ...?
22:29 🔗 * SmileyG is at a loss.
22:29 🔗 winr4r 8x10 film cameras are what you shot back then when you wanted ultimate technical quality
22:29 🔗 SmileyG Right...
22:29 🔗 winr4r that would be like the police photographing littering with £20,000 hasselblads
22:30 🔗 SmileyG hmmm ok?
22:30 🔗 SmileyG XD
22:30 🔗 * SmileyG never understood people reading into songs. :/
22:30 🔗 SmileyG Paintings yes, I can get that
22:30 🔗 SmileyG Songs.... nope, they are "just songs" to me :S
22:30 🔗 SmileyG Then again some of the stuff people read into paintings I think is pure BS too :/
22:31 🔗 winr4r SmileyG: i'm not reading into it, it's how people would have understood the song 45 years ago
22:31 🔗 SmileyG artistic licence because it rhymes?
22:33 🔗 winr4r nothing in that song rhymes :P
22:33 🔗 SmileyG you know what I mean :P
22:33 🔗 * SmileyG just doesn't understand
22:34 🔗 SmileyG :S
22:34 🔗 winr4r "8x10 colour glossy photographs" isn't because it rhymes, it's because "8x10 colour glossy photographs" had significance back then
22:34 🔗 SmileyG I've tried to explain to people how i experience the world before and I fail to
22:34 🔗 winr4r 1) it was 8x10 which was overkill for photographing a crime scene
22:34 🔗 SmileyG or they fail to comprehend what I experience.
22:34 🔗 winr4r 2) it was in colour, at a time when colour processing was expensive
22:34 🔗 winr4r which the cops used to photograph littering in a very very small town
22:35 🔗 SmileyG So the song is deliberately absurd?
22:35 🔗 winr4r SmileyG: yes, but apparently a true story
22:36 🔗 winr4r SmileyG: you only get the full feel of the absurdity once you understand all the historical/cultural references
22:36 🔗 SmileyG winr4r: now I understand
22:36 🔗 SmileyG to a present day listener its just a song
22:37 🔗 SmileyG but when you know the cirucmstance you understand how absurd it is.
22:37 🔗 winr4r yup, which is the whole reason i am writing
22:45 🔗 winr4r hi mistym
22:45 🔗 mistym Hey winr4r
23:08 🔗 godane hey winr4r
23:08 🔗 godane i'm backing up hackaday.com
23:08 🔗 winr4r godane: yes, i saw that :)
23:08 🔗 godane i doing it by year
23:09 🔗 godane i have 2004 and 2005 backed up
23:09 🔗 godane also got data dumps of images
23:10 🔗 godane also uploading gbtv episode 2012-01-16
23:10 🔗 winr4r you the man :)
23:11 🔗 godane learned about grep -o option
23:12 🔗 godane makes alot easier to grep all www.weblogsinc.com/common/images/[0-9]*.JPG

irclogger-viewer