[01:07] don't know if I'm persona non grata around here now, but just wanted to let y'all know I'm restarting my archivebot run-pipeline after b68ktbj36sum7rhkjrhc48aw5 finishes…
[01:32] I must have missed drama. botheration.
[02:12] looks like it was not a bad idea in grabbing the xml sitemap video files
[02:12] wayback machine has none
[02:12] for cbsnews.com
[02:16] i'm grabbing the 1296.m4v version of cbsnews stuff
[02:16] that will be the best version of the video
[04:49] if anyone has the time, i'd appreciate some feedback on my wget-like python downloader: https://github.com/chfoo/wpull
[04:59] SketchCow: can i get access to https://archive.org/details/wsjtechpm and https://archive.org/details/wsjtecham
[05:03] also wsjtechpm is a item not a collection for some reason still
[05:03] that needs to be fixed before access
[05:04] also again both of those collections should have been one collection
[05:04] not 2
[05:07] chfoo: I like how it uses sqlite out the gate
[05:08] when Lua support is added, I'll definitely check that out as a wget replacement for archivebot
[05:31] chfoo: <3
[05:41] ha, why would you need lua when you have python
[05:43] :P
[05:43] ivan`: because I don't want to rewrite archivebot.lua in Python
[11:01] i got a 17 minute interview of Robert Iger from july 2009
[11:01] thanks to cnn money
[12:22] hey, when did IA get book preview
[12:29] i think this past month
[12:29] i also noticed that the video image gif is now just 4 images
[12:30] only a gif in search if its the file is the same as item id
[13:17] so i have some more good news on my archiving of news videos
[13:17] i can maybe grab world news tonight for the past 4 year
[13:17] *years
[13:47] also i may be able to get night line too
[13:57] wth... I got a announcement email from IA on my old email adress, which I changed to an other one...
[13:57] so damn strange
[13:58] email stuff is a bit bugged maybe for me...
[14:00] question: I believe I've changed my email when there was maintenance (server on read-only)... could that have fucked up something for me?
[14:24] looks like my night line grab can go far back as 2008
[14:25] i really hope they can keep this stuff up when i get it uploaded
[16:53] actually, archive-related question, does any of the common crawl stuff make it into the wayback machine
[16:55] SketchCow ^ ?
[19:34] All of it does.