#archiveteam 2014-08-22,Fri

↑back Search

Time Nickname Message
02:07 🔗 nitro2k01 Is there a script to mirror a site from Google's cache? I just noticed a site in my bookmarks disappeared but it's still cached.
03:31 🔗 godane nitro2k01: some thing like this: http://webcache.googleusercontent.com/search?q=cache:$url
03:31 🔗 godane $url being the url in the cache
15:42 🔗 qwebirc74 Hi guys
15:43 🔗 qwebirc74 I wanted to ask you about the yahoo voices content, it's still not available through the wayback machine, it's been almost 2 weeks now and I don't understand why it hasn't showed up yet.
15:43 🔗 qwebirc74 Just to explain better the issue, keeping in mind that Yahoo voices has more than 1.8 million articles, the links showing on this page http://web.archive.org/web/*/http://voices.yahoo.com/* are less than 300.000 links. Also, doing a search for voices.yahoo.com in the way back machine and trying to access it using any date since august 1st renders a 302 response.
15:43 🔗 qwebirc74 Does anyone have an idea why?
15:44 🔗 SketchCow I uploaded them all.
15:44 🔗 SketchCow Let's go see!
15:45 🔗 SketchCow https://archive.org/details/archiveteam_yahoovoices is the warc collection.
15:46 🔗 SketchCow All are mediatype web
15:48 🔗 qwebirc74 Yes but how do I access them? I was under the impression it's going to be available through a simple wayback machine search
15:50 🔗 SketchCow http://web.archive.org/web/*/http://voices.yahoo.com/ shows the explosion of scanning we did at the end of July.
15:51 🔗 SketchCow Do you have an article or URL you're searching for?
15:51 🔗 qwebirc74 For example, let's take this article: http://web.archive.org/web/20140707172250/http://voices.yahoo.com/the-passing-dith-pran-brings-subject-of-1343051.html
15:52 🔗 qwebirc74 Got an HTTP 302 response at crawl time
15:52 🔗 qwebirc74 almost all of those I checked return this message
15:52 🔗 SketchCow After it said 302 it redirected and I'm looking at it.
15:53 🔗 qwebirc74 Holy shit
15:54 🔗 qwebirc74 I for some reason just assumed they were unavailable
15:55 🔗 qwebirc74 Thank you
15:58 🔗 Nemo_bis The 302 redirection must be made faster for impatient visitors!
16:07 🔗 aaaaaaaaa there is literally a link called "impatient?" but I agree, most people don't care about the redirection, or don't know what that means, they just see a red error message.
16:28 🔗 SketchCow I'm emotionally over their concern.
16:37 🔗 xmc .... wat
16:38 🔗 xmc watever
21:23 🔗 ivan`- !status
21:23 🔗 ivan`- gah sorry

irclogger-viewer