[00:49] db48x: what are you NSAing? :D [00:57] gopher://port70.net/1chan [00:59] nice [17:05] http://www.legacy.com/obituaries/tricities/obituary.aspx?n=silas-clark&pid=163255486&fhid=10509 [17:06] * closure sighs. I forgot to pdf that obit. [17:06] anyone know of any work to deal with legacy.com's horrible retention? [17:06] (their search doesn't find it either of course..) [18:05] just reading SketchCow twitter account [18:05] looks like there are 1400 bbs still running [21:26] http://www.csmonitor.com/World/Europe/2014/0717/Web-evidence-points-to-pro-Russia-rebels-in-downing-of-MH17 [21:33] I would only be capturing things in this 48 hour period, but not assuming any of it is true [21:40] this is the sort of stuff (well the primary sources at least) where we need to be concerned about robots.txt takedowns [22:37] maybe we should create a whitelist of urls [22:37] just a thought [22:42] archivebot all the URLs