#archiveteam-bs 2017-06-04,Sun

↑back Search ←Prev date Next date→ (Showing only urls - See all)(Click on time to show url line in full context)

WhoWhatWhen
JRWRLOOK AT THESE GRAPHS http://jrwr.io:19999 [02:28]
MrRadarI found an article on SSD unpowered data retention: http://www.anandtech.com/show/9248/the-truth-about-ssd-data-retention
Includes this graph, showing how many weeks a drive is expected to retain data based on the temperature at which the data was written and the temperature the drive is stored: http://images.anandtech.com/doci/9248/3_575px.PNG
[03:02]
Looks like there's already a bug for the missing data issue: https://github.com/ArchiveTeam/seesaw-kit/issues/48 [03:52]
***SN4T14 has quit IRC (Quit: ZNC 1.6.3 - http://znc.in) [04:56]
timmcPurpleSym: There was this *very* interesting report joepie91 generated for (I think) just one URL, and by gosh it does look like garbled chunked transfer-encoding: http://sprunge.us/RjWi [14:18]
PurpleSym: I think https://archive.org/download/archiveteam_portalgraphics_20160727140857/portalgraphics_20160727140857.megawarc.warc.gz and https://web.archive.org/web/20160724001629/http://www.portalgraphics.net/pg/illust/?image_id=10575 [14:28]
PurpleSymNope, hex: https://tools.ietf.org/html/rfc2616#section-3.6.1 [14:35]
JAAIs this page broken for anyone else? https://archive.org/search.php?query=collection%3Aarchivebot&sort=-publicdate [20:30]
JRWRhttps://hastebin.com/ipupobevun.xml [20:33]
JAAhttps://archive.org/search.php works, but any actual search has the same issue. [20:44]
SanquiJAA: try the viewer then https://archive.fart.website/archivebot/viewer/ [22:21]
JAAI'm trying to look into the corruption issue that was discussed yesterday. Much of the discussion focused on wget-lua, but as DoomTay mentioned earlier in #archivebot, at least one ArchiveBot grab was also affected: https://web.archive.org/web/20160615222159/http://www.portalgraphics.net/pg/illust/?image_id=10575 [22:25]
Here's a list mapping the URLs SketchCow posted yesterday to the corresponding IA item: https://hastebin.com/raw/iruvobodor . I also included some additional examples I found. [23:24]
Here's the WARC records for https://web.archive.org/web/20160725184715/http://www.portalgraphics.net/pg/illust/?image_id=21107&lang=en : https://hastebin.com/raw/gokiwuzage [23:50]
I suspect that it's due to the space character after "5c". This space doesn't conform to the specs ( https://tools.ietf.org/html/rfc7230#section-4.1 ), which define a chunk as `chunk-size [ chunk-ext ] CRLF chunk-data CRLF`. chunk-size is the size of the chunk in hexadecimal digits (upper or lower case), chunk-ext is an optional extension of the algorithm and is `*( ";" chunk-ext-name [ "=" chunk-ext-v [23:59]

↑back Search ←Prev date Next date→ (Showing only urls - See all)(Click on time to show url line in full context)