#archiveteam 2014-06-27,Fri

Time Nickname Message
00:25 🔗 db48x http://db48x.net/pixorial.urls.2014-06-26.sorted.bz2
06:58 🔗 Spring Are WARC archives meant to preserve Javascript? Pages on the web increasingly rely on JS for their content, and the pages using JS that I've seen archived on IA are mostly broken.
07:00 🔗 Spring Here's an example: a brilliant piece I just saw on HN which, when saved on IA, lacks the core JS animations that make up half the content: http://wayback.archive.org/web/20140627065355/http://bost.ocks.org/mike/algorithms/
07:15 🔗 db48x Spring: WARC files save the exact contents of any HTTP request the caller makes, and the exact response returned by the server
07:16 🔗 db48x if the crawler doesn't know to request a particular file, then the WARC won't have anything about it
07:16 🔗 db48x even if the crawler runs all of the javascript in the page, anything that requires user interaction to trigger an HTTP request will likely be missed
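[Editor's sketch, not part of the log] db48x's point can be illustrated with a toy model: a crawler that only follows URLs visible in static markup records the script file itself, but never the request that the script would make after a click. Everything here is hypothetical: the `SITE` dictionary stands in for a server, the `archive` dictionary stands in for a WARC file, and the paths are invented for illustration. It is a simplified sketch under those assumptions, not how any real crawler or WARC writer is implemented.

```python
from html.parser import HTMLParser

# Hypothetical toy site: URL -> response body. Not a real server.
SITE = {
    "/index.html": '<html><script src="/app.js"></script>'
                   '<a href="/about.html">About</a></html>',
    # The script only requests /data.json after a user clicks something.
    "/app.js": 'button.onclick = () => fetch("/data.json");',
    "/about.html": "<html>About</html>",
    "/data.json": '{"points": [1, 2, 3]}',
}


class LinkExtractor(HTMLParser):
    """Collects URLs from href/src attributes, i.e. links visible in static markup."""

    def __init__(self):
        super().__init__()
        self.urls = []

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if name in ("href", "src") and value:
                self.urls.append(value)


def crawl(start):
    """Fetch every URL reachable through static markup and record each response."""
    archive = {}  # stands in for a WARC: one recorded response per requested URL
    queue = [start]
    while queue:
        url = queue.pop(0)
        if url in archive or url not in SITE:
            continue
        body = SITE[url]
        archive[url] = body
        # Only HTML is parsed for links; the JS file is stored but never executed.
        if url.endswith(".html"):
            parser = LinkExtractor()
            parser.feed(body)
            queue.extend(parser.urls)
    return archive


archive = crawl("/index.html")
print(sorted(archive))          # ['/about.html', '/app.js', '/index.html']
print("/data.json" in archive)  # False: nothing ever requested it
```

On replay, the archived page would load `/app.js` fine, but a click that triggers the fetch of `/data.json` would find nothing in the archive, which matches the kind of breakage Spring describes.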
07:21 🔗 Spring Doesn't this become an issue for archivists wanting to preserve such pages?
07:23 🔗 Spring I mean obviously it would be the ideal, but as more and more of the web uses JS for things, I had imagined something would be created to handle archiving pages/sites more like the user would have viewed them.
07:36 🔗 db48x yes, it's a growing problem
07:37 🔗 db48x in many ways the ideal archival method would be to record the traffic of real users interacting with the site in an everyday manner
07:37 🔗 db48x it's not particularly fast or methodical though
07:41 🔗 db48x great page btw
07:44 🔗 db48x I'm not quite sure what's causing the error, but none of the files are missing
19:42 🔗 NovaKing SketchCow: you around by any chance?
19:43 🔗 SketchCow ssfgsdgf
19:44 🔗 SketchCow E-mail me.
19:44 🔗 NovaKing email = ?
19:45 🔗 SketchCow jason@textfiles.com
19:51 🔗 NovaKing email sent.
20:10 🔗 midas wuuuu NovaKing here?
20:12 🔗 NovaKing ?
20:12 🔗 NovaKing i've been here for like.... too long
20:21 🔗 yipdw TOOOOOOO LONG
20:23 🔗 xmc oh my
20:23 🔗 xmc it's novaking
20:23 🔗 Smiley o/
20:24 🔗 NovaKing i mainly lurk now
20:24 🔗 NovaKing helped more back in the day (like a year ago)