[00:08] *** yuitimoth has quit IRC (hub.efnet.us irc.efnet.nl) [00:08] *** Fusl has quit IRC (hub.efnet.us irc.efnet.nl) [00:08] *** jodizzle has quit IRC (hub.efnet.us irc.efnet.nl) [00:09] *** HUBI_ has quit IRC (Read error: Operation timed out) [00:11] *** HUBI_ has joined #internetarchive [00:13] *** yuitimoth has joined #internetarchive [00:13] *** Fusl has joined #internetarchive [00:13] *** jodizzle has joined #internetarchive [00:13] *** irc.efnet.nl sets mode: +o Fusl [00:14] *** Fusl_ sets mode: +o Fusl [00:17] *** Fusl sets mode: +o kiska1 [00:17] *** Fusl sets mode: +o kiskabak [00:17] *** Fusl sets mode: +o kiska [00:17] *** Fusl sets mode: +o JAA [00:18] *** Fusl sets mode: +o svchfoo3 [00:18] *** Fusl sets mode: +o jrwr [00:18] *** Fusl sets mode: +o dxrt [00:18] *** Fusl sets mode: +o svchfoo1 [00:18] *** Fusl sets mode: +o ivan_ [00:18] *** Fusl sets mode: +o dxrt_ [00:18] *** Fusl sets mode: +o AlsoJAA [00:18] *** Fusl sets mode: +o arkiver [00:18] *** Fusl sets mode: +o SketchCow [00:18] *** Fusl sets mode: +o Fusl_ [00:18] *** Fusl sets mode: +o hook54321 [00:18] *** Fusl sets mode: +o Kaz [00:18] *** Fusl sets mode: +o HCross [02:18] *** qw3rty111 has joined #internetarchive [02:24] *** qw3rty119 has quit IRC (Read error: Operation timed out) [02:59] *** Flashfire has quit IRC (Remote host closed the connection) [02:59] *** kiska has quit IRC (Remote host closed the connection) [03:00] *** Flashfire has joined #internetarchive [03:00] *** kiska has joined #internetarchive [03:00] *** Fusl sets mode: +o kiska [03:00] *** Fusl_ sets mode: +o kiska [03:29] *** qw3rty112 has joined #internetarchive [03:35] *** qw3rty111 has quit IRC (Read error: Operation timed out) [03:51] *** odemg has quit IRC (Read error: Operation timed out) [04:06] *** odemg has joined #internetarchive [09:04] *** Cameron_D has quit IRC (Ping timeout: 1212 seconds) [10:03] *** Cameron_D has joined #internetarchive [13:56] The Internet Archive clearly doesn't like wget 1.20's WARC files, which added the angle brackets to WARC-Target-URI: http://web.archive.org/web/20190626051544/http://%3Chttps//nratv2api.nra.tv/series/bill-whittles-hot-mic/episode-data/1-1-2013%3E [13:56] Coming from https://archive.org/details/nratv2api.nra.tv-episode-date-2018-2019-json-20190625 [14:37] are angle brackets in-spec? [14:44] "It's complicated." [14:45] https://github.com/webrecorder/pywb/issues/294 [14:47] The spec technically says there have to be angle brackets, but not a single implementation ever did that. A wget dev noticed this and, instead of raising a discussion about this and getting the standard amended, proceeded to add the brackets, breaking every single WARC tool. [14:48] At least the records are accessible in the WBM. But still... [14:50] the specs were wrong [14:50] In recent deduplication code I´ve added some stuff to remove the brackets [14:50] but yeah [14:50] there´s been a message about this error in the WARC specs I believe [14:51] Big challenge now is to make wget adopt the new standard and fix this error [14:53] Yes, it got fixed in WARC/1.1. [14:54] Worth mentioning that all examples in the 1.0 spec had no angle brackets, but the BNF grammar had them. [14:55] ABNF* [14:57] yeah [14:58] I think this is the only backwards incompatibility between the two versions, so moving wget to WARC/1.1 should really just be removing those angle brackets and bumping the spec version. [17:36] *** Dj-Wawa has quit IRC (Quit: Connection closed for inactivity) [17:41] *** systwi_ is now known as systwi [18:18] *** Dj-Wawa has joined #internetarchive [19:05] *** deevious has quit IRC (Ping timeout: 252 seconds) [19:19] *** Stiletto has quit IRC (Read error: Connection reset by peer) [19:28] *** Stiletto has joined #internetarchive [23:57] *** Flashfire has quit IRC (Remote host closed the connection) [23:57] *** kiska has quit IRC (Remote host closed the connection) [23:58] *** Flashfire has joined #internetarchive [23:58] *** kiska has joined #internetarchive [23:58] *** Fusl sets mode: +o kiska [23:58] *** Fusl_ sets mode: +o kiska