#internetarchive 2019-08-07,Wed

Time Nickname Message
00:08 yuitimoth has quit IRC (hub.efnet.us irc.efnet.nl)
00:08 Fusl has quit IRC (hub.efnet.us irc.efnet.nl)
00:08 jodizzle has quit IRC (hub.efnet.us irc.efnet.nl)
00:09 HUBI_ has quit IRC (Read error: Operation timed out)
00:11 HUBI_ has joined #internetarchive
00:13 yuitimoth has joined #internetarchive
00:13 Fusl has joined #internetarchive
00:13 jodizzle has joined #internetarchive
00:13 irc.efnet.nl sets mode: +o Fusl
00:14 Fusl_ sets mode: +o Fusl
00:17 Fusl sets mode: +o kiska1
00:17 Fusl sets mode: +o kiskabak
00:17 Fusl sets mode: +o kiska
00:17 Fusl sets mode: +o JAA
00:18 Fusl sets mode: +o svchfoo3
00:18 Fusl sets mode: +o jrwr
00:18 Fusl sets mode: +o dxrt
00:18 Fusl sets mode: +o svchfoo1
00:18 Fusl sets mode: +o ivan_
00:18 Fusl sets mode: +o dxrt_
00:18 Fusl sets mode: +o AlsoJAA
00:18 Fusl sets mode: +o arkiver
00:18 Fusl sets mode: +o SketchCow
00:18 Fusl sets mode: +o Fusl_
00:18 Fusl sets mode: +o hook54321
00:18 Fusl sets mode: +o Kaz
00:18 Fusl sets mode: +o HCross
02:18 qw3rty111 has joined #internetarchive
02:24 qw3rty119 has quit IRC (Read error: Operation timed out)
02:59 Flashfire has quit IRC (Remote host closed the connection)
02:59 kiska has quit IRC (Remote host closed the connection)
03:00 Flashfire has joined #internetarchive
03:00 kiska has joined #internetarchive
03:00 Fusl sets mode: +o kiska
03:00 Fusl_ sets mode: +o kiska
03:29 qw3rty112 has joined #internetarchive
03:35 qw3rty111 has quit IRC (Read error: Operation timed out)
03:51 odemg has quit IRC (Read error: Operation timed out)
04:06 odemg has joined #internetarchive
09:04 Cameron_D has quit IRC (Ping timeout: 1212 seconds)
10:03 Cameron_D has joined #internetarchive
13:56 JAA The Internet Archive clearly doesn't like wget 1.20's WARC files, which added the angle brackets to WARC-Target-URI: http://web.archive.org/web/20190626051544/http://%3Chttps//nratv2api.nra.tv/series/bill-whittles-hot-mic/episode-data/1-1-2013%3E
13:56 JAA Coming from https://archive.org/details/nratv2api.nra.tv-episode-date-2018-2019-json-20190625
14:37 ivan_ are angle brackets in-spec?
14:44 JAA "It's complicated."
14:45 JAA https://github.com/webrecorder/pywb/issues/294
14:47 JAA The spec technically says there have to be angle brackets, but not a single implementation ever did that. A wget dev noticed this and, instead of raising a discussion about this and getting the standard amended, proceeded to add the brackets, breaking every single WARC tool.
14:48 JAA At least the records are accessible in the WBM. But still...
14:50 arkiver the specs were wrong
14:50 arkiver In recent deduplication code I've added some stuff to remove the brackets
14:50 arkiver but yeah
14:50 arkiver there's been a message about this error in the WARC specs I believe
14:51 arkiver Big challenge now is to make wget adopt the new standard and fix this error
14:53 JAA Yes, it got fixed in WARC/1.1.
14:54 JAA Worth mentioning that all examples in the 1.0 spec had no angle brackets, but the BNF grammar had them.
14:55 JAA ABNF*
14:57 arkiver yeah
14:58 JAA I think this is the only backwards incompatibility between the two versions, so moving wget to WARC/1.1 should really just be removing those angle brackets and bumping the spec version.
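
[Editor's note] The bracket removal discussed above could look roughly like the sketch below. It is not arkiver's deduplication code or anything taken from wget; the function names and the example.org URL are hypothetical, and it assumes headers arrive as plain "Name: value" text lines. It only shows the idea of accepting both the bracketed form (as written by wget 1.20 following the WARC/1.0 ABNF) and the bare form (as in the 1.0 examples and WARC/1.1).

```python
def strip_uri_brackets(value: str) -> str:
    """Return a WARC-Target-URI value without a surrounding <...> pair.

    Values that are not wrapped in angle brackets are returned unchanged
    (apart from surrounding whitespace being trimmed).
    """
    value = value.strip()
    if len(value) >= 2 and value.startswith("<") and value.endswith(">"):
        return value[1:-1]
    return value


def normalize_header_line(line: str) -> str:
    """Normalize a single 'Name: value' WARC header line.

    Only WARC-Target-URI is touched; every other header passes through
    untouched. Trailing CRLF on the target line is dropped, so the caller
    is assumed to re-add line endings when writing records back out.
    """
    name, sep, value = line.partition(":")
    if sep and name.strip().lower() == "warc-target-uri":
        return f"{name}: {strip_uri_brackets(value)}"
    return line


if __name__ == "__main__":
    # Hypothetical header line in the bracketed style.
    print(normalize_header_line("WARC-Target-URI: <https://example.org/a>"))
```

A tool that reads WARCs from mixed sources would apply this on read, so that bracketed and bare target URIs compare equal during lookups or deduplication.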
17:36 Dj-Wawa has quit IRC (Quit: Connection closed for inactivity)
17:41 systwi_ is now known as systwi
18:18 Dj-Wawa has joined #internetarchive
19:05 deevious has quit IRC (Ping timeout: 252 seconds)
19:19 Stiletto has quit IRC (Read error: Connection reset by peer)
19:28 Stiletto has joined #internetarchive
23:57 Flashfire has quit IRC (Remote host closed the connection)
23:57 kiska has quit IRC (Remote host closed the connection)
23:58 Flashfire has joined #internetarchive
23:58 kiska has joined #internetarchive
23:58 Fusl sets mode: +o kiska
23:58 Fusl_ sets mode: +o kiska
