[00:07] are they claiming that those links infringe, or that they have exclusive rights to all those things? [00:08] oh, yeah, the former [00:18] And they can do this without fear of legal repercussions [00:27] yes, you have to prove bad faith, not mere idiocy [00:29] Like i could claim i own the rights to half of the internet archive [00:29] and it'd only be perjury if i lied about who i am, not what i hold rights to [00:30] Would ignoring a clearly erroneous claim like that create liability? [00:31] Yes, this is the place to discuss law [00:31] -bs [00:31] i am so tired, been up all night coding [00:51] SketchCow: i'm uploading more G4 Icons series [00:55] also looks like i'm getting a korean cartoon [00:56] this is part of the e_history_korea_StreamRoot_MH collection [01:20] *** BiggieJon has quit IRC (Read error: Operation timed out) [01:27] *** BiggieJon has joined #archiveteam-bs [01:44] *** Start has joined #archiveteam-bs [02:54] *** kyan has joined #archiveteam-bs [02:56] *** primus104 has quit IRC (Leaving.)
[03:00] *** BiggieJon has quit IRC (Read error: Operation timed out) [03:04] *** mistym has quit IRC (Remote host closed the connection) [03:05] *** mistym has joined #archiveteam-bs [03:05] *** mistym has quit IRC (Remote host closed the connection) [03:09] *** mistym has joined #archiveteam-bs [03:12] *** mistym has quit IRC (Remote host closed the connection) [03:14] *** BiggieJon has joined #archiveteam-bs [03:42] *** BlueMaxim has joined #archiveteam-bs [03:44] *** mistym has joined #archiveteam-bs [04:13] daaaang it's cold outside [04:38] *** C-apple-a is now known as C-apple [04:52] *** BlueMaxim has quit IRC (Ping timeout: 370 seconds) [04:53] *** BlueMaxim has joined #archiveteam-bs [05:06] *** Nertsy has quit IRC (Quit: Nertsy) [05:09] *** mistym has quit IRC (Remote host closed the connection) [05:15] *** aaaaaaaaa has quit IRC (Leaving) [05:34] *** mistym has joined #archiveteam-bs [05:45] *** ben_ has quit IRC (Read error: Connection reset by peer) [05:45] *** ex-parrot has joined #archiveteam-bs [05:59] baby it's cold outside [06:00] there's no kind of datasphere [06:01] Web ain't the kind of place to raise your kids [06:02] in fact, it's cold as hell [06:03] *** bsmith093 has joined #archiveteam-bs [06:17] *** mistym_ has joined #archiveteam-bs [06:18] *** ionpulse has joined #archiveteam-bs [06:19] *** mistym has quit IRC (Read error: Operation timed out) [06:21] http://www.funstock.co.uk/commodore-amiga-a-visual-commpendium-book-collectors-edition just came across this book and now I want it so goddamn bad [06:24] BlueMaxim: I have the C64 version, it's so good [06:24] thinking about getting the amiga one too [06:25] ex-parrot, out of curiosity, if I wanted to dig into the c64 library, would just playing every game in the c64 book do the job? (I assume you know a lot about the c64) [06:26] everything in there is a classic pretty much [06:26] it'd certainly be a good start [06:26] ...a good start? A 200+ page book is a good start? 
how many goddamn games did the c64 have?! [06:26] 10,000+ iirc [06:27] ain't nobody got time for that >___> [06:28] I've got an sd2iec hooked up to my SX-64 with a good few thousand D64s in it [06:28] most of 'em I've never heard of, of course :) [06:28] this isn't completely exhaustive but http://www.gamebase64.com/search.php?h=0 [06:30] *** antomatic has quit IRC (Read error: Operation timed out) [06:30] *** antomatic has joined #archiveteam-bs [06:31] heeeeeesh [06:32] how many of those games are worth playing though :P [06:33] god I wanna buy both these books but it comes to $125 Australian ._. [06:35] *** mistym_ has quit IRC (Remote host closed the connection) [06:36] *** mistym has joined #archiveteam-bs [06:37] *** sep332 has quit IRC (Read error: Operation timed out) [06:52] *** GLaDOS has quit IRC (Ping timeout: 246 seconds) [07:43] *** Baljem has quit IRC (ircd.choopa.net irc.teksavvy.ca) [07:43] *** Kazzy has quit IRC (ircd.choopa.net irc.teksavvy.ca) [07:43] *** closure has quit IRC (ircd.choopa.net irc.teksavvy.ca) [07:43] *** Baljem_ has joined #archiveteam-bs [07:44] *** sep332 has joined #archiveteam-bs [07:44] *** Kazzy_ has joined #archiveteam-bs [07:50] *** mutoso has quit IRC (Read error: Operation timed out) [07:56] *** mutoso has joined #archiveteam-bs [07:59] *** Kazzy_ is now known as Kazzy [08:08] C-apple: so we get to save ~50% on space while not losing indexability or seekability [08:08] that's the main reason [08:08] OK-- bzip2 is not as happy when it gets concatenated? [08:09] *** mistym has quit IRC (Remote host closed the connection) [08:10] I don't know why bzip2 isn't used [08:10] warc.gz was present in IA when we started using WARCs so we just did that [08:11] Ah, OK. [08:13] So WARC is always concatenated gzips, or sometimes just one gzip with a WARC format inside?
[08:16] a WARC file itself is a sequence of WARC records, which are plaintext [08:16] *** primus104 has joined #archiveteam-bs [08:16] a warc.gz is typically a sequence of gzipped WARC records [08:16] a (solid) gzipped WARC can't really be indexed [08:16] the sequence-of-gzipped-records bit is important for indexing and seeking [08:19] OK. The WARC records contain the content and metadata, or the content is in other files that get gzipped with specific filenames too? (Maybe I should just be asking if there's a simple summary of WARC files instead of having to wade through the full spec.) [08:20] the full spec is probably the best place; the WARCs you'll find around here use most of the record types [08:21] the bulk of most WARCs is a sequence of request and response records [08:21] they aren't necessarily interleaved [08:22] each request/response contains a WARC header that describes e.g. which response goes to which request, which requests are related [08:24] Ah, OK. And the content is in there somewhere even if it's binary, I assume. [08:25] yes, typically in the response [08:25] you can download a WARC and page through it to get an idea of the structure at a textual level [08:25] http://archive-access.sourceforge.net/warc/WARC_ISO_28500_final_draft%20v018%20Zentveld%20080618.doc is the latest ISO draft [08:26] .doc! [08:26] https://archive.org/download/archiveteam_archivebot_go_20150223210001/gist.github.com-shallow-20150223-102033-a85hd-00000.warc.gz is a smallish one [08:26] The things we do for love... [08:27] all ISO drafts I've seen are .docs [08:27] it must be some ISO thing [08:27] anyway, https://archive.org/download/archiveteam_archivebot_go_20150223210001 is the source collection for that WARC; you'll also see .os.cdx.gz, -meta.warc.gz, and -meta.warc.os.cdx.gz files with that same basename [08:27] From the institution that brought you OOXML as a standard.
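(Editor's sketch, not part of the log.) The point above about a warc.gz being a sequence of gzipped records, rather than one solid gzip stream, can be illustrated roughly as follows. This is a minimal sketch using only the Python standard library; the records are toy placeholders rather than valid WARC records, and the names `write_record_gz` and `read_record_at` are hypothetical helpers made up for illustration, not part of wpull, wget, or any IA tooling.

```python
import gzip
import io
import zlib


def write_record_gz(records):
    """Compress each record as its own gzip member and concatenate them.

    Returns the concatenated bytes and the byte offset where each
    member starts, which is the kind of information an index stores.
    """
    buf = io.BytesIO()
    offsets = []
    for rec in records:
        offsets.append(buf.tell())      # start of this record's gzip member
        buf.write(gzip.compress(rec))   # one complete gzip member per record
    return buf.getvalue(), offsets


def read_record_at(blob, offset):
    """Decompress only the gzip member that starts at `offset`.

    wbits = MAX_WBITS | 16 tells zlib to expect gzip framing. The
    decompressor stops at the end of that member; bytes belonging to
    later members are left in `unused_data` and never decompressed.
    """
    d = zlib.decompressobj(zlib.MAX_WBITS | 16)
    return d.decompress(blob[offset:])


# Toy stand-ins for WARC records (assumption: not spec-complete).
records = [
    b"WARC/1.0\r\nWARC-Type: request\r\n\r\nGET / HTTP/1.1\r\n\r\n",
    b"WARC/1.0\r\nWARC-Type: response\r\n\r\nHTTP/1.1 200 OK\r\n\r\nhello",
]

blob, offsets = write_record_gz(records)

# Seek straight to the second record without touching the first.
second = read_record_at(blob, offsets[1])

# The concatenation is still a valid multi-member gzip file as a whole.
everything = gzip.decompress(blob)
```

With a single solid gzip stream there would be no member boundaries to jump to, so reaching the second record would mean decompressing everything before it; that is the seekability trade-off described above.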
[08:27] .cdx.gz is a WARC index, which is generated by IA [08:27] -meta.warc.gz is generated by wpull [08:28] it contains information about the wpull job, like the wpull log, as a WARC [08:28] -meta.warc.os.cdx.gz is an index of the meta-WARC which so far isn't used by anything AFAIK [08:28] -meta.warc.gz is not required by any WARC standard but it is nice to have [08:29] What does wpull do for us vs. wget? [08:29] a few things: Python hooks, concurrent fetch, phantomjs integration, written in Python (often easier to read and maintain), has unit tests [08:30] database-as-URL queue [08:30] (wget maintains it in memory) [08:30] archivebot used to use wget+lua and now uses wpull for most of the above reasons [08:31] also wpull's maintainer hangs out here so that's a big point in its favor [08:31] Ah-- and wget still doesn't have a fast way to figure out where in the queue it left off, does it? [08:31] I don't know [08:35] OK, so we like wpull, and it will generate the .cdx files also? [08:36] wpull can generate a cdx but in current workflows a cdx is typically generated by the viewer program [08:36] wayback, pywb, warc-proxy, etc [08:42] *** godane has quit IRC (Read error: Operation timed out) [08:42] OK, so as far as grabbing it, I wouldn't care much about .cdx? [08:43] probably not needed [08:57] *** godane has joined #archiveteam-bs [09:02] *** zenguy_pc has joined #archiveteam-bs [09:04] *** primus104 has quit IRC (Leaving.) [09:06] *** boozehoun has quit IRC (Ping timeout: 512 seconds) [09:10] https://webrecorder.io/ is excellent! 
[09:10] uploaded one of my warcs as a test, works really well; https://webrecorder.io/replay/20150205112919/http://puppylinux.ic.cz/ [09:10] s/;/:/ [09:24] neat [09:29] *** rejon has quit IRC (Ping timeout: 512 seconds) [09:38] *** rejon has joined #archiveteam-bs [10:00] *** schbirid has joined #archiveteam-bs [11:18] *** Sk1d has quit IRC (Ping timeout: 265 seconds) [11:25] *** Sk1d has joined #archiveteam-bs [11:30] *** Sk2d has joined #archiveteam-bs [11:32] *** Sk1d has quit IRC (Read error: Operation timed out) [11:32] *** Sk2d is now known as Sk1d [11:37] *** Sk1d has quit IRC (Ping timeout: 265 seconds) [11:41] *** Sk1d has joined #archiveteam-bs [11:47] *** Sk2d has joined #archiveteam-bs [11:48] *** Sk1d has quit IRC (Read error: Operation timed out) [11:53] *** Sk2d has quit IRC (Ping timeout: 265 seconds) [11:54] *** Sk1d has joined #archiveteam-bs [11:58] *** dashcloud has quit IRC (Ping timeout: 246 seconds) [12:01] *** dashcloud has joined #archiveteam-bs [12:01] *** Sk2d has joined #archiveteam-bs [12:04] *** Sk1d has quit IRC (Read error: Operation timed out) [12:04] *** Sk2d is now known as Sk1d [12:07] *** primus104 has joined #archiveteam-bs [12:09] *** Sk1d has quit IRC (Ping timeout: 265 seconds) [12:09] *** dashcloud has quit IRC (Read error: Operation timed out) [12:11] *** Sk1d has joined #archiveteam-bs [12:12] *** dashcloud has joined #archiveteam-bs [12:17] *** Sk2d has joined #archiveteam-bs [12:18] *** Sk1d has quit IRC (Read error: Operation timed out) [12:18] *** Sk2d is now known as Sk1d [12:42] *** Sk1d has quit IRC (Ping timeout: 265 seconds) [12:45] *** Sk1d has joined #archiveteam-bs [12:53] *** Sk1d has quit IRC (Read error: Operation timed out) [12:54] *** Sk1d has joined #archiveteam-bs [13:01] *** Sk1d has quit IRC (Read error: Operation timed out) [13:02] *** Sk1d has joined #archiveteam-bs [13:06] *** dashcloud has quit IRC (Read error: Operation timed out) [13:10] *** dashcloud has joined #archiveteam-bs [13:17] 
http://envisage-project.eu/proving-android-java-and-python-sorting-algorithm-is-broken-and-how-to-fix-it/ [13:17] *** Sk1d has quit IRC (Ping timeout: 265 seconds) [13:20] *** Sk1d has joined #archiveteam-bs [13:25] *** closure has joined #archiveteam-bs [13:26] *** Sk2d has joined #archiveteam-bs [13:30] *** Sk1d has quit IRC (Read error: Operation timed out) [13:30] *** Sk2d is now known as Sk1d [13:31] *** dashcloud has quit IRC (Read error: Operation timed out) [13:36] *** dashcloud has joined #archiveteam-bs [13:47] *** sankin has joined #archiveteam-bs [13:49] *** rejon has quit IRC (Remote host closed the connection) [13:51] *** Sk1d has quit IRC (Ping timeout: 265 seconds) [13:53] *** primus104 has quit IRC (Leaving.) [13:55] *** Sk1d has joined #archiveteam-bs [14:00] *** Sk2d has joined #archiveteam-bs [14:02] *** Sk1d has quit IRC (Read error: Operation timed out) [14:06] *** Sk2d has quit IRC (Ping timeout: 265 seconds) [14:07] *** Sk1d has joined #archiveteam-bs [14:10] *** Sk1d has quit IRC (Read error: Operation timed out) [14:14] *** Sk1d has joined #archiveteam-bs [14:19] *** Sk1d has quit IRC (Ping timeout: 265 seconds) [14:21] *** sankin has quit IRC (Leaving.) [14:21] *** Sk1d has joined #archiveteam-bs [14:24] *** Sk1d has quit IRC (Read error: Operation timed out) [14:29] *** Sk1d has joined #archiveteam-bs [14:34] *** Sk1d has quit IRC (Ping timeout: 265 seconds) [14:38] *** Sk1d has joined #archiveteam-bs [14:46] *** Sk1d has quit IRC (Read error: Operation timed out) [14:49] *** Sk1d has joined #archiveteam-bs [14:49] *** dashcloud has quit IRC (Ping timeout: 240 seconds) [14:54] *** Sk1d has quit IRC (Ping timeout: 265 seconds) [14:55] *** dashcloud has joined #archiveteam-bs [14:56] *** BiggieJo1 has joined #archiveteam-bs [14:58] *** Sk1d has joined #archiveteam-bs [15:01] *** BiggieJon has quit IRC (Ping timeout: 600 seconds) [15:21] *** sankin has joined #archiveteam-bs [15:24] *** Start has quit IRC (Disconnected.) 
[15:32] *** mistym has joined #archiveteam-bs [15:35] *** primus104 has joined #archiveteam-bs [15:36] *** BiggieJon has joined #archiveteam-bs [15:37] *** mistym has quit IRC (Read error: Operation timed out) [15:37] *** mistym has joined #archiveteam-bs [15:39] *** BiggieJo1 has quit IRC (Ping timeout: 600 seconds) [15:40] *** mistym has quit IRC (Remote host closed the connection) [15:46] *** dashcloud has quit IRC (Read error: Operation timed out) [15:49] *** dashcloud has joined #archiveteam-bs [16:01] *** mistym has joined #archiveteam-bs [16:02] *** Start has joined #archiveteam-bs [16:15] *** Smiley has joined #archiveteam-bs [16:20] *** primus104 has quit IRC (hub.se irc.efnet.pl) [16:20] *** schbirid has quit IRC (hub.se irc.efnet.pl) [16:20] *** miljo has quit IRC (hub.se irc.efnet.pl) [16:20] *** S[h]O[r]T has quit IRC (hub.se irc.efnet.pl) [16:20] *** primus has quit IRC (hub.se irc.efnet.pl) [16:20] *** Coderjoe has quit IRC (hub.se irc.efnet.pl) [16:20] *** SmileyG has quit IRC (hub.se irc.efnet.pl) [16:20] *** altlabel has quit IRC (hub.se irc.efnet.pl) [16:23] *** BlueMaxim has quit IRC (Read error: Operation timed out) [16:24] *** BlueMaxim has joined #archiveteam-bs [16:25] *** primus_ has joined #archiveteam-bs [16:26] *** altlabel_ has joined #archiveteam-bs [16:26] *** aaaaaaaaa has joined #archiveteam-bs [16:29] *** dashcloud has quit IRC (Ping timeout: 306 seconds) [16:30] *** dashcloud has joined #archiveteam-bs [16:35] *** S[h]O[r]T has joined #archiveteam-bs [16:38] *** schbirid has joined #archiveteam-bs [16:40] *** closure has quit IRC (Ping timeout: 306 seconds) [16:41] *** closure has joined #archiveteam-bs [16:51] *** mistym has quit IRC (Remote host closed the connection) [16:52] *** Start has quit IRC (Disconnected.) 
[16:55] *** Coderjoe has joined #archiveteam-bs [16:55] *** Jonimus has quit IRC (Write error: Broken pipe) [16:56] *** Laverne has quit IRC (Read error: Operation timed out) [16:57] *** atlogbot has quit IRC (Ping timeout: 369 seconds) [16:58] *** primus104 has joined #archiveteam-bs [17:00] *** miljo has joined #archiveteam-bs [17:01] *** dashcloud has quit IRC (Read error: Operation timed out) [17:02] *** Jonimus has joined #archiveteam-bs [17:07] *** mistym has joined #archiveteam-bs [17:08] *** dashcloud has joined #archiveteam-bs [17:11] *** Lord_Nigh has quit IRC (Ping timeout: 246 seconds) [17:22] *** Lord_Nigh has joined #archiveteam-bs [17:23] *** chazchaz has quit IRC (Ping timeout: 369 seconds) [17:23] *** C-apple has quit IRC (Quit: Woohoo.) [17:24] *** dcmorton has quit IRC (Read error: Operation timed out) [17:35] *** dashcloud has quit IRC (Quit: No Ping reply in 180 seconds.) [17:36] *** dashcloud has joined #archiveteam-bs [17:41] *** chazchaz has joined #archiveteam-bs [17:41] *** atlogbot has joined #archiveteam-bs [17:42] *** Jonimus has quit IRC (Read error: Operation timed out) [17:45] *** dcmorton has joined #archiveteam-bs [17:47] *** Nertsy has joined #archiveteam-bs [17:50] *** Laverne has joined #archiveteam-bs [17:50] *** Nertsy has quit IRC (Client Quit) [17:52] *** Nertsy has joined #archiveteam-bs [17:55] *** Start has joined #archiveteam-bs [18:31] so i got a german CD with Quake and other FPS files from 1996 off ebay. i have never seen it before. it is still wrapped. [18:32] time to ruin its collectors value! [18:32] omg [18:32] if i was a music company, i would be bankrupt now [18:39] *** mschfr has joined #archiveteam-bs [18:42] *** Start has quit IRC (Disconnected.) 
[18:59] *** thechip has quit IRC (Ping timeout: 252 seconds) [19:03] Give me [19:10] *** wp494_ has joined #archiveteam-bs [19:10] *** wp494_ has quit IRC (Excess Flood) [19:10] *** wp494_ has joined #archiveteam-bs [19:10] *** wp494_ has quit IRC (Excess Flood) [19:11] *** BlueMaxim has quit IRC (ircd.shaw.ca irc.shaw.ca) [19:11] *** xtr-201 has quit IRC (ircd.shaw.ca irc.shaw.ca) [19:11] *** pikhq has quit IRC (ircd.shaw.ca irc.shaw.ca) [19:11] *** underscor has quit IRC (ircd.shaw.ca irc.shaw.ca) [19:11] *** wp494 has quit IRC (ircd.shaw.ca irc.shaw.ca) [19:11] *** lytv has quit IRC (ircd.shaw.ca irc.shaw.ca) [19:11] *** dx has quit IRC (ircd.shaw.ca irc.shaw.ca) [19:11] *** rduser has quit IRC (ircd.shaw.ca irc.shaw.ca) [19:11] *** DFJustin has quit IRC (ircd.shaw.ca irc.shaw.ca) [19:11] *** SadDM has quit IRC (ircd.shaw.ca irc.shaw.ca) [19:11] *** useretail has quit IRC (ircd.shaw.ca irc.shaw.ca) [19:11] *** torvik has quit IRC (ircd.shaw.ca irc.shaw.ca) [19:11] *** wp494_ has joined #archiveteam-bs [19:14] *** wm_ has quit IRC (Ping timeout: 240 seconds) [19:17] *** Kirk has quit IRC (Ping timeout: 240 seconds) [19:17] *** Nertsy has quit IRC (Ping timeout: 512 seconds) [19:19] *** Nertsy has joined #archiveteam-bs [19:27] *** BlueMaxim has joined #archiveteam-bs [19:29] *** lytv has joined #archiveteam-bs [19:32] *** raylee has quit IRC (Ping timeout: 240 seconds) [19:34] *** Kirk has joined #archiveteam-bs [19:36] *** wm_ has joined #archiveteam-bs [19:36] *** raylee has joined #archiveteam-bs [19:38] *** DFJustin has joined #archiveteam-bs [19:46] *** rduser has joined #archiveteam-bs [19:49] *** raylee has quit IRC (Ping timeout: 240 seconds) [19:51] *** raylee has joined #archiveteam-bs [19:53] *** useretail has joined #archiveteam-bs [19:53] *** underscor has joined #archiveteam-bs [19:53] *** dx has joined #archiveteam-bs [19:53] *** torvik has joined #archiveteam-bs [19:53] *** irc.shaw.ca sets mode: +o torvik [19:54] *** xtr-201 has joined 
#archiveteam-bs [19:55] *** SadDM has joined #archiveteam-bs [19:56] *** pikhq has joined #archiveteam-bs [19:58] *** mschfr has quit IRC (Ping timeout: 240 seconds) [20:24] *** thechip has joined #archiveteam-bs [20:55] *** schbirid has quit IRC (Quit: Leaving) [21:12] *** atlogbot has quit IRC (Remote host closed the connection) [21:12] *** swebb has quit IRC (badcheese.com - where crap sometimes gets done) [21:14] *** swebb has joined #archiveteam-bs [21:23] *** swebb has left ["Textual IRC Client: www.textualapp.com"] [21:23] *** swebb has joined #archiveteam-bs [21:23] *** swebb has left ["Textual IRC Client: www.textualapp.com"] [21:23] *** swebb has joined #archiveteam-bs [21:24] *** BlueMaxim has quit IRC (Quit: Leaving) [21:32] *** xmc sets mode: +o swebb [22:00] *** BlueMaxim has joined #archiveteam-bs [22:07] *** sankin has quit IRC (Leaving.) [22:52] *** wp494_ is now known as wp494 [23:52] so i just found out the 1988 olympics was in south korea [23:52] i'm getting a lot of videos about it