[00:17] *** MrPenguin has quit IRC (Ping timeout: 240 seconds) [00:52] *** Spacedawg has quit IRC (hub.dk irc.homelien.no) [00:52] *** Wyatt has quit IRC (hub.dk irc.homelien.no) [00:52] *** Ymgve has quit IRC (hub.dk irc.homelien.no) [00:52] *** Meeh has quit IRC (hub.dk irc.homelien.no) [00:52] *** Pythia has quit IRC (hub.dk irc.homelien.no) [00:52] *** mafrasi2 has quit IRC (hub.dk irc.homelien.no) [00:52] *** i0npulse_ has quit IRC (hub.dk irc.homelien.no) [00:53] *** JesseW has joined #archiveteam [00:57] *** Spacedawg has joined #archiveteam [00:57] *** Wyatt has joined #archiveteam [00:57] *** Ymgve has joined #archiveteam [00:57] *** Meeh has joined #archiveteam [00:57] *** Pythia has joined #archiveteam [00:57] *** mafrasi2 has joined #archiveteam [00:57] *** i0npulse_ has joined #archiveteam [01:03] *** xk_id has quit IRC (Remote host closed the connection) [01:04] *** xk_id has joined #archiveteam [01:14] *** xk_id has quit IRC (Read error: Connection reset by peer) [01:23] *** primus104 has quit IRC (Leaving.) [01:41] *** JesseW has quit IRC (Read error: Operation timed out) [01:50] *** Atom__ has joined #archiveteam [02:30] SketchCow: i *think* the upload to fos is done, it just occurred to me that my pv had 500gb as the size, which was stupid because it was piping gzipped data. [02:30] SketchCow: can you check the integrity [02:35] DFJustin: how do i check if my stupidly huge gzip file is valid? [02:44] *** JesseW has joined #archiveteam [02:47] gzip -t [02:47] could take forever to finish on a file that large [02:51] aaaaaaaaa: running anyway [02:52] *** vitzli has joined #archiveteam [02:53] aaaaaaaaa: doesnt seem to be taking much memory [02:56] its always fun to stumble on an old looking webpage [02:56] http://www.procolharum.com/ [02:57] I don't believe gzip uses much memory due to the hardware it had to run on when invented [02:57] didn't it replace 'compress'? [02:58] I've added a section on dumping DOS/Windows 3.5'' (and possibly 5.25'') floppies using Linux to the digitizetheplanet wiki: http://digitize.archiveteam.org/index.php/Floppy_Disks . Comments welcome! [02:58] I think it was an unencumbered version or something like that [02:58] i dont think most *nixes even come with compress anymore [02:58] oh yeah, patents [03:00] Said patents have expired. [03:00] FWIW, if you really need to read a compress archive, gzip decompresses them. [03:00] yeah, those were the LZW patents right? [03:00] Yep. [03:00] the ones that encumbered GIF [03:00] (gzip has *always* decompressed them -- the patents applied to compression only) [03:01] oh, it is unencumbered and uses a slightly different compression scheme, LZ77 + huffman coding vs LZW [03:01] It's a more efficient compression scheme, as well. [03:02] And yeah, it shouldn't take much memory. The window is 64k. [03:03] (because the algorithm is from pkzip, which was for DOS.) [03:04] wait, so by renaming a .gz file to a .zip can windows unpack it? [03:04] or is it modified? [03:05] No. The actual container format is different, they just happen to use the same compression algorithm. [03:05] In this regard it's similar to .xz vs .lzma. [03:05] (actually, better example, .xz vs .7z) [03:05] *** Guest100 has joined #archiveteam [03:05] ah [03:05] now I get it [03:06] I believe any further conversation is firmly in -bs territory [03:06] gzip has a short header and then a bunch of compressed data. ZIP has a footer, which points to directory structures, which point to files, which are a short header followed by compressed data; all together in a single file. [03:06] 'container' instantly triggers memories of ffmpeg because I do so much video encoding, but they work similarly there - containers can have different or the same algorithms [03:06] Kk [03:06] aaaaaaaaa: okay [04:11] *** Start has joined #archiveteam [04:18] *** aaaaaaaaa has quit IRC (Leaving) [04:55] *** Guest100 has quit IRC (Quit: My Mac has gone to sleep. ZZZzzz…) [05:12] *** Spacedawg has quit IRC (Ping timeout: 506 seconds) [05:24] *** antomati_ has joined #archiveteam [05:24] *** swebb sets mode: +o antomati_ [05:24] *** RichardG_ has joined #archiveteam [05:25] *** SmileyG has joined #archiveteam [05:25] *** Sanqui has quit IRC (Read error: Operation timed out) [05:25] *** cloudmons has quit IRC (Read error: Operation timed out) [05:25] *** Infreq has quit IRC (Read error: Operation timed out) [05:26] *** lrkj has quit IRC (Read error: Operation timed out) [05:26] *** fenn has quit IRC (Read error: Operation timed out) [05:26] *** pfallenop has quit IRC (Read error: Operation timed out) [05:26] *** PepsiMax has quit IRC (Read error: Operation timed out) [05:26] *** Infreq has joined #archiveteam [05:26] *** edsu has quit IRC (Read error: Operation timed out) [05:26] *** Zebranky has quit IRC (Read error: Operation timed out) [05:26] *** Smiley has quit IRC (Read error: Operation timed out) [05:26] *** tephra_ has quit IRC (Read error: Operation timed out) [05:26] *** Zebranky has joined #archiveteam [05:26] *** ersi has quit IRC (Read error: Operation timed out) [05:26] *** antomatic has quit IRC (Read error: Operation timed out) [05:26] *** Nemo_bis has quit IRC (Read error: Operation timed out) [05:26] *** Peetz0r has quit IRC (Read error: Operation timed out) [05:26] *** SketchCow has quit IRC (Read error: Operation timed out) [05:27] *** Peetz0r has joined #archiveteam [05:27] *** lrkj has joined #archiveteam [05:27] *** Jogie has quit IRC (Read error: Operation timed out) [05:27] *** lbft has quit IRC (Read error: Operation timed out) [05:27] *** RichardG has quit IRC (Read error: Operation timed out) [05:27] *** Jogie has joined #archiveteam [05:28] *** Sanqui has joined #archiveteam [05:28] *** lbft has joined #archiveteam [05:28] *** Baljem has quit IRC (Read error: Operation timed out) [05:29] *** Baljem has joined #archiveteam [05:29] *** chfoo- has quit IRC (Read error: Operation timed out) [05:29] *** chfoo- has joined #archiveteam [05:30] *** chazchaz_ has quit IRC (Read error: Operation timed out) [05:31] *** edsu has joined #archiveteam [05:31] *** swebb sets mode: +o edsu [05:31] *** robink has quit IRC (Ping timeout: 492 seconds) [05:31] *** tephra has joined #archiveteam [05:31] *** Nemo_bis has joined #archiveteam [05:31] *** ersi has joined #archiveteam [05:31] *** swebb sets mode: +o ersi [05:33] *** fenn has joined #archiveteam [05:49] *** PepsiMax has joined #archiveteam [05:54] *** SketchCow has joined #archiveteam [05:54] *** swebb sets mode: +o SketchCow [05:56] Yay [05:59] *** ivan` has joined #archiveteam [05:59] http://googleappsdeveloper.blogspot.com/2015/08/deprecating-web-hosting-support-in.html [06:22] *** Kazzy has quit IRC (Read error: Operation timed out) [06:23] *** Kazzy has joined #archiveteam [06:43] *** JesseW has quit IRC (Read error: Operation timed out) [07:14] *** lrkj_ has joined #archiveteam [07:15] *** lrkj has quit IRC (Ping timeout: 600 seconds) [07:22] *** atomotic has joined #archiveteam [07:33] *** dashcloud has quit IRC (Quit: No Ping reply in 180 seconds.) [07:33] *** Start has quit IRC (Read error: Connection reset by peer) [07:34] *** Start has joined #archiveteam [07:35] *** dashcloud has joined #archiveteam [08:07] *** VADemon has joined #archiveteam [08:16] *** schbirid has joined #archiveteam [08:19] *** godane has quit IRC (Leaving.) [08:19] *** godane has joined #archiveteam [08:41] *** khaoohs_ has joined #archiveteam [08:43] *** khaoohs has quit IRC (Ping timeout: 306 seconds) [08:58] *** khaoohs has joined #archiveteam [09:02] *** khaoohs_ has quit IRC (Ping timeout: 483 seconds) [09:06] *** vitzli has quit IRC (Quit: Leaving) [09:15] *** Stilett0 has joined #archiveteam [09:17] *** primus104 has joined #archiveteam [09:20] *** Stiletto has quit IRC (Ping timeout: 483 seconds) [09:26] *** primus104 has quit IRC (Leaving.) [09:26] *** boozehoun has quit IRC (Read error: Connection reset by peer) [09:27] *** boozehoun has joined #archiveteam [09:29] *** scyther has joined #archiveteam [09:35] *** BlueMaxim has quit IRC (Quit: Leaving) [09:38] *** MrRadar_ has joined #archiveteam [09:43] *** MrRadar has quit IRC (Ping timeout: 483 seconds) [09:57] *** dan- has quit IRC (Ping timeout: 483 seconds) [09:58] *** vitzli has joined #archiveteam [10:00] *** dan- has joined #archiveteam [10:03] *** primus104 has joined #archiveteam [10:05] *** Ravenloft has quit IRC (Remote host closed the connection) [10:26] *** scyther has quit IRC (Read error: Connection reset by peer) [10:52] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) [11:29] *** xk_id has joined #archiveteam [11:33] *** primus104 has quit IRC (Leaving.) [11:43] *** xk_id has quit IRC (Remote host closed the connection) [12:01] *** atomotic has joined #archiveteam [12:19] *** scyther has joined #archiveteam [12:32] *** SmileyG has quit IRC (Quit: http://www.milkme.co.uk - You'll never understand.) [12:33] *** Smiley has joined #archiveteam [13:26] *** MrRadar_ is now known as MrRadar [13:30] *** xk_id has joined #archiveteam [14:01] *** primus104 has joined #archiveteam [14:37] *** scyther has quit IRC (Read error: Connection reset by peer) [14:38] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) [14:40] *** scyther has joined #archiveteam [14:54] *** scyther has quit IRC (Leaving) [15:16] *** Atom__ has quit IRC (Read error: Connection reset by peer) [15:16] *** Atom__ has joined #archiveteam [15:22] *** PurpleSym has joined #archiveteam [15:28] *** cloudmons has joined #archiveteam [15:41] *** boozehoun has quit IRC (Read error: Connection reset by peer) [15:42] *** boozehoun has joined #archiveteam [15:48] *** Guest100 has joined #archiveteam [15:59] *** JesseW has joined #archiveteam [16:21] *** JesseW has quit IRC (Read error: Operation timed out) [16:40] *** nertzy has joined #archiveteam [16:48] *** Guest100 has quit IRC (Quit: My Mac has gone to sleep. ZZZzzz…) [16:49] *** vitzli has quit IRC (Quit: Leaving) [16:57] *** primus104 has quit IRC (Leaving.) [17:11] *** K4k has joined #archiveteam [17:14] *** K4k_ has joined #archiveteam [17:15] *** K4k has quit IRC (Ping timeout: 258 seconds) [17:19] *** Guest100 has joined #archiveteam [17:27] *** nertzy has quit IRC (Quit: This computer has gone to sleep) [17:30] *** K4k_ has quit IRC (Read error: Operation timed out) [17:34] *** Guest100 has quit IRC (My Mac has gone to sleep. ZZZzzz…) [17:36] *** atomotic has joined #archiveteam [17:36] *** aaaaaaaaa has joined #archiveteam [17:36] *** swebb sets mode: +o aaaaaaaaa [17:52] *** nertzy has joined #archiveteam [17:52] *** Stilett0 is now known as Stiletto [17:56] *** Guest100 has joined #archiveteam [18:00] *** aaaaaaaaa has quit IRC (Read error: Connection reset by peer) [18:01] *** aaaaaaaaa has joined #archiveteam [18:01] *** swebb sets mode: +o aaaaaaaaa [18:01] *** atomotic has quit IRC (Ping timeout: 252 seconds) [18:10] *** khaoohs has quit IRC (Read error: Connection reset by peer) [18:10] *** khaoohs has joined #archiveteam [18:15] *** nertzy has quit IRC (Quit: This computer has gone to sleep) [18:17] *** K4k_ has joined #archiveteam [18:18] *** Start-mob has joined #archiveteam [18:18] *** K4k_ has quit IRC (Client Quit) [18:20] *** Start-mob has quit IRC (Remote host closed the connection) [18:23] Hi all, got a server ready again. What project needs the most attention [18:24] currently no project are running [18:24] *** primus104 has joined #archiveteam [18:24] some projects will be running very soon though [18:24] Ok. Are we talking next 24 hours soon? [18:24] We're talking asap soon [18:24] Ok. What have we got? [18:25] Google Code upcoming, frontback and blingee [18:26] Chan for google code? [18:26] #googlecodeblue [18:28] Ok. I will wait for google code [18:34] SketchCow: ok the gzip test was fine, it's all there, ready for you to do whatever. [18:40] *** aaaaaaaa_ has joined #archiveteam [18:40] *** swebb sets mode: +o aaaaaaaa_ [18:46] *** aaaaaaaaa has quit IRC (Ping timeout: 600 seconds) [18:46] *** aaaaaaaa_ is now known as aaaaaaaaa [18:47] *** atomotic has joined #archiveteam [18:51] *** primus104 has quit IRC (Leaving.) [18:51] *** Stiletto has quit IRC (Read error: Connection reset by peer) [18:51] *** Stiletto has joined #archiveteam [18:53] *** scyther has joined #archiveteam [19:06] *** aaaaaaaaa has quit IRC (Ping timeout: 600 seconds) [19:11] *** schbirid has quit IRC (Quit: Leaving) [19:49] *** habi has joined #archiveteam [19:49] *** habi has left [19:55] *** primus104 has joined #archiveteam [20:03] *** Stiletto has quit IRC (Read error: Connection reset by peer) [20:05] *** Stiletto has joined #archiveteam [20:11] *** Rotlink has joined #archiveteam [20:22] *** Stiletto has quit IRC () [20:25] *** nertzy has joined #archiveteam [20:34] *** Stiletto has joined #archiveteam [20:35] *** MrPenguin has joined #archiveteam [20:36] joepie91: who controls the ArchiveBot instance at http://dashboard.at.ninjawedding.org [20:36] ? [20:37] #archivebot [20:37] is it doing something you don't want to your site? [20:37] ivan`: thanks :) [20:38] ah, yes, archivebot operators asleep at the wheel [20:38] No, I proposed a website to mirror yesterday and I think it may be limited to 2 page views per day unless registered [20:39] Which means the capture may be broken if it only captured the login-wall page or something [20:44] *** PurpleSym has quit IRC (Remote host closed the connection) [20:44] *** Guest100 has quit IRC (Quit: My Mac has gone to sleep. ZZZzzz…) [20:45] *** aaaaaaaaa has joined #archiveteam [20:45] *** swebb sets mode: +o aaaaaaaaa [20:45] it's fine to be asleep at the wheel, it's a self-driving car [20:48] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) [20:50] *** xk_id has quit IRC (Remote host closed the connection) [20:53] *** SimpBrain has joined #archiveteam [21:07] *** nertzy has quit IRC (Quit: This computer has gone to sleep) [21:09] *** xk_id has joined #archiveteam [21:13] *** brayden_ has quit IRC (Read error: Connection reset by peer) [21:27] *** scyther has quit IRC (Leaving) [21:40] *** primus104 has quit IRC (Leaving.) [22:07] *** nertzy has joined #archiveteam [22:10] *** MrPenguin has quit IRC (Ping timeout: 240 seconds) [22:28] *** ivan` has left [22:35] *** nertzy has quit IRC (Quit: This computer has gone to sleep) [22:48] *** xk_id has quit IRC (Remote host closed the connection) [22:49] *** xk_id has joined #archiveteam [22:59] *** xk_id has quit IRC (Read error: Connection reset by peer) [23:00] *** wvdp has joined #archiveteam [23:00] *** MichaelOH has joined #archiveteam [23:01] what is the best tool for testing if a warc is valid? [23:02] *** MichaelOH has quit IRC (Client Quit) [23:02] i tried the internet archive warc library @ https://github.com/internetarchive/warc [23:02] *** MrPenguin has joined #archiveteam [23:03] but it only says that i'm wrong and not where i'm wrong [23:04] *** primus104 has joined #archiveteam [23:07] *** xmc changes topic to: Archive Team: We're not archive.org | http://archiveteam.org/ | lengthy/off-topic in #archiveteam-bs | 1. Shut up | it's fine to be asleep a [23:07] awww [23:08] *** xmc changes topic to: Archive Team: We're not archive.org | http://archiveteam.org/ | lengthy/off-topic in #archiveteam-bs | 1. Shut up [23:16] *** MrPenguin has quit IRC (Ping timeout: 240 seconds) [23:18] https://github.com/chfoo/warcat has a verify function [23:25] *** Start_ has joined #archiveteam [23:25] *** Start has quit IRC (Read error: Connection reset by peer)