[00:10] *** furrie has quit IRC (Quit: Page closed) [00:17] *** furrie has joined #archiveteam-bs [00:18] yeah, hard work done by others should be archived [00:18] too bad some of it got lost in the rapidly-changing internet [00:19] i wish the archiveteam had a program that could make the lost sites reappear in thin air on to the Wayback Machine, complete. [00:23] i figure we should start archiving educational stuff [00:33] xmc: wow, i'm checking for random sonic stuff in the wayback machine, and it's been all recently added in those days I added all of those sonic sites in the archiveBot. [00:39] *** furrie has quit IRC (Quit: Page closed) [00:39] *** mistym has quit IRC (Remote host closed the connection) [00:51] *** GLaDOS has quit IRC (Read error: Operation timed out) [00:53] *** GLaDOS has joined #archiveteam-bs [00:59] *** GLaDOS has quit IRC (Ping timeout: 252 seconds) [01:08] so someone made a Markov chain with Spring Framework class names [01:08] should repeat that with Core Media function names [01:09] I mean this is a function I am actually reading about now: CMMetadataFormatDescriptionCreateWithMetadataFormatDescriptionAndMetadataSpecifications [01:11] that said I guess it doesn't compare to -[MTLBlitCommandEncoder copyFromTexture:sourceSlice:sourceLevel:sourceOrigin:sourceSize:toBuffer:destinationOffset:destinationBytesPerRow:destinationBytesPerImage:] [01:17] *** dashcloud has quit IRC (Quit: No Ping reply in 180 seconds.) [01:18] *** dashcloud has joined #archiveteam-bs [01:18] *** JesseW has joined #archiveteam-bs [01:33] SketchCow: Still wanting someone to test Internet Arcade on Windows 10 & MS Edge? [01:33] MS Edge is (of course) not the browser I regularly use, but I do have a Windows 10 system right here. :) [01:38] ... Well, I believe it's moot because I currently can't access archive.org. [01:39] *** schbirid has quit IRC (Read error: Operation timed out) [01:44] *** JesseW has quit IRC (Quit: Leaving.) [01:47] *** dashcloud has quit IRC (Read error: Operation timed out) [01:48] *** JesseW has joined #archiveteam-bs [01:50] *** dashcloud has joined #archiveteam-bs [01:52] *** schbirid has joined #archiveteam-bs [02:02] What is the process between WARCs being uploaded to IA and them showing up in the Wayback Machine, anyway? Is it manual, semi-manual, or? What is the typical time scale? Is there an easy way to check? [02:03] it's automatic and should be within a day or so [02:03] easy way to check is http://web.archive.org/*/http://www.example.com/ [02:03] *** yipdw has quit IRC (Remote host closed the connection) [02:05] *** yipdw has joined #archiveteam-bs [02:07] I meant, for a given item (containing a MegaWARC with 1000s of pages) a way to quickly make sure they have been included... [02:07] *** tomwsmf-a has joined #archiveteam-bs [02:08] For that matter, I'm not (yet) sure how to identify the list of pages in a megawarc item... [02:11] you can scan the generated cdx [02:12] *** kyan has joined #archiveteam-bs [02:12] Is archive.org down for other people? https://archive.org/details/archiveteam_zapd is unresponsive for me... [02:13] yep, been that way for awhile [02:13] hm -- not status updates, I presume. [02:13] er, s/not/no/ [02:16] *** kyanz`bot has joined #archiveteam-bs [02:16] works for me [02:16] *** tomwsmf-a has quit IRC (Ping timeout: 258 seconds) [02:17] does for me again [02:17] back [02:17] interesting. [02:17] (works for me, also) [02:17] Yay? [02:18] is there a offsite status page for IA? [02:20] JesseW, I think they sometimes tweet at https://twitter.com/@internetarchive [02:21] kyan: thanks -- checked there, no mention of the current problems [02:21] yeah i didn't see one either :P [02:23] is there a "official" IA IRC channel? [02:23] well, it is 7:20 PDT on a friday [02:23] depending on the problem there may not be any time to tweet [02:24] there's #internetarchive (EFnet) [02:24] but as it says in the topic, it's unofficial [02:25] I see. [02:26] * JesseW really needs to hack my copy of TsLogBot.py to support changing the list of channels logged without a hard reboot... [02:28] So, about CDX files -- what is the .cdx.idx (as opposed to .cdx.gz)? The _files.xml file says it is a "Item CDX Meta-Index"... [02:29] *** primus104 has quit IRC (Leaving.) [02:29] I'm looking at https://archive.org/download/archiveteam_zapd_20131029051118 [02:30] you want the megawarc CDX [02:31] e.g [02:31] curl -sL 'https://archive.org/download/archiveteam_zapd_20131016071259/zapd_20131016071259.megawarc.warc.os.cdx.gz' | gunzip -c | cut -f3 -d' ' [02:31] that'll list all URLs, in URL order, in that megawarc [02:33] nice. [02:33] * JesseW goes to add that to the wiki... [02:34] that said there are better ways [02:35] Wayback, for example [02:36] the os.cdx.gz IIRC is a megawarc-specific thing [02:37] but that pipeline should work for any WARC CDX you find [02:38] the rest of the cdx record is useful as well; for example, if you want to know what WARC to look in, you can get the filename, offset, and size from the record also [02:38] in the case of megawarc you will get the original WARC name, offset, and size [02:49] added to http://archiveteam.org/index.php?title=The_WARC_Ecosystem#CDX_File_Format [02:53] *** JesseW1 has joined #archiveteam-bs [02:55] *** JesseW has quit IRC (Read error: Operation timed out) [03:02] *** GLaDOS has joined #archiveteam-bs [03:22] *** GLaDOS has quit IRC (Ping timeout: 252 seconds) [03:25] *** GLaDOS has joined #archiveteam-bs [04:06] *** kyan has quit IRC (Read error: Connection reset by peer) [04:06] *** kyan has joined #archiveteam-bs [04:22] *** mistym has joined #archiveteam-bs [04:25] *** kyanz`bot has quit IRC (Ping timeout: 1221 seconds) [04:30] *** aaaaaaaaa has quit IRC (Leaving) [05:06] *** dashcloud has quit IRC (Read error: Connection reset by peer) [05:06] *** dashcloud has joined #archiveteam-bs [05:23] *** xtr-201 has quit IRC (Read error: Operation timed out) [05:28] *** BlueMaxim has quit IRC (Ping timeout: 306 seconds) [05:28] *** BlueMaxim has joined #archiveteam-bs [05:30] *** dashcloud has quit IRC (Quit: No Ping reply in 210 seconds.) [05:35] *** dashcloud has joined #archiveteam-bs [05:44] *** ewrerwt has joined #archiveteam-bs [05:44] *** ewrerwt is now known as skrp [05:44] *** skrp is now known as rejk [05:51] http://imagebin.ca/v/29uYWbJ6ijEn [05:51] freebsd on all three file servers: ntfs ufs zfs [05:54] 100TB archive of traffic information. funny things happen in internet traffic. you get traffic like 56.2GB The Master Collection of How To Date Women. [05:54] *** mistym has quit IRC (Remote host closed the connection) [06:15] *** mistym has joined #archiveteam-bs [06:32] *** Sanqui is now known as Sanqui|go [06:58] *** mistym has quit IRC (Remote host closed the connection) [07:00] *** mistym has joined #archiveteam-bs [07:17] *** dashcloud has quit IRC (Read error: Operation timed out) [07:33] *** dashcloud has joined #archiveteam-bs [07:51] *** JesseW1 has quit IRC (Quit: Leaving.) [08:12] *** rejk has quit IRC (Remote host closed the connection) [08:17] *** mistym has quit IRC (Remote host closed the connection) [08:26] *** mistym has joined #archiveteam-bs [08:30] *** mistym has quit IRC (Remote host closed the connection) [08:31] do you guys archive the various linux distros, microsoft patches/releases/etc and other things like that? [08:33] *** mistym has joined #archiveteam-bs [08:42] *** mistym has quit IRC (Remote host closed the connection) [08:51] *** dashcloud has quit IRC (Read error: Operation timed out) [08:58] *** dashcloud has joined #archiveteam-bs [09:21] *** primus104 has joined #archiveteam-bs [09:41] *** BlueMaxim has quit IRC (Quit: Leaving) [09:42] *** mistym has joined #archiveteam-bs [09:44] *** BlueMaxim has joined #archiveteam-bs [09:46] *** dashcloud has quit IRC (Read error: Operation timed out) [09:49] *** mistym has quit IRC (Read error: Operation timed out) [09:50] *** dashcloud has joined #archiveteam-bs [10:08] *** dashcloud has quit IRC (Read error: Operation timed out) [10:14] *** dashcloud has joined #archiveteam-bs [10:26] Ctrl-S: not on a regular basis, I think [10:26] but occasionally, yes [10:26] you're welcome to change that, ofc ;) [10:28] it seems like the kind of thing thaw would be essential to keep archived somewhere [10:28] *** joepie91_ is now known as joepie91 [10:48] *** BlueMaxim has quit IRC (Quit: Leaving) [12:32] *** vitzli has joined #archiveteam-bs [12:33] *** schbirid has quit IRC (Read error: Operation timed out) [12:45] *** schbirid has joined #archiveteam-bs [12:56] *** Ravenloft has quit IRC (Remote host closed the connection) [13:25] *** godane has joined #archiveteam-bs [13:27] i'm back [13:27] i'm on my new system [13:28] i found a work around to boot into my ext4 partition even when UEFI doesn't see it [13:39] *** tomwsmf-a has joined #archiveteam-bs [13:54] *** godane1 has joined #archiveteam-bs [13:55] *** godane has quit IRC (Read error: Operation timed out) [13:56] *** xtr-201 has joined #archiveteam-bs [14:09] *** tomwsmf-a has quit IRC (Read error: Operation timed out) [14:15] *** xtr-201 has quit IRC (Read error: Operation timed out) [14:16] *** dashcloud has quit IRC (Read error: Operation timed out) [14:31] *** xtr-201 has joined #archiveteam-bs [14:31] *** dashcloud has joined #archiveteam-bs [14:33] *** godane1 has quit IRC (Ping timeout: 306 seconds) [14:40] *** tomwsmf-a has joined #archiveteam-bs [14:42] *** godane has joined #archiveteam-bs [14:56] *** kyan has quit IRC (Quit: This computer has gone to sleep) [14:57] *** dashcloud has quit IRC (Read error: Operation timed out) [15:03] *** dashcloud has joined #archiveteam-bs [15:04] *** primus104 has quit IRC (Leaving.) [15:09] *** SadDM has quit IRC (Ping timeout: 483 seconds) [15:12] *** SadDM has joined #archiveteam-bs [15:20] *** tomwsmf-a has quit IRC (Ping timeout: 258 seconds) [15:30] *** SimpBrain has joined #archiveteam-bs [15:36] *** dashcloud has quit IRC (Read error: Connection reset by peer) [15:40] *** dashcloud has joined #archiveteam-bs [15:56] *** tomwsmf-a has joined #archiveteam-bs [16:03] *** SimpBrain has quit IRC (Quit: Leaving) [16:04] *** tomwsmf-a has quit IRC (Read error: Operation timed out) [16:06] *** SimpBrain has joined #archiveteam-bs [16:24] *** JesseW has joined #archiveteam-bs [16:27] *** JesseW has left [16:29] *** Kirk has joined #archiveteam-bs [16:29] *** raylee has joined #archiveteam-bs [16:29] *** wm_ has joined #archiveteam-bs [16:36] *** raylee is now known as Rye [16:43] *** mistym has joined #archiveteam-bs [16:45] *** vitzli has quit IRC (Quit: Leaving) [17:18] *** godane has quit IRC (Quit: Leaving.) [17:20] *** Ravenloft has joined #archiveteam-bs [17:34] *** aaaaaaaaa has joined #archiveteam-bs [17:36] *** dashcloud has quit IRC (Ping timeout: 483 seconds) [17:42] *** dashcloud has joined #archiveteam-bs [17:45] *** primus104 has joined #archiveteam-bs [18:03] *** aaaaaaaaa has quit IRC (Leaving) [18:03] *** mistym has quit IRC (Ping timeout: 252 seconds) [18:06] *** godane has joined #archiveteam-bs [18:06] *** mistym has joined #archiveteam-bs [18:07] looks like we could grab speedtest.net [18:08] it goes as far back as march 2007: http://www.speedtest.net/my-result/106000000 [18:10] *** aaaaaaaaa has joined #archiveteam-bs [18:11] thats cool godane :D [18:15] i figure it could be useful data [18:16] we can then look at data by isp over time based on location [18:20] *** primus104 has quit IRC (Read error: Connection reset by peer) [18:26] *** primus104 has joined #archiveteam-bs [19:04] *** mistym has quit IRC (Ping timeout: 252 seconds) [19:07] *** mistym has joined #archiveteam-bs [19:16] *** Start has quit IRC (Quit: Disconnected.) [19:17] *** Start has joined #archiveteam-bs [19:17] *** JesseW has joined #archiveteam-bs [19:47] *** tomwsmf-a has joined #archiveteam-bs [19:51] *** tomwsmf-a has quit IRC (Ping timeout: 258 seconds) [19:55] *** JesseW has quit IRC (Quit: Leaving.) [20:07] *** JesseW has joined #archiveteam-bs [20:11] *** tomwsmf-a has joined #archiveteam-bs [20:13] *** JesseW has quit IRC (Client Quit) [20:16] *** JesseW has joined #archiveteam-bs [20:20] *** mistym_ has joined #archiveteam-bs [20:26] *** mistym has quit IRC (Read error: Operation timed out) [20:37] *** rejk has joined #archiveteam-bs [20:45] *** JesseW has quit IRC (Leaving.) [20:46] Well, looks like Microsoft has in app purchases in the default software: http://www.wired.co.uk/news/archive/2015-07/30/windows-10-paid-ad-removal-solitaire [20:47] no surprising since the xbox one dashboard is covered in ads. fkn bs [21:22] *** SimpBrain has quit IRC (Read error: Connection reset by peer) [21:22] *** SimpBrai1 has joined #archiveteam-bs [21:25] *** SimpBrai1 has quit IRC (Read error: Connection reset by peer) [21:27] *** schbirid has quit IRC (Leaving) [21:31] *** SimpBrain has joined #archiveteam-bs [21:36] *** RichardG has quit IRC (Read error: Connection reset by peer) [21:38] *** RichardG has joined #archiveteam-bs [21:43] *** tomwsmf-a has quit IRC (Read error: Operation timed out) [21:52] *** mistym has joined #archiveteam-bs [21:57] *** mistym__ has joined #archiveteam-bs [21:57] *** mistym_ has quit IRC (Ping timeout: 483 seconds) [21:59] *** mistym has quit IRC (Read error: Operation timed out) [22:02] *** mistym has joined #archiveteam-bs [22:07] *** mistym__ has quit IRC (Ping timeout: 606 seconds) [22:10] *** primus104 has quit IRC (Leaving.) [22:14] *** tomwsmf-a has joined #archiveteam-bs [22:27] *** lexicon has joined #archiveteam-bs [22:27] *** lexicon is now known as lexiconda [22:27] *** lexiconda is now known as lexicon [22:40] *** mistym has quit IRC (Remote host closed the connection) [22:43] aaaaaaaaa: apparently that's not new- it's been there since windows 8, but since no one used that, no one noticed until now [23:17] *** tomwsmf-a has quit IRC (Read error: Operation timed out) [23:21] *** SimpBrain has quit IRC (Leaving) [23:41] Right [23:41] http://fffff.at/ [23:41] *** mistym has joined #archiveteam-bs [23:48] *** tomwsmf-a has joined #archiveteam-bs [23:52] *** mistym has quit IRC (Ping timeout: 606 seconds)