[00:11] *** primus104 has quit IRC (Leaving.)
[00:12] *** mistym_ has joined #archiveteam-bs
[00:13] *** mistym_ has quit IRC (Client Quit)
[00:35] *** erbylnt has joined #archiveteam-bs
[00:39] *** mistym_ has joined #archiveteam-bs
[00:44] OH MY GOD I am an idiot
[00:44] I was returning a reference to a temporary object in C++
[00:44] and was wondering "why the hell am I getting these huge floats"
[00:45] this has occupied about two days' worth of bughunting
[00:45] normally gcc/clang warn about this sort of thing. wonder why I didn't see the warning in this case
[01:18] *** yipdw_ is now known as yipdw
[01:21] *** pir^2 has joined #archiveteam-bs
[01:21] I suspect it'd be metadata, given that even in 2011 a crawl of YouTube would have been super-gigantic.
[01:22] Well it's obviously not all or even more than like 1% of YT
[01:23] the arc.gz here is tiny so it's not videos https://archive.org/download/IA-YOUTUBE-000-20070113002952-00000-crawling02.us.archive.org
[01:23] It'll be interesting to see what happens when YouTube goes away. I highly doubt anyone has a copy, and even the most dedicated of scraping efforts would generally be quickly outstripped by uploads.
[01:23] some of it is likely video
[01:23] https://archive.org/download/YTV-20111101093915-crawl446
[01:23] Obviously Google probably has backups, but eventually YouTube or Google is likely to cease existing.
[01:24] when YouTube goes away we're going to lose a lot of shit
[01:24] Could be that crawl446 was just lots of metadata/comments, though?
[01:25] could be
[01:25] kenji@archive.org may be willing to answer questions about the dump organization
[01:25] Maybe compare size of CDX vs ARC?
[01:26] CDX is the index
[01:26] I know.
[01:26] comparing sizes of the two doesn't tell you much
[01:27] I would expect videos to have lower CDX but higher ARC, but I have no idea how many URLs each video is
[01:27] right
[01:27] comparing sizes of the two doesn't tell you much :)
[01:30] Why aren't people at least saving videos with >10K views? Or are they?
[01:30] or another arbitrary minimum
[01:31] Oh boy, debate
[01:31] Tell me more
[01:33] You work at IA. Do you have any idea whether the YouTube crawls a metadata or content?
[01:33] ^ SketchCow, also s/a metadata/are metadata/
[01:35] and please forgive me for my n00b-ish questions
[01:39] *** wyatt8750 has joined #archiveteam-bs
[01:41] wyatt8750 you mean like this? https://web.archive.org/web/20080102194449/http://youtube.com/
[01:43] 1. We grab Youtube
[01:43] 2. We grab youtube as best we can.
[01:43] 3. Asking anybody but me in this channel is guesswork
[01:43] (About youtube)
[01:43] yeah, pir^2, those were the days
[01:44] SketchCow: thanks
[01:44] though pre-october 2006 was truly before the empire
[01:44] Oh, you meant empire=Google
[01:47] SketchCow: someone told me I should ask you to put https://archive.org/details/debatesoireachtasie-XML in the ArchiveTeam collection
[01:48] if possible https://archive.org/details/pdp10nocrew too
[01:49] Both are in.
[01:49] Thank you!
[01:54] *** erbylnt has quit IRC (Read error: Connection reset by peer)
[02:12] *** pir^2 has quit IRC (-)
[02:47] *** mistym_ has quit IRC (Remote host closed the connection)
[03:12] *** mistym_ has joined #archiveteam-bs
[03:37] *** chfoo has quit IRC (Remote host closed the connection)
[03:40] *** chfoo has joined #archiveteam-bs
[04:03] *** Ravenloft has joined #archiveteam-bs
[04:10] *** vitzli has joined #archiveteam-bs
[04:14] *** vitzli has quit IRC (Client Quit)
[04:19] *** aaaaaaaaa has quit IRC (Leaving)
[05:17] *** BlueMaxim has quit IRC (Read error: Connection reset by peer)
[05:18] *** Control-S has joined #archiveteam-bs
[05:19] *** BlueMaxim has joined #archiveteam-bs
[05:24] *** Ctrl-S has quit IRC (Read error: Operation timed out)
[05:24] *** Control-S is now known as Ctrl-S
[05:39] *** SN4T14_ has quit IRC (Ping timeout: 306 seconds)
[05:45] *** SN4T14 has joined #archiveteam-bs
[06:51] *** mistym_ has quit IRC (Remote host closed the connection)
[07:13] *** primus104 has joined #archiveteam-bs
[07:50] *** primus104 has quit IRC (Leaving.)
[07:53] *** schbirid has joined #archiveteam-bs
[09:00] i'm converting the Flight Magazine pdf pages into png files
[09:00] that way i can make cbz files of it
[09:55] *** primus104 has joined #archiveteam-bs
[10:02] *** wm_ has quit IRC (Ping timeout: 240 seconds)
[10:05] *** wm_ has joined #archiveteam-bs
[10:41] *** BlueMaxim has quit IRC (Quit: Leaving)
[11:06] *** anomie has quit IRC (Read error: Operation timed out)
[11:20] *** anomie has joined #archiveteam-bs
[12:50] *** sankin has joined #archiveteam-bs
[13:01] *** erbylnt has joined #archiveteam-bs
[13:26] *** Ravenloft has quit IRC (Ping timeout: 265 seconds)
[13:41] *** primus104 has quit IRC (Leaving.)
[14:54] *** mistym_ has joined #archiveteam-bs
[14:54] *** mistym_ has quit IRC (Remote host closed the connection)
[15:07] *** mistym_ has joined #archiveteam-bs
[15:14] Nravo
[15:14] Bravo
[15:42] --
[15:42] The hacker, using the alias NinjaDoge24, analyzed the NQ Vault app, which supposedly encrypts files on smartphones and other gadgets. Ninja claims the software uses only XOR (exclusive or) and a single-byte key to scramble the first 128 bytes of a .PNG test subject.
[15:42] [...]
[15:42] The company behind the app stands by its product, calling its security "appropriate," and claiming that messages, chats, call logs and contact information is encrypted using AES with a 128-bit key – but that list doesn't include pics and vids.
[15:42] "Image and video files are stored in a format not readily readable by other applications and can only be viewed in Vault after entering the correct password on the device," the company said in a statement.
[15:42] "These standards are appropriate for the consumer use cases this application is meant for."
[15:42] --
[15:42] * joepie91_ slow clap
[15:43] translation: it's good enough to keep your partner from finding your porn stash
[15:45] *** primus104 has joined #archiveteam-bs
[15:46] *** erbylnt has quit IRC (Ping timeout: 370 seconds)
[15:48] *** primus105 has joined #archiveteam-bs
[15:53] *** primus104 has quit IRC (Read error: Operation timed out)
[16:02] lol
[16:03] *** mistym_ has quit IRC (Remote host closed the connection)
[16:23] *** Start-mob has joined #archiveteam-bs
[16:35] *** erbylnt has joined #archiveteam-bs
[16:40] *** Start has quit IRC (Disconnected.)
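Editor's note on the 15:42 quote: the scheme it describes (one key byte XORed over the first 128 bytes of a PNG) can be undone without the password, because every PNG begins with the same fixed 8-byte signature. What follows is a minimal sketch under that description only. It is not NQ Vault's actual code, and every name in it (`xor_prefix`, `recover_key_known_plaintext`, `recover_key_brute_force`, the `0x5A` test key) is hypothetical.

```python
# Sketch only: models the single-byte XOR scheme as described in the quoted
# article. It is not taken from the app; all names here are hypothetical.

# Fixed 8-byte signature that starts every PNG file.
PNG_SIGNATURE = bytes([0x89, 0x50, 0x4E, 0x47, 0x0D, 0x0A, 0x1A, 0x0A])
SCRAMBLED_PREFIX_LEN = 128  # per the article, only the first 128 bytes are XORed


def xor_prefix(data: bytes, key: int, length: int = SCRAMBLED_PREFIX_LEN) -> bytes:
    """XOR the first `length` bytes with a one-byte key; leave the rest as-is.

    XOR is its own inverse, so the same call scrambles and unscrambles.
    """
    head = bytes(b ^ key for b in data[:length])
    return head + data[length:]


def recover_key_known_plaintext(scrambled: bytes) -> int:
    """Known-plaintext attack: the first plaintext byte of a PNG is always 0x89."""
    return scrambled[0] ^ PNG_SIGNATURE[0]


def recover_key_brute_force(scrambled: bytes) -> list[int]:
    """Try all 256 possible keys; keep those that restore a valid PNG signature."""
    return [
        key
        for key in range(256)
        if xor_prefix(scrambled, key).startswith(PNG_SIGNATURE)
    ]


if __name__ == "__main__":
    fake_png = PNG_SIGNATURE + bytes(range(200))  # stand-in for a real image file
    scrambled = xor_prefix(fake_png, key=0x5A)    # 0x5A is an arbitrary test key

    key = recover_key_known_plaintext(scrambled)
    assert recover_key_brute_force(scrambled) == [key]
    assert xor_prefix(scrambled, key) == fake_png
    print(f"recovered key: {key:#04x}")
```

Either route recovers the key from the scrambled file alone: one XOR against the known first byte, or at most 256 trial decryptions.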
[16:55] *** Start-mob has quit IRC (Remote host closed the connection)
[16:55] *** aaaaaaaaa has joined #archiveteam-bs
[16:55] *** Start-mob has joined #archiveteam-bs
[17:02] *** Start-mob has quit IRC (Ping timeout: 370 seconds)
[18:04] turns out non-enterprise users deserve nothing more than trash
[18:37] but i hate star trek
[18:44] *** Start has joined #archiveteam-bs
[18:45] *** Start has quit IRC (Read error: Connection reset by peer)
[18:45] *** Start has joined #archiveteam-bs
[18:50] wow can't you get better than that in the basic libs?
[18:51] like from encrypt import aes256
[18:53] that reminds me
[18:53] https://news.ycombinator.com/item?id=9333582
[18:55] *** Start has quit IRC (Disconnected.)
[18:55] WUT
[18:55] static keys
[18:56] this is the kind of situation where having someone backdoor the app would actually make it MORE secure
[18:56] what is this even
[18:57] it's military grade encryption, don't hate
[18:59] what millennium tho
[18:59] ancient romans had better shit
[19:01] "Oh hey augustus, make sure to keep this new Caesar cipher codepage away from the actual message when you aren't using it"
[19:03] *** mistym has quit IRC (Quit: Leaving)
[19:07] *** Start has joined #archiveteam-bs
[19:20] *** mistym has joined #archiveteam-bs
[19:25] *** Start has quit IRC (Disconnected.)
[19:29] *** monod has joined #archiveteam-bs
[19:46] *** Start-mob has joined #archiveteam-bs
[19:49] *** SN4T14_ has joined #archiveteam-bs
[19:51] *** mistym has quit IRC (Remote host closed the connection)
[19:53] *** SN4T14 has quit IRC (Ping timeout: 306 seconds)
[19:59] *** Start-mob has quit IRC (Remote host closed the connection)
[20:09] *** Start-mob has joined #archiveteam-bs
[20:12] *** mistym has joined #archiveteam-bs
[20:18] *** dashcloud has quit IRC (Ping timeout: 260 seconds)
[20:18] *** Start-mob has quit IRC (Leaving)
[20:18] *** Start-mob has joined #archiveteam-bs
[20:20] *** dashcloud has joined #archiveteam-bs
[20:32] *** Start has joined #archiveteam-bs
[20:33] *** Start-mob has quit IRC (Leaving)
[20:34] *** Start-mob has joined #archiveteam-bs
[20:36] *** Start-mob has quit IRC (Remote host closed the connection)
[20:54] *** sankin has quit IRC (Leaving.)
[21:06] *** acridAxid has quit IRC (Quit: Quitting)
[21:09] *** acridAxid has joined #archiveteam-bs
[21:09] *** BlueMaxim has joined #archiveteam-bs
[21:12] *** dashcloud has quit IRC (Ping timeout: 260 seconds)
[21:16] *** dashcloud has joined #archiveteam-bs
[21:21] *** Start has quit IRC (Read error: Connection reset by peer)
[21:21] *** Start has joined #archiveteam-bs
[21:23] *** Start has quit IRC (Read error: Connection reset by peer)
[21:23] *** Start has joined #archiveteam-bs
[21:26] *** Start-mob has joined #archiveteam-bs
[21:42] *** Start-mob has quit IRC (Ping timeout: 370 seconds)
[21:46] *** Start_ has joined #archiveteam-bs
[21:46] *** Start has quit IRC (Read error: Connection reset by peer)
[21:51] *** Start_ is now known as Start
[21:52] there could be 256 possible keys, that's just as secure as aes256 right
[21:52] I mean they're both 256
[22:08] *** monod has quit IRC (Ping timeout: 512 seconds)
[22:19] this might be of some interest to folks here: https://www.kickstarter.com/projects/opengoldberg/kimiko-ishizaka-plays-chopin-on-an-1832-pleyel more modern recordings of classical music released under very liberal licenses, and many new photos of the piano used under the same license
[22:21] *** kvieta has quit IRC (Read error: Operation timed out)
[22:23] *** Start has quit IRC (Disconnected.)
[22:27] *** Start-mob has joined #archiveteam-bs
[22:34] *** Start-mob has quit IRC (Remote host closed the connection)
[22:39] *** kvieta has joined #archiveteam-bs
[22:39] *** kvieta has quit IRC (Excess Flood)
[22:55] *** Start has joined #archiveteam-bs
[23:00] *** kvieta has joined #archiveteam-bs
[23:06] *** kvieta has quit IRC (Excess Flood)
[23:08] *** kvieta has joined #archiveteam-bs
[23:08] *** kvieta has quit IRC (Excess Flood)
[23:09] *** Start-mob has joined #archiveteam-bs
[23:09] '[Babel Business Edition] is intended for virtually everybody and anybody who needs to keep their communication safe and "of the radar" from outside threats of eavesdropping and industrial espionage.'
[23:09] maybe "of the radar" is not a typo :P
[23:12] *** wp494_ has joined #archiveteam-bs
[23:15] *** wp494 has quit IRC (Ping timeout: 740 seconds)
[23:30] *** kvieta has joined #archiveteam-bs
[23:42] *** kvieta has quit IRC (Read error: Operation timed out)
[23:57] *** kvieta has joined #archiveteam-bs
[23:58] *** primus has quit IRC (Read error: Operation timed out)