[00:12] *** tomwsmf-a has quit IRC (Read error: Operation timed out) [00:26] *** BlueMaxim has quit IRC (Read error: Operation timed out) [00:27] *** BlueMaxim has joined #archiveteam [00:43] *** BlueMaxim has quit IRC (Read error: Operation timed out) [00:45] *** BlueMaxim has joined #archiveteam [00:56] *** dashcloud has quit IRC (Read error: Operation timed out) [00:59] *** zerkalo has quit IRC (Read error: Connection reset by peer) [01:01] *** zerkalo has joined #archiveteam [01:04] *** dashcloud has joined #archiveteam [01:04] *** philpem_ has quit IRC (Read error: Operation timed out) [01:11] *** zerkalo has quit IRC (Read error: Connection reset by peer) [01:16] *** zerkalo has joined #archiveteam [01:18] joepie91: conference talks: https://hackerhotel.sigio.nl/ on it. [02:06] *** SN4T14 has quit IRC (Read error: Operation timed out) [02:10] *** bwn_ has quit IRC (Ping timeout: 250 seconds) [02:32] *** JesseW has quit IRC (Ping timeout: 370 seconds) [02:45] *** xXx_ndidd has joined #archiveteam [02:50] *** ndizzle has joined #archiveteam [02:57] *** ndiddy has quit IRC (Read error: Operation timed out) [02:59] *** ndiddy has joined #archiveteam [03:03] *** xXx_ndidd has quit IRC (Read error: Operation timed out) [03:03] *** JesseW has joined #archiveteam [03:05] *** xXx_ndidd has joined #archiveteam [03:07] *** ndizzle has quit IRC (Read error: Operation timed out) [03:11] *** ndiddy has quit IRC (Ping timeout: 492 seconds) [03:26] *** yakfish has quit IRC (Read error: Operation timed out) [03:26] *** Gfy has quit IRC (Read error: Operation timed out) [03:27] *** Coderjoe has quit IRC (Read error: Operation timed out) [03:27] *** brabo has quit IRC (Read error: Operation timed out) [03:27] *** SadDM has quit IRC (Read error: Operation timed out) [03:27] *** Atom__ has quit IRC (Read error: Operation timed out) [03:28] *** matthusb- has quit IRC (Ping timeout: 246 seconds) [03:28] *** Fusl has quit IRC (Ping timeout: 246 seconds) [03:29] *** vOYtEC_ has 
joined #archiveteam [03:30] *** Fusl has joined #archiveteam [03:31] *** Gfy has joined #archiveteam [03:31] *** Infreq_ has joined #archiveteam [03:31] *** lukeman_ has joined #archiveteam [03:32] *** jspiros has quit IRC (Read error: Operation timed out) [03:33] *** no2pencil has quit IRC (hub.efnet.us irc.Prison.NET) [03:33] *** Infreq has quit IRC (hub.efnet.us irc.Prison.NET) [03:33] *** lukeman has quit IRC (hub.efnet.us irc.Prison.NET) [03:33] *** bai has quit IRC (hub.efnet.us irc.Prison.NET) [03:33] *** SketchCo1 has quit IRC (hub.efnet.us irc.Prison.NET) [03:33] *** yipdw has quit IRC (hub.efnet.us irc.Prison.NET) [03:33] *** vOYtEC has quit IRC (hub.efnet.us irc.Prison.NET) [03:33] *** wyatt8740 has quit IRC (hub.efnet.us irc.Prison.NET) [03:33] *** achip has quit IRC (hub.efnet.us irc.Prison.NET) [03:33] *** db48x has quit IRC (hub.efnet.us irc.Prison.NET) [03:36] !a http://dev.iqeye.com/ [03:37] Oooooops [03:37] *** bai_ has joined #archiveteam [03:37] *** no2penci1 has joined #archiveteam [03:39] *** wyatt8740 has joined #archiveteam [03:40] *** xXx_ndidd has quit IRC (Read error: Operation timed out) [03:40] *** yipdw_ has joined #archiveteam [03:45] *** Coderjoe has joined #archiveteam [03:51] *** achip has joined #archiveteam [04:02] *** ariscop has quit IRC (Leaving) [04:03] *** bwn_ has joined #archiveteam [04:05] *** ariscop has joined #archiveteam [04:11] *** Sk1d has quit IRC (Ping timeout: 194 seconds) [04:15] *** wp494_ has quit IRC (Read error: Connection reset by peer) [04:17] *** Sk1d has joined #archiveteam [04:23] *** ariscop has quit IRC (Quit: Leaving) [04:24] *** ariscop has joined #archiveteam [05:04] *** arkhive has joined #archiveteam [05:06] *** wp494 has joined #archiveteam [05:35] *** SirCmpwn has quit IRC (Read error: Operation timed out) [05:47] *** SirCmpwn has joined #archiveteam [05:57] *** bai_ is now known as bai [06:01] *** ariscop_ has joined #archiveteam [06:06] *** ariscop has quit IRC (Read error: Operation timed 
out) [06:08] *** Fletcher has quit IRC (Ping timeout: 244 seconds) [06:26] *** schbirid has joined #archiveteam [06:28] *** Fletcher has joined #archiveteam [06:31] http://www.gtfs-data-exchange.com/ [06:33] http://185.100.87.84/ :( [06:35] *** ariscop_ has quit IRC (Quit: Leaving) [06:39] *** bwn_ has quit IRC (Read error: Operation timed out) [07:03] *** WinterFox has joined #archiveteam [07:07] *** bwn_ has joined #archiveteam [07:24] *** JesseW has quit IRC (Ping timeout: 370 seconds) [07:24] *** achip has quit IRC (Read error: Operation timed out) [07:25] *** schbirid has quit IRC (hub.efnet.us irc.Prison.NET) [07:25] *** wyatt8740 has quit IRC (hub.efnet.us irc.Prison.NET) [07:30] *** schbirid2 has joined #archiveteam [07:31] *** Honno has joined #archiveteam [07:31] *** Honno has quit IRC (Client Quit) [07:33] *** Honno has joined #archiveteam [07:33] *** achip has joined #archiveteam [07:37] *** atomotic has joined #archiveteam [07:41] *** wyatt8740 has joined #archiveteam [07:45] *** db48x has joined #archiveteam [08:00] *** metalcamp has joined #archiveteam [08:23] *** ariscop has joined #archiveteam [08:35] *** vtyl has joined #archiveteam [08:39] *** lytv has quit IRC (Read error: Operation timed out) [08:57] *** vtyl has quit IRC (Ping timeout: 260 seconds) [08:59] *** Atom__ has joined #archiveteam [09:00] *** lytv has joined #archiveteam [09:09] schbirid2: fuuuuck [09:09] schbirid2: hey do you remember when atomicgamer was going down and you were involved in archiving stuff? 
[09:09] schbirid2: back then we got archivebot to crawl teamtnt.com which was at-risk; it has now gone [09:10] schbirid2: I never followed whether the WARC got injected into archive.org or whatever [10:01] *** ariscop has quit IRC (Leaving) [10:04] you can check that on the dashboard Jon [10:04] midas: OK thanks, I'll take a look [10:30] *** ariscop has joined #archiveteam [10:51] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) [11:03] *** BlueMaxim has quit IRC (Leaving) [12:21] *** lytv has quit IRC (Read error: Operation timed out) [12:27] *** bentpins has quit IRC (Read error: Operation timed out) [12:35] *** metalcamp has quit IRC (Ping timeout: 244 seconds) [12:42] *** atomotic has joined #archiveteam [12:49] *** hictooth has quit IRC (http://www.kiwiirc.com/ - A hand crafted IRC client) [12:55] *** WinterFox has quit IRC (Remote host closed the connection) [13:04] *** VADemon has joined #archiveteam [13:40] *** SketchCow has joined #archiveteam [13:43] *** Honno has quit IRC (Read error: Connection reset by peer) [13:45] I'm back baby [13:47] welcome back [13:47] welcome back! [13:48] Welcome Back! [13:49] Welcome back. [13:49] *** Honno has joined #archiveteam [13:49] SketchCow: btw i have less than 2 years of KPFA to grab now [13:52] Welcome back. [13:59] *** vitzli has joined #archiveteam [14:09] *** Start has quit IRC (Quit: Disconnected.) 
[14:48] *** Start has joined #archiveteam [14:57] *** Asparagir has quit IRC (Quit: Asparagir) [14:59] *** no2penci1 is now known as no2pencil [15:17] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) [15:23] *** ploopkazo has quit IRC (Quit: ZNC - 1.6.0 - http://znc.in) [15:25] *** ploopkazo has joined #archiveteam [15:32] *** ploopkazo has quit IRC (Quit: ZNC - 1.6.0 - http://znc.in) [15:32] *** ploop has joined #archiveteam [15:41] *** jut has joined #archiveteam [15:43] *** Start has quit IRC (Read error: Connection reset by peer) [15:43] *** Start has joined #archiveteam [15:50] *** atomotic has joined #archiveteam [15:54] *** lytv has joined #archiveteam [16:06] *** Start has quit IRC (Quit: Disconnected.) [16:10] *** JesseW has joined #archiveteam [16:19] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) [16:27] Excellent [16:28] *** metalcamp has joined #archiveteam [16:31] *** JesseW has quit IRC (Ping timeout: 370 seconds) [16:32] So, Mark (Wayback) is interested in getting a better census of archiveteam uploads. [16:32] so that's on my mind this week [16:32] A census? [16:36] How many uploads there are, and how many children to each upload family? [16:43] Anyone dumped this torrent into IA yet? http://185.100.87.84/ [16:47] Can't I just feed the torrent file in as an upload, then the IA does the rest? [16:47] Yes. [16:48] yes, it doesn't seed further, but it downloads the torrent [16:54] *** vitzli has quit IRC (Quit: Leaving) [17:06] maybe doxxing millions is not a good thing to do? [17:06] *** xmc sets mode: +oo swebb SketchCow [17:06] *** swebb sets mode: +o DFJustin [17:06] *** swebb sets mode: +o SketchCow [17:06] *** swebb sets mode: +o balrog [17:13] Let me get the file. [17:13] DFJustin: True. I’d rather have that PanamaPapers leak. [17:14] I'll get the Turkey thing and dark it, and talk to our people. [17:14] OK? [17:14] PurpleSym HCross ok? [17:14] Yes, please dark it. 
[17:15] So, DO we have access to the Panama Papers data? I said I'd try to find it. [17:16] It's not fully online yet [17:16] I don’t think so. Unless someone leaks the leak. [17:16] Some 150 documents I think [17:16] I don't think the whole thing will be online anytime soon [17:17] wikileaks themselves tweeted they were unsure whether to publish all for everyone [17:17] or just let press have access [17:18] Yeah, this is probably going to be handled comparably to the Snowden archive. Handled by journalists exclusively, with the entire document repo being released maybe 10-30 years after the fact. [17:19] *** philpem_ has joined #archiveteam [17:19] this opens up the possibility of a leak leak [17:19] wikileakleaks.com [17:19] :P [17:19] lol [17:20] SketchCow, ok [17:20] sorry -BS alert on me [17:22] IA wants a copy so if ANY copy leaks, grab it. [17:22] And if there are subsets, let me know [17:25] Turkish database acquired. The torrent.... was quick. [17:31] *** dxrt- has quit IRC (Read error: Operation timed out) [17:34] It's been uploaded to IA, and then darked. [17:34] OK, quick help, please. [17:35] *** bsmith094 has quit IRC (http://www.kiwiirc.com/ - A hand crafted IRC client) [17:35] Help me find any archiveteam uploads that are NOT in archiveteam or in an archiveteam collection. [17:35] I'm going to quickly make sure all archiveteam* collections are under archiveteam [17:36] SketchCow: grab requested by schbirid2 https://archive.org/details/demos.igmdb.org [17:36] I really don't like that we have stuff falling out of band like this. [17:37] We should really cook up archivebot to be able to do a hard-cannon approach when needed. [17:37] Some "mid-range" [17:37] I mean, I see this one was a 40gb-or-so download, so a pure archivebot grab would have been annoying. [17:37] But doesn't archivebot have the capability for a straight shot like this? 
[17:45] https://archive.org/search.php?query=archiveteam+-collection%3Aarchiveteam [17:46] Wow, what a fuckin' mess, thanks DFJustin [17:48] some of that seems to be subcollections that are in fact in archiveteam but it hasn't propagated that to the individual items for whatever reason [17:50] So, to be clear [17:50] IA now has a guy (Mark Graham) who is the product manager for Wayback. Part of that is him staying on top of data going into the Wayback. [17:50] he does it with spreadsheets. So he has scripts that run queries, and then he knows what's been "added" this week. [17:51] When we jam things into this sidehatch for any reason, then all the numbers are broken. [17:51] I get we're routing around shitty network bandwidth, but people shoving things into the open collections and then wondering why stuff is missing, that's why. [17:52] FalconK: ping, ^ [17:53] I know in the past I've seen copycat archivers uploading stuff into community collections with archive team keywords without having been in here to receive marching orders [17:53] fuck you, all in archive team, and all that [17:54] Well, today I am cleaning it all up, while I still can't feel my legs [17:54] This was a lot of manuals [17:56] So, good news, there's no collections for archiveteam* that are not in archiveteam. [17:56] at the phone museum this weekend we took custody of a large personal collection, so i know the legs feeling... [17:56] Nice [17:56] I got to get out there [17:56] I guess you could call it....manual labour [17:56] Do a round of interviews [17:57] Record things [17:57] started inventorying our "attic" http://spmh.us/c:attic [17:57] A colon in a url? What are you [17:57] WHAT ARE YOU [17:57] i am a monster [17:57] why [17:58] You're a monster that monsters tell other monsters to scare them [17:58] :D [17:59] *** SimpBrain has joined #archiveteam [18:03] So, I'm going to try and force no-destroy derives on the archiveteam things so that the archiveteam collection listing is right. 
[18:04] I know some of those words. [18:04] You know how much more beneficial you are to the world when you ask questions instead of declaring your ignorance? [18:05] I can only imagine. :-/ [18:05] But in seriousness: I think I can deduce. Something, something, merge. [18:05] https://archive.org/search.php?query=archiveteam%20-collection%3Aarchiveteam%2A is better. If there's subcollections they get through. [18:06] So only (?) 3,400 misplaced items. [18:06] zino: derives can optionally delete derived data first, so a no-delete derive (the default iirc) doesn't do that [18:07] delete derives are dangerous because when you get the archive to download from bittorrent then the downloaded file counts as a derived file, because it comes from the derivation process [18:07] xmc: OK. Thanks. [18:07] you're welcome, any time [18:08] \m/ breaking the cycle of snark and badfeelings \m/ [18:11] \o/ (will stop now. I feel the OT ban hammer hovering over my head.) [18:13] *** scyther has joined #archiveteam [18:13] *** scyther has quit IRC (Connection closed) [18:17] But... the cycle [18:17] * SketchCow cries over the cycle [18:17] How are we supposed to get supervillains now [18:17] arkiver: You're the source of a lot of these "not in archiveteam" items. [18:17] I know why you did it. Let's do better going forward. [18:17] You have access, etc. [18:21] *** Start has joined #archiveteam [18:23] Can you give me an example item? [18:24] SketchCow ^ [18:24] I'm... right here [18:24] https://archive.org/details/archiveteam_ftp_items_2015120601 [18:24] They aren't web items, they contain the lists with FTP files for the FTP grab [18:25] A list is fetched from there and the URLs in the list are grabbed by the FTP project [18:25] So they join the project's collection. [18:25] Yes [18:25] Yes! [18:25] So they're a manifest. They should be in archiveteam_ftp. I'm putting them there. 
[18:25] I uploaded them before I had access to the collections, that's why they're not in the collection archiveteam_ftp [18:26] Literally what I wrote 10 lines up [18:26] 14:17 <@SketchCow> I know why you did it. Let's do better going forward. Infreq_ [18:26] 14:17 <@SketchCow> You have access, etc. [18:26] arkiver: ^ [18:26] Will be in archiveteam_ftp next time [18:27] See? We are violently agreeing. [18:30] I also remember yipdw uploaded some items outside of the archiveteam collection [18:30] I'm going to find them all! [18:30] Yep [18:30] find all the mutants [18:30] https://archive.org/search.php?query=Archive%20Team%20Docstoc%20Dry%20Dock [18:30] Don't give me suggestions yet [18:30] A lot are still in opensource [18:30] I'm still trying to murder https://archive.org/search.php?query=archiveteam%20-collection%3Aarchiveteam%2A%20-collection%3Agithub%2A [18:31] Everything's going under the lamp [18:31] Ok! Let me know if you want suggestions [18:31] Hey cfhoo, sorry, is it possible to archive the profile pages of the last few games of the GameMaker Sandbox project? [18:31] I have no clue in making archives myself, we don't have much time, 4 days jee [18:31] Just 49 pages tho [18:32] *chfoo [18:32] o it's like, ch, foo, ahhhh [18:35] https://archive.org/search.php?query=archiveteam%20-collection%3Aarchiveteam%2A%20-collection%3Agithub%2A%20-identifier%3Afav%2A%20-collection%3Alinux_dist%2A [18:35] Ot [18:36] It's getting longer, but that's because I'm discovering alternate collections with Archive Team in the name [18:36] Honno: i already did, it's at https://archive.org/details/archiveteam_archivebot_go_20160325020001 [18:36] chfoo, ooo, thanks! [18:37] Wowzers what do I need to look for here chfoo sorry? [18:38] chfoo: i messed up on the filename, but the id is 3kxjz [18:38] k, ty [18:39] Good show on docstoc, 1,000+ files in the wrong place. [18:39] (Good show on pointing me to them, arkiver) [18:39] again, I know why they're in the wrong place. 
[18:39] *** jut has quit IRC (Read error: Connection reset by peer) [18:40] https://bitlove.org/ [18:44] *** RichardG has quit IRC (Quit: Keyboard not found, press F1 to continue) [18:48] *** RichardG has joined #archiveteam [18:53] So, I'm thinking about a census of "just in archiveteam" and seeing about subcollections [18:54] *** RichardG has quit IRC (Quit: Keyboard not found, press F1 to continue) [18:55] *** Burak has joined #archiveteam [18:57] What does "os" stand for in relation to warc index files? ie warc.os.cdx.gz as opposed to warc.cdx.gz [18:57] *** RichardG has joined #archiveteam [19:08] *** RichardG has quit IRC (Read error: Connection reset by peer) [19:10] *** RichardG has joined #archiveteam [19:39] *** atomotic has joined #archiveteam [19:42] 3000+ views on the aljazeera archive since we uploaded it to IA a few days ago. [19:44] *** Start has quit IRC (Quit: Disconnected.) [19:50] *** Start has joined #archiveteam [19:54] *** Honno has quit IRC (Ping timeout: 244 seconds) [19:59] *** Smiley has joined #archiveteam [20:00] *** Tomcat_ has joined #archiveteam [20:04] *** Honno has joined #archiveteam [20:04] *** Start has quit IRC (Quit: Disconnected.) [20:06] *** bwn_ has quit IRC (Ping timeout: 244 seconds) [20:08] *** Start has joined #archiveteam [20:13] *** schbirid2 has quit IRC (Quit: Leaving) [20:20] *** VADemon has quit IRC (Quit: left4dead) [20:22] *** Tomcat_ has quit IRC (Remote host closed the connection) [20:39] *** bwn has joined #archiveteam [20:50] *** metalcamp has quit IRC (Ping timeout: 244 seconds) [20:51] *** ariscop has quit IRC (Ping timeout: 633 seconds) [20:54] *** atomotic has quit IRC (Quit: My Mac has gone to sleep. ZZZzzz…) [21:10] *** ndiddy has joined #archiveteam [21:11] *** Start has quit IRC (Quit: Disconnected.) 
[21:14] *** Froggypwn has quit IRC (Read error: Connection reset by peer) [21:44] *** Honno has quit IRC (Ping timeout: 633 seconds) [22:00] *** atomotic has joined #archiveteam [22:05] *** Start has joined #archiveteam [22:08] *** Froggypwn has joined #archiveteam [22:21] *** atomotic has quit IRC (Quit: Textual IRC Client: www.textualapp.com) [22:27] *** ariscop has joined #archiveteam [22:51] *** BlueMaxim has joined #archiveteam [23:00] *** bsmith093 has quit IRC (Ping timeout: 370 seconds) [23:18] *** bsmith093 has joined #archiveteam [23:24] more of yuku coming tomorrow! [23:33] I'm done with my nap [23:33] Back to work [23:36] Yes [23:36] I'm just about to start with my nap [23:36] Have a good day [23:56] *** bsmith093 has quit IRC (Quit: Leaving.)