[01:05] *** lennier1 has quit IRC (Quit: Going offline, see ya! (www.adiirc.com)) [01:11] *** lennier1 has joined #archiveteam-bs [01:13] *** Soni has joined #archiveteam-bs [01:13] *** Soni has quit IRC (Remote host closed the connection) [01:59] *** Mayonaise has joined #archiveteam-bs [01:59] *** Mayonaise has quit IRC (Client Quit) [02:27] *** Mayonaise has joined #archiveteam-bs [03:35] *** qw3rty_ has quit IRC (Read error: Operation timed out) [03:48] *** Wingy has joined #archiveteam-bs [05:28] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [06:39] *** qw3rty has joined #archiveteam-bs [07:22] *** jshoard has joined #archiveteam-bs [07:50] *** sirvy has quit IRC (Ping timeout: 615 seconds) [08:26] *** sirvy has joined #archiveteam-bs [09:54] SketchCow: where do i learn more about this stuff so im not reinventing everything [10:10] *** BlueMax has quit IRC (Quit: Leaving) [10:53] *** VerifiedJ has joined #archiveteam-bs [13:34] *** ats has quit IRC (Remote host closed the connection) [13:34] *** ats has joined #archiveteam-bs [13:44] What stuff [13:44] I guess I'm trying to understand the intent [15:00] *** Arcorann_ has quit IRC (Read error: Connection reset by peer) [15:23] https://hub.docker.com/search?q=bioinformatics&type=image&page=12 [15:23] (From #archiveteam) [15:23] A quick search reveals that there are only 298 images we'd need to backup (for bioinformatics) [15:23] I think this is doable [15:25] Correct [15:28] The real question is how do we make sure the docker images are accessible to people trying to reproduce the papers [15:29] Maybe contacting Docker or an annoucement to the bioinformatics community? [15:29] My point is it's not that useful if we archive the docker images and nobody else knows that they're on IA [15:29] Seeing how scant most of their metadata is, I'll guess there's more than the 298 that come up in that search [15:30] But yeah, I don't expect there to be a million [15:31] As I've said before, I'm not too familiar with Docker [15:31] Yeah, just it was just a quick feasibility check (I'm not claiming the # of images we're archiving is gonna be exactly 298) [15:31] Also other fields may have the same problem [15:32] (the audience on HN aren't super representative of all the sciences) [15:32] And I only have a vauge idea of what "bioinformatics" is [15:33] If there's a focus on sciences, maybe there's some way to look through links in citations or something like that? [15:34] I am WAYYYYY less worried about "people don't know it's on IA" than "we properly get them on IA". [15:34] How big is a typical image. [15:40] Pretty big actually (usually in the single-digit GB range) [17:17] Soo Docker Hub... This won't be fun or easy. [17:17] What we should *really* do is archive all images with all past versions. [17:18] For the example of bioinformatics, a change in the base system can affect the outcome, so it's important to preserve the base image that was used at the time. [17:18] The problem is: as far as I know, there is no way to find past versions. [17:20] The website and I believe also the API only returns what each tag for an image currently points to. But you need to find the image IDs used in the past. [17:20] For 'official' images, there is a repository on GitHub that has the necessary metadata, but that's not the case for any unofficial images. [17:20] I haven't looked into it deeper, but this won't be easy to tackle. [17:22] As for storage, in the ideal world, we'd have on IA item for each layer I guess and one for each image (which would basically just be metadata listing the layers). That's probably not feasible though. [18:25] *** DLoader_ has joined #archiveteam-bs [18:26] Inevitable consequence of centralization, re: docker problems now. [18:36] *** DLoader has quit IRC (Ping timeout: 745 seconds) [18:36] *** DLoader_ is now known as DLoader [18:39] *** Stilett0 has joined #archiveteam-bs [18:46] *** Stiletto has quit IRC (Read error: Operation timed out) [19:08] JAA: Good point [19:55] *** Smiley has quit IRC (Remote host closed the connection) [19:55] *** Dallas has quit IRC (Read error: Connection reset by peer) [19:55] *** Smiley has joined #archiveteam-bs [19:58] *** ats_ has joined #archiveteam-bs [19:58] *** Hoolootwo has joined #archiveteam-bs [20:03] *** SJon____ has joined #archiveteam-bs [20:04] *** SJon___ has quit IRC (Ping timeout: 265 seconds) [20:04] *** ats has quit IRC (Ping timeout: 265 seconds) [20:04] *** lunik1 has quit IRC (Ping timeout: 265 seconds) [20:04] *** SJon____ is now known as SJon___ [20:04] *** Hooloovoo has quit IRC (Ping timeout: 265 seconds) [20:04] *** Tugboat has quit IRC (Ping timeout: 265 seconds) [20:04] *** Jens has quit IRC (Ping timeout: 265 seconds) [20:04] *** SJon___ has quit IRC (Killed (se.hub (Nick collision (new)))) [20:04] *** pew has quit IRC (Ping timeout: 265 seconds) [20:04] *** OrIdow6 has quit IRC (Ping timeout: 265 seconds) [20:04] *** arkiver has quit IRC (Ping timeout: 265 seconds) [20:04] *** robbi5 has quit IRC (Ping timeout: 265 seconds) [20:04] *** purplebot has quit IRC (Ping timeout: 265 seconds) [20:13] *** i0npulse has quit IRC (Ping timeout: 265 seconds) [20:18] *** OrIdow6 has joined #archiveteam-bs [20:18] *** Tugboat has joined #archiveteam-bs [20:19] *** i0npulse has joined #archiveteam-bs [20:20] *** Jens has joined #archiveteam-bs [20:25] *** Ajay1 has joined #archiveteam-bs [21:02] *** bleb has joined #archiveteam-bs [21:05] *** Ajay1 has quit IRC (se.hub irc.underworld.no) [21:05] *** i0npulse has quit IRC (se.hub irc.underworld.no) [21:05] *** OrIdow6 has quit IRC (se.hub irc.underworld.no) [21:05] *** Tugboat has quit IRC (se.hub irc.underworld.no) [21:05] *** Jens has quit IRC (se.hub irc.underworld.no) [21:05] *** cm has quit IRC (se.hub irc.underworld.no) [21:11] *** fredgido_ has joined #archiveteam-bs [21:15] *** fredgido has quit IRC (Read error: Operation timed out) [21:18] Oh yeah, haven't seen it mentioned yet, but: 'As the world’s largest repository of container images, Docker Hub stores more than 15PB of data. After detailed analysis of the container images stored on Docker Hub, we found that 4.5PB of the data have not been pushed or pulled within 6 months or longer.' (from https://www.docker.com/pricing/retentionfaq) [21:19] So 4.5 PB of data at immediate risk, 15+ PB of data in total [21:20] *** bleb is now known as cm [21:41] *** OrIdow6 has joined #archiveteam-bs [21:41] *** Tugboat has joined #archiveteam-bs [21:42] *** i0npulse has joined #archiveteam-bs [21:44] *** arkiver has joined #archiveteam-bs [21:44] *** Ajay19 has joined #archiveteam-bs [21:45] *** svchfoo3 sets mode: +o arkiver [21:48] *** Ajay19 is now known as Ajay1 [21:48] *** Ajay18 has joined #archiveteam-bs [21:50] *** Ajay18 has quit IRC (Client Quit) [21:51] *** Ajay19 has joined #archiveteam-bs [21:53] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [21:53] *** Ajay19 is now known as Ajay1 [21:53] *** Ajay14 has joined #archiveteam-bs [21:55] *** Ajay14 has quit IRC (Client Quit) [21:56] *** Ajay19 has joined #archiveteam-bs [21:57] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [21:57] *** Ajay19 is now known as Ajay1 [21:58] *** Ajay19 has joined #archiveteam-bs [22:00] *** Ajay19 has quit IRC (Client Quit) [22:01] *** Ajay16 has joined #archiveteam-bs [22:02] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [22:02] *** Ajay16 is now known as Ajay1 [22:03] *** Ajay10 has joined #archiveteam-bs [22:03] *** Jens has joined #archiveteam-bs [22:05] *** Ajay10 has quit IRC (Client Quit) [22:05] *** Ajay19 has joined #archiveteam-bs [22:07] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [22:07] *** Ajay19 is now known as Ajay1 [22:08] *** Ajay19 has joined #archiveteam-bs [22:10] *** Ajay19 has quit IRC (Client Quit) [22:10] *** Ajay17 has joined #archiveteam-bs [22:12] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [22:12] *** Ajay17 is now known as Ajay1 [22:12] *** Ajay15 has joined #archiveteam-bs [22:13] feels like a 'just tell us which 4.5PB to grab' type job [22:14] *** Ajay15 has quit IRC (Client Quit) [22:15] *** Ajay11 has joined #archiveteam-bs [22:17] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [22:17] *** Ajay11 is now known as Ajay1 [22:17] *** Ajay13 has joined #archiveteam-bs [22:18] Ajay1, Ajay13: Fix your connection please. [22:19] *** Ajay13 has quit IRC (Client Quit) [22:20] *** Ajay13 has joined #archiveteam-bs [22:21] Kaz: Yeah, and I doubt they'll tell us, unfortunately. We'll basically have to find a way to discover old builds' image IDs, and I'm not sure if that's possible. [22:22] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [22:22] *** Ajay13 is now known as Ajay1 [22:22] Also, maybe we can just periodically download every image to reset their counts and prevent deletion altogether. :-P [22:22] Since their criterion according to the FAQ is 'no pulls in the past 6 months'. [22:22] *** Ajay15 has joined #archiveteam-bs [22:24] *** Ajay15 has quit IRC (Client Quit) [22:25] *** Ajay13 has joined #archiveteam-bs [22:26] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [22:26] *** Ajay13 is now known as Ajay1 [22:27] *** Ajay18 has joined #archiveteam-bs [22:29] *** Ajay18 has quit IRC (Client Quit) [22:30] *** Ajay14 has joined #archiveteam-bs [22:31] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [22:31] *** Ajay14 is now known as Ajay1 [22:32] *** Ajay16 has joined #archiveteam-bs [22:36] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [22:36] *** Ajay16 is now known as Ajay1 [22:37] *** Ajay19 has joined #archiveteam-bs [22:39] *** Ajay19 has quit IRC (Client Quit) [22:39] *** Ajay13 has joined #archiveteam-bs [22:41] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [22:41] *** Ajay13 is now known as Ajay1 [22:41] *** Ajay12 has joined #archiveteam-bs [22:42] *** HP_Archiv has joined #archiveteam-bs [22:43] *** Ajay12 has quit IRC (Client Quit) [22:44] *** voltagex has joined #archiveteam-bs [22:44] *** Ajay15 has joined #archiveteam-bs [22:46] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [22:46] *** Ajay15 is now known as Ajay1 [22:46] *** Ajay14 has joined #archiveteam-bs [22:48] *** Ajay14 has quit IRC (Client Quit) [22:49] *** Ajay15 has joined #archiveteam-bs [22:51] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [22:51] *** Ajay15 is now known as Ajay1 [22:51] *** Ajay13 has joined #archiveteam-bs [22:53] *** Ajay13 has quit IRC (Client Quit) [22:54] *** Ajay18 has joined #archiveteam-bs [22:55] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [22:55] *** Ajay18 is now known as Ajay1 [22:56] *** sirvy has quit IRC (Ping timeout: 615 seconds) [22:56] *** Ajay15 has joined #archiveteam-bs [22:58] *** Ajay15 has quit IRC (Client Quit) [22:58] *** Ajay17 has joined #archiveteam-bs [22:59] *** Mateon1 has quit IRC (Read error: Operation timed out) [23:00] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [23:00] *** Ajay17 is now known as Ajay1 [23:01] *** Ajay14 has joined #archiveteam-bs [23:03] *** Ajay14 has quit IRC (Client Quit) [23:03] *** Ajay16 has joined #archiveteam-bs [23:05] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [23:05] *** Ajay16 is now known as Ajay1 [23:06] *** Ajay16 has joined #archiveteam-bs [23:08] *** Ajay16 has quit IRC (Client Quit) [23:08] *** Ajay16 has joined #archiveteam-bs [23:10] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [23:10] *** Ajay16 is now known as Ajay1 [23:10] *** Ajay11 has joined #archiveteam-bs [23:12] *** sirvy has joined #archiveteam-bs [23:12] *** Ajay11 has quit IRC (Client Quit) [23:13] *** Ajay12 has joined #archiveteam-bs [23:13] *** HP_Archiv has quit IRC (Quit: Leaving) [23:15] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [23:15] *** Ajay12 is now known as Ajay1 [23:15] *** Ajay16 has joined #archiveteam-bs [23:17] *** Ajay16 has quit IRC (Client Quit) [23:18] *** Ajay11 has joined #archiveteam-bs [23:19] *** Mateon1 has joined #archiveteam-bs [23:20] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [23:20] *** Ajay11 is now known as Ajay1 [23:20] *** Ajay17 has joined #archiveteam-bs [23:22] *** Ajay17 has quit IRC (Client Quit) [23:23] *** Ajay16 has joined #archiveteam-bs [23:24] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [23:24] *** Ajay16 is now known as Ajay1 [23:25] *** Ajay18 has joined #archiveteam-bs [23:27] *** Ajay18 has quit IRC (Client Quit) [23:27] *** Ajay19 has joined #archiveteam-bs [23:29] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [23:29] *** Ajay19 is now known as Ajay1 [23:30] *** Ajay12 has joined #archiveteam-bs [23:31] *** VerifiedJ has quit IRC (Quit: Leaving) [23:31] *** fredgido has joined #archiveteam-bs [23:32] *** Ajay12 has quit IRC (Client Quit) [23:32] *** Ajay16 has joined #archiveteam-bs [23:34] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [23:34] *** Ajay16 is now known as Ajay1 [23:35] *** Ajay12 has joined #archiveteam-bs [23:35] *** fredgido_ has quit IRC (Read error: Operation timed out) [23:37] *** Ajay12 has quit IRC (Client Quit) [23:37] *** Ajay15 has joined #archiveteam-bs [23:38] *** lennier2 has joined #archiveteam-bs [23:39] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [23:39] *** Ajay15 is now known as Ajay1 [23:39] *** Ajay18 has joined #archiveteam-bs [23:41] *** lennier1 has quit IRC (Ping timeout: 272 seconds) [23:41] *** lennier2 is now known as lennier1 [23:41] *** Ajay18 has quit IRC (Client Quit) [23:42] *** Ajay19 has joined #archiveteam-bs [23:44] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [23:44] *** Ajay19 is now known as Ajay1 [23:44] *** Ajay16 has joined #archiveteam-bs [23:46] *** Ajay16 has quit IRC (Client Quit) [23:47] *** Ajay13 has joined #archiveteam-bs [23:49] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [23:49] *** Ajay13 is now known as Ajay1 [23:49] *** Ajay11 has joined #archiveteam-bs [23:51] *** Ajay11 has quit IRC (Client Quit) [23:52] *** Ajay11 has joined #archiveteam-bs [23:53] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [23:53] *** Ajay11 is now known as Ajay1 [23:54] *** Arcorann_ has joined #archiveteam-bs [23:54] *** Ajay15 has joined #archiveteam-bs [23:56] *** Ajay15 has quit IRC (Client Quit) [23:56] *** Ajay18 has joined #archiveteam-bs [23:58] *** Ajay1 has quit IRC (Ping timeout: 265 seconds) [23:58] *** Ajay18 is now known as Ajay1 [23:59] *** Ajay19 has joined #archiveteam-bs