[01:12] How do we get this to be maintained? [01:13] Sorry, what is unmaintained? The ipfs stuff, or something else? [01:14] IPFS was never used here. [01:15] According to the wiki, the IPFS devs proposed using it for this project, but that never happened. [01:19] But internetarchive.bak _is_ maintained in general? [01:20] Nope [01:20] At least according to Kaz above. [01:20] I only came here 24 hours ago. [01:20] I know the project hasn't really been active in years though. [01:21] So yeah, unmaintained. [02:02] Is it worth trying to kick it and see if it will start? [02:02] The website is up.... [02:32] it is not [02:32] (to your first point) [04:15] *** iabak-reg has quit IRC (Remote host closed the connection) [04:31] *** iabak-reg has joined #internetarchive.bak [05:12] Kaz: is it worth trying an alternative? [05:13] I think you're underestimating the scale of such a project [05:14] I am certainly not disagreeing with you that it is a project of significant scale. [05:14] But you only eat an elephant one bite at a time. [05:15] It very well may be beyond me or any rag-tag band I could get together. [05:15] That said, it seems unlikely you have a basis to judge my technical skills or resources. [05:22] your resources aside, the way to 'fix' this project is to throw away the current implementation and start again with a new idea [05:22] which someone has to run, maintain, etc [05:23] What would you say is the primary issue with the current implementation? [05:23] It seems to be registering data. [05:24] human maintenance overhead doesn't scale, restores have never been tested fully, hard to monitor, somewhat slow [05:24] that's just off the top of my head, as this hasn't been running for years [05:24] not to mention that it's needlessly complex with a combo of bash, perl, haskell all around the place [05:25] Yeah, there are some other projects which seem to fill this kind of niche. [05:25] Something like dat: https://dat.foundation/ [05:27] probably [05:28] except we'd need to scale to (currently) 60PB [05:28] and most tools fall over at that point [05:28] Many do, it's true. [05:29] What happened to the previous maintainer? [05:29] Just wandered away? [05:29] Or is it you? [05:31] there was a team of us [05:31] it became too much work, other projects came up, it basically got forgotten [05:32] Gotcha. [05:33] I understand that there are a couple of copies of the archive, I wonder if they are still kept up-to-date. [05:37] I was under the impression that ia.bak was relatively curated. [05:37] The current page lists 107TB or something, [05:37] Which is a couple orders of magnitude off 60PB [05:38] But could help with "endangered" content. [05:38] it's lying to you [05:38] the stats are broken [05:41] Gotcha. [05:43] So totally borked [05:50] indeed [09:14] Would be good to mention that on the wiki page and remove the project from the homepage then. [09:14] Kaz: Any idea since when this has been broken? [09:15] Also, what's the deal with shard 13? :> [10:27] lets say.. december 2016 [10:28] as for SHARD13.. https://www.irccloud.com/pastebin/HolEgD4J/ [10:28] I think we forgot to make a shard 13 [10:35] Someone superstitious? :-) [10:37] Honestly, I think we just forgot how to count [11:44] *** kiska18 has quit IRC (Ping timeout (120 seconds)) [11:47] *** kiska18 has joined #internetarchive.bak [18:57] *** antomati_ is now known as antomatic [20:54] *** patrickod has quit IRC (Ping timeout: 255 seconds) [20:58] *** patrickod has joined #internetarchive.bak [23:51] *** wp494 has quit IRC (Quit: LOUD UNNECESSARY QUIT MESSAGES) [23:56] *** wp494 has joined #internetarchive.bak