[00:10] *** icedice2 has joined #archiveteam-bs [00:11] *** icedice2 has quit IRC (Client Quit) [00:14] *** icedice has quit IRC (Ping timeout: 260 seconds) [02:00] *** BlueMax has quit IRC (Leaving) [02:07] *** m007a83 has quit IRC (Read error: Connection reset by peer) [02:16] *** m007a83 has joined #archiveteam-bs [02:28] *** ta9le has quit IRC (Quit: Connection closed for inactivity) [02:39] *** BlueMax has joined #archiveteam-bs [02:41] *** m007a83 has quit IRC (Quit: Leaving) [02:47] *** m007a83 has joined #archiveteam-bs [02:50] *** lgorithm has joined #archiveteam-bs [03:13] *** dxrt- is now known as dxrt [03:15] *** dxrt_ has joined #archiveteam-bs [03:25] *** odemg has quit IRC (Ping timeout: 246 seconds) [03:39] *** odemg has joined #archiveteam-bs [03:58] *** qw3rty115 has joined #archiveteam-bs [04:04] *** qw3rty114 has quit IRC (Read error: Operation timed out) [04:31] SketchCow: https://archive.org/details/smartcomputing-learning-series-v6i7 [04:31] SketchCow: https://archive.org/details/smartcomputing-learning-series-v8i2 [04:48] *** beardicus has quit IRC (bye) [04:49] *** beardicus has joined #archiveteam-bs [04:54] *** davidar has joined #archiveteam-bs [05:03] *** dxrt has quit IRC (ZNC - http://znc.sourceforge.net) [05:04] *** dxrt has joined #archiveteam-bs [05:10] Kenshin, get in #getgit <3 [06:09] *** Atom has quit IRC (Read error: Connection reset by peer) [06:55] *** Flashfire has joined #archiveteam-bs [06:56] betamax: We could join the group, archive messages and leave again. I don’t think anyone is actively monitoring Yahoo! Groups for abuse right now. [06:56] I can do that I have spare time [07:08] *** Ing3b0rg has quit IRC (Read error: Operation timed out) [07:09] Is it a waste of time for me while I have some spare time to individually archive some of https://dnshistory.org/ manually? Will it be accepted by the internet archive or not? [07:12] *** Ing3b0rg has joined #archiveteam-bs [07:44] *** lgorithm has quit IRC (Quit: My Mac has gone to sleep. ZZZzzz…) [07:50] Flashfire: We did start an ArchiveBot job for StumbleUpon. Haven't looked into whether that'll grab everything. [07:51] Flashfire: Regarding the Tanzanian blogs, absolutely. Can you compile a list of blogs? Then we can determine how we best grab them (ArchiveBot !a < LIST if it's not too large, otherwise we might need to split it up or grab it with another method). [08:16] Flashfire: DNSHistory should definitely be archived. The problem is that anything manual won't grab any significant amount of the data. The site is massive. [08:26] Yeah I have been looking for them but not much luck [08:39] https://www.allymsangi.com/2016/05/tanzanias-best-blogs-you-should-know.html JAA this might be a bit outdated [08:39] but it might help a bit [08:47] Also jaa with DNSHistory every little bit counts [08:49] *** Flashfir_ has joined #archiveteam-bs [08:49] *** Flashfire has quit IRC (Read error: Connection reset by peer) [08:49] *** Flashfir_ has quit IRC (Client Quit) [08:50] *** Flashfire has joined #archiveteam-bs [08:58] *** BlueMax has quit IRC (Leaving) [09:30] *** Flashfire has quit IRC (Quit: Textual IRC Client: www.textualapp.com) [09:46] hmmmmm. https://twitter.com/josephfcox/status/1006549904758632448 [09:52] And here’s the code: https://github.com/motherboardgithub/archive_tweet/blob/master/archive_tweet.py [09:52] Does /save/ even work well without a browser? [10:30] I think so, but --page-requisites is probably a good idea. [11:17] yeah [11:17] that script does not save for example photos in the tweet [11:22] "ArchiveTeams self-righteous attitude" [11:22] we're gettig famous https://dnshistory.org/ [11:22] been up there for like a year now? [11:24] Ever since we tried to grab it in summer 2016, I think. [11:24] :) [11:24] Maybe we should try again. [11:28] we could try [11:28] slow and steady [11:28] I think it's too big to do it fast and quick [11:29] or contact them and nicely ask for a dump, apologize etc. [11:29] but yeaah [11:31] *** ta9le has joined #archiveteam-bs [11:32] Right [11:43] We have better things to do than hostage negotiation. [12:04] Well, We did try and contact them [12:04] If my memory serves [12:04] And they ignored all attempts of communication with them [12:09] *** schbirid has joined #archiveteam-bs [12:24] *** rbraun has quit IRC (Read error: Connection reset by peer) [12:25] *** rbraun has joined #archiveteam-bs [12:42] *** RichardG has quit IRC (Read error: Connection reset by peer) [12:44] *** RichardG has joined #archiveteam-bs [12:56] *** rbraun has quit IRC (Read error: Operation timed out) [13:03] *** chirlu has quit IRC (se.hub irc.efnet.nl) [13:03] *** Sue has quit IRC (se.hub irc.efnet.nl) [13:03] *** ReimuHaku has quit IRC (se.hub irc.efnet.nl) [13:03] *** kisspunch has quit IRC (se.hub irc.efnet.nl) [13:03] *** altlabel has quit IRC (se.hub irc.efnet.nl) [13:03] *** Fusl_ has quit IRC (se.hub irc.efnet.nl) [13:03] *** Ceryn^ has quit IRC (se.hub irc.efnet.nl) [13:03] *** BnAboyZ has quit IRC (se.hub irc.efnet.nl) [13:03] *** tsr has quit IRC (se.hub irc.efnet.nl) [13:03] *** chirlu` has joined #archiveteam-bs [13:04] *** tsr_ has joined #archiveteam-bs [13:06] *** kisspunch has joined #archiveteam-bs [13:08] there's nothing to be done, but in case anyone is interested, Tesco is closing the tesco.net email service on the 27th of June [13:13] *** rbraun has joined #archiveteam-bs [13:15] *** Sue_ has joined #archiveteam-bs [13:19] *** tsr_ is now known as tsr [13:27] *** Tenebrae has joined #archiveteam-bs [13:44] *** godane has quit IRC (Ping timeout: 260 seconds) [13:54] *** phillipsj has quit IRC (Leaving) [14:00] *** godane has joined #archiveteam-bs [14:01] *** Atom has joined #archiveteam-bs [14:51] *** Sk1d has quit IRC (Read error: Operation timed out) [14:54] *** Sk1d has joined #archiveteam-bs [16:13] *** Boppen has joined #archiveteam-bs [16:58] *** schbirid has quit IRC (Quit: Leaving) [17:07] https://archive.org/report/space.php [17:22] holy shit, quite the jump, was just looking at this the other day and saw 680T free [17:23] SketchCow, arkiver so what's the final word on how much of github ia is willing to take? [17:24] *** lgorithm has joined #archiveteam-bs [17:26] There is no situation where IA is going to mirror github [17:27] I have suggested that we back up the oldest projects, the longest since last update, as those are the ones that would be aged out in a purge. [17:27] Arkiver, I believe, wants to go after the top, most popular projects [17:27] cc Kaz [17:28] yeah arkiver suggested stared and forked first, but oldest 'at risk' makes more sense in the short term [17:30] SketchCow, so size wise what is ia willing to take on before they cut us off on this [17:36] Dude. [17:36] Today... not the day [17:37] Please stop thinking of "How much do you think we can totally overstay our welcome before they cotton on" [17:37] Mirroring Github would be stupid [17:38] Mirroring the parts of Github that seem most at risk makes sense, but even then, the code will quickly, QUICKLY go out of date [17:38] Something so fundamental should just be mirrored using a more live system [17:43] mirror the the less updated stuff make senses in my book [17:44] that way there is no/less worry about the archive of it be out of date [17:44] Makes sense, https://i.imgur.com/h1rW4JE.jpg [17:45] i figure if you can get code that has not be update in 5 years start from there [17:46] then slowly go after stuff that not been update in the last 4 then 3 years [17:46] when space is not a problem [17:46] i figure if its not be update in 4 or 5 years the project maybe dead [17:49] so i uploaded a 1TB in a month: size: 132,287,243,476 KB [17:50] fuck me it's been less the a month [17:51] last size was 131.1TB on 2018-05-26 [17:51] *** t2t2 has quit IRC (Ping timeout: 260 seconds) [17:53] *** t2t2 has joined #archiveteam-bs [19:40] *** jschwart has joined #archiveteam-bs [20:05] Any big repositories will always live on somewhere. Someone will always keep something large alive (linux kernel, mysql, etc...) No need to mirror those. I think that it's the long-tail of forgotten projects that won't be moved to the next place that would be lost. [20:10] *** tuluu has quit IRC (Ping timeout: 268 seconds) [20:38] *** RichardG has quit IRC (Read error: Connection reset by peer) [20:38] 18:44 < godane> that way there is no/less worry about the archive of it be out [20:38] whoops [20:39] no idea how I did that [20:39] *** RichardG has joined #archiveteam-bs [20:54] *** wp494 has quit IRC (Ping timeout: 252 seconds) [20:57] *** wp494 has joined #archiveteam-bs [21:09] SketchCow: https://archive.org/details/smartcomputing-learning-series-v6i11 [21:35] SketchCow: https://archive.org/details/smartcomputing-learning-series-v7i3 [21:43] Yeah, you're rocking it [21:43] And my automatic godanerator is doing the job! Only your one-offs and scans are not being automatically put in a home [22:16] at least i'm mostly done with the magazines i bought [22:16] i think i have a pc novice and 3 smart computing magazines left [22:16] the like 2 reference books and 6 learning series [22:17] i have shutdown the vbox every 90 pages or so cause the scanner takes longer to scan pages [22:17] most likely cause its in a vbox [22:20] *** jschwart has quit IRC (Quit: Konversation terminated!) [22:23] *** phillipsj has joined #archiveteam-bs [22:57] *** REiN^ has quit IRC (Read error: Operation timed out) [22:58] *** REiN^ has joined #archiveteam-bs [23:17] *** BlueMax has joined #archiveteam-bs