[00:36] *** dashcloud has joined #archiveteam-bs [00:48] *** BlueMax has joined #archiveteam-bs [00:57] *** ndiddy has joined #archiveteam-bs [01:29] *** zhongfu_ has joined #archiveteam-bs [01:29] *** zhongfu has quit IRC (Ping timeout: 260 seconds) [01:36] *** ndiddy has quit IRC () [02:08] *** caff__ has joined #archiveteam-bs [02:12] *** caff_ has quit IRC (Read error: Operation timed out) [03:15] *** davidar has joined #archiveteam-bs [03:35] *** bitBaron_ has quit IRC (Quit: My computer has gone to sleep. 😴😪ZZZzzz…) [03:37] *** archodg__ has joined #archiveteam-bs [03:40] *** archodg_ has quit IRC (Ping timeout: 252 seconds) [03:40] *** odemg has quit IRC (Ping timeout: 260 seconds) [03:52] *** odemg has joined #archiveteam-bs [04:19] *** godane has quit IRC (Leaving.) [04:26] *** dashcloud has quit IRC (Read error: Operation timed out) [04:44] *** godane has joined #archiveteam-bs [04:45] *** svchfoo1 sets mode: +o godane [05:51] *** caff_ has joined #archiveteam-bs [05:56] *** caff__ has quit IRC (Read error: Operation timed out) [05:56] *** nuc has quit IRC (Read error: Operation timed out) [05:59] *** Pixi` has quit IRC (Read error: Operation timed out) [06:00] *** caff_ has quit IRC (Read error: Operation timed out) [06:01] *** Pixi has joined #archiveteam-bs [06:01] *** nuc has joined #archiveteam-bs [06:07] *** xmc has joined #archiveteam-bs [06:07] *** swebb sets mode: +o xmc [06:08] *** astrid has quit IRC (Read error: Connection reset by peer) [06:13] *** RichardG_ has quit IRC (Ping timeout: 360 seconds) [06:15] *** xmc has quit IRC (Excess Flood) [06:15] *** Pixi` has joined #archiveteam-bs [06:15] *** RichardG has joined #archiveteam-bs [06:15] *** xmc has joined #archiveteam-bs [06:15] *** swebb sets mode: +o xmc [06:17] *** schbirid has joined #archiveteam-bs [06:19] *** Pixi has quit IRC (Read error: Operation timed out) [06:44] *** Stilett0 has joined #archiveteam-bs [07:54] Do we have a list somewhere for sites that disallow the IA crawler? [07:55] https://www.indeed.com/ is one that should be added to that list if we have on [07:55] *** RichardG has quit IRC (Ping timeout: 633 seconds) [08:03] *** vitzli has joined #archiveteam-bs [08:25] *** Rai-chan has quit IRC (Quit: ZNC - http://znc.in) [09:18] *** BlueMax has quit IRC (Quit: Leaving) [09:18] *** BlueMax has joined #archiveteam-bs [09:26] That list would be massive, unfortunately. [09:27] oh [09:27] And its not worth throwing some of them into the archivebot [09:31] We do sometimes archive sites that block IA. Depends on the site and available resources though. [09:47] *** vitzli has quit IRC (Quit: Leaving) [09:57] *** SynMonger has quit IRC (Read error: Operation timed out) [09:58] *** lindalap has joined #archiveteam-bs [10:02] *** SynMonger has joined #archiveteam-bs [10:46] "In late July 2018, Alucard's hosting provider suspended and terminated his account. Since Alucard had no backups, all files were lost. In early August, Alucard sold the domain to lesderid of fuwafuwa.moe." https://safe.moe/history.html [10:46] :/ [10:46] And now it's mining coins. [10:48] Lindalap at least they ask [10:50] It's given to lesderid, who now has: p.fuwafuwa.moe, cocaine.ninja, fuwa.se, safe.moe, u.fuwafuwa.moe. What could go wrong? (Eh, already went wrong because all disabled uploads.) [10:51] And lesderid was already arrested once (?) for Pomf hosting [10:52] or was that Alucard [10:52] https://fuwafuwa.moe/nr/freeme/ [10:53] robots.txt exclusion now on that page, which wasn't earlier. [11:30] *** dashcloud has joined #archiveteam-bs [11:34] *** RichardG has joined #archiveteam-bs [11:36] *** BlueMax has quit IRC (Quit: Leaving) [13:16] *** RichardG has quit IRC (Ping timeout: 260 seconds) [13:46] *** lindalap has quit IRC (Quit: lindalap) [13:50] *** RichardG has joined #archiveteam-bs [14:12] *** archodg__ has quit IRC (Quit: Leaving) [14:44] *** REiN^ has quit IRC (no.money.no.love) [14:48] *** wp494 has quit IRC (Ping timeout: 268 seconds) [14:48] *** wp494 has joined #archiveteam-bs [14:53] *** Arctic has joined #archiveteam-bs [14:55] *** dashcloud has quit IRC (Ping timeout: 480 seconds) [15:15] Alright, I'm here. [15:17] Also, the Chrome Store Foxified extension is odd is that versions of the extension have different versions of some versions between their GitHub repo and the Addon Store (AMO). Something to keep in mind. [15:18] *** t2t2 has quit IRC (Remote host closed the connection) [15:18] *** bitBaron has joined #archiveteam-bs [15:19] We do have a project for GitHub (#getgit). That would probably be the right place for the GitHub repos. [15:19] (It's dormant at the moment though.) [15:20] *** t2t2 has joined #archiveteam-bs [15:21] So, the search API (https://addons-server.readthedocs.io/en/latest/topics/api/addons.html#get--api-v4-addons-search-) says there are only 27543 items with type=extension on AMO. [15:22] I highly doubt that. [15:22] I thought there were more. [15:23] Then again, 27543 is still a lot. [15:23] One big issue with archiving AMO is that the site uses "src" parameters in the URLs absolutely everywhere. This means that maintaining browsability requires grabbing everything multiple times. [15:23] We're going to be archiving the pages as well as the extensions? [15:24] That's the idea, yeah. [15:24] And also subpages, like the version history (which has changelogs), maybe statistics etc. [15:24] Reviews [15:24] Screenshots [15:24] The whole thing really. [15:24] I thought we were just going to archive the extensions along with their short and long descriptions.. [15:24] .* [15:25] Also from #archiveteam [15:25] [2018-08-22 14:51:56] I've mentioned this a while ago, but now we have a definitive date and less time. [15:25] [2018-08-22 14:52:31] Mozilla is deleting all legacy extensions from their extension store on September 5th. [15:25] [2018-08-22 14:52:33] https://blog.mozilla.org/addons/2018/08/21/timeline-for-disabling-legacy-firefox-add-ons/ [15:25] [2018-08-22 14:53:18] Correction, they'll stop supporting the addons on September 5th. [15:25] [2018-08-22 14:53:33] They will be mass-deleted in early October. [15:25] It makes sense, though it may be beyond my help now. [15:25] The good news is the API makes it relatively easy to find and grab old versions. [15:25] See https://addons-server.readthedocs.io/en/latest/topics/api/addons.html#versions-list [15:26] I think that is enough context :) [15:26] I can't grab entire sets of pages as easily as I can extensions, and even then, I have to do it manually because I'm on a Mac and Macs don't have the same tools as Windows or Linux. [15:26] PurpleSym: Yeah, that would work for addons which are listed I guess. But as far as I can see, the result doesn't indicate whether a version is legacy or WebExtension. [15:27] JAA: It does, there’s a per-file key called is_webextension. [15:27] Ah [15:28] How can I help if I'm on a Mac? [15:29] Arctic: You can run wget, curl, and all the other cool tools on macOS as well. Windows is the platform where it's tricky. [15:29] *** Mateon1 has quit IRC (Ping timeout: 740 seconds) [15:30] Windows is the land of workarounds [15:30] *** Mateon1 has joined #archiveteam-bs [15:31] Last time I checked (about a month ago), there were a bit under 1 million addon IDs and just over 1 million file IDs. [15:31] Addon pages are at https://addons.mozilla.org/en-US/firefox/addon/$addonid [15:31] (Which redirects to the canonical URL) [15:31] Files are at https://addons.mozilla.org/firefox/downloads/file/$fileid/$filename.xpi [15:32] JAA: Ah. I'd think it'd be the other way around. [15:32] The filename doesn't matter for the download, but obviously it does matter for links. [15:32] As mentioned, there are "src" parameters everywhere. [15:32] I'd also have to figure out how to get around any space problems. [15:32] If you go to an addon page from the search, you get ?src=search. [15:33] My Mac has a decent amount of space, but it'll likely run out quickly. [15:33] If you click on the download then, same. If you go through the version history, ?src=version-history. And so on. [15:33] That parameter will be incredibly annoying. [15:33] Where are we going to put everything? [15:35] *** REiN^ has joined #archiveteam-bs [15:35] JAA: do we have a channel for this? [15:36] Not yet as far as I know. [15:36] let´s make one [15:36] time to archive all firefox addons/extension/everything [15:37] Yes [15:38] I'll go make the channel. [15:38] What should it be called? [15:40] Who here is good at coming up with names? [15:40] Not me. [15:40] addgone? farfox? adddones [15:40] That said, #outofammo? [15:41] Maybe #ExtenguishedFox? [15:41] #ExtinguishedFox* I did a thing that was more stupid than this name. [15:42] JAA: how did you get to #outofammo [15:42] #outofammo is pretty good. [15:42] AMO. [15:42] Ammo. [15:42] heh [15:43] Alright, #outofammo is open. [15:43] Not set up though. [15:45] Yay, I came up with a decent name. :-) [15:45] arkiver: Yeah, AMO -> ammo. [15:46] Or were you asking how the hell I came up with a decent name for once? :-P [15:48] *** chferfa has joined #archiveteam-bs [15:51] JAA: actually I didn´t realize firefox addons is known as AMO [15:52] Yeah, from *A*ddons.*M*ozilla.*O*rg. [15:52] yep [15:57] *** Arctic has quit IRC (Quit: Page closed) [16:01] *** Arctic has joined #archiveteam-bs [16:21] *** caff_ has joined #archiveteam-bs [16:39] *** jut has quit IRC (Read error: Operation timed out) [16:46] *** jut has joined #archiveteam-bs [17:20] *** Arctic has quit IRC (Quit: Page closed) [17:31] *** dashcloud has joined #archiveteam-bs [17:38] *** Arctic has joined #archiveteam-bs [17:43] *** dashcloud has quit IRC (Ping timeout: 480 seconds) [17:54] *** DragonMon has joined #archiveteam-bs [17:55] *** Arctic has quit IRC (Quit: Page closed) [18:32] *** underscor has quit IRC (Ping timeout: 268 seconds) [18:32] *** underscor has joined #archiveteam-bs [18:32] *** swebb sets mode: +o underscor [18:33] *** yuitimoth has quit IRC (Ping timeout: 268 seconds) [18:33] *** yuitimoth has joined #archiveteam-bs [18:34] *** Smiley has quit IRC (Remote host closed the connection) [18:34] *** tsr has quit IRC (Ping timeout: 268 seconds) [18:36] *** sknebel has quit IRC (Ping timeout: 268 seconds) [18:36] *** sknebel has joined #archiveteam-bs [18:36] *** MrRadar2 has quit IRC (Ping timeout: 268 seconds) [18:36] *** MrRadar2 has joined #archiveteam-bs [18:37] *** svchfoo3 sets mode: +o MrRadar2 [18:38] *** altlabel_ has quit IRC (Ping timeout: 268 seconds) [18:38] *** schbirid2 has joined #archiveteam-bs [18:39] *** altlabel_ has joined #archiveteam-bs [18:41] *** VoynichCr has quit IRC (Ping timeout: 268 seconds) [18:42] *** schbirid has quit IRC (Read error: Operation timed out) [18:42] *** Smiley has joined #archiveteam-bs [18:42] *** Frogging has quit IRC (Ping timeout: 268 seconds) [18:44] *** bsmith093 has quit IRC (Ping timeout: 268 seconds) [18:44] *** Frogging has joined #archiveteam-bs [18:45] *** bsmith093 has joined #archiveteam-bs [18:46] *** BnAboyZ has quit IRC (Ping timeout: 268 seconds) [18:47] *** tsr has joined #archiveteam-bs [18:48] *** BnAboyZ has joined #archiveteam-bs [18:49] *** kisspunch has quit IRC (Ping timeout: 268 seconds) [18:49] *** kisspunch has joined #archiveteam-bs [18:52] *** MrRadar2 has quit IRC (Ping timeout: 268 seconds) [18:55] *** adinbied has quit IRC (Ping timeout: 268 seconds) [18:56] *** BnAboyZ has quit IRC (Read error: Connection reset by peer) [18:57] *** adinbied has joined #archiveteam-bs [18:57] Anyone on Freenode having problems connecting? [18:58] *** BnAboyZ has joined #archiveteam-bs [18:58] *** MrRadar2 has joined #archiveteam-bs [18:58] *** VoynichCr has joined #archiveteam-bs [18:58] *** svchfoo3 sets mode: +o MrRadar2 [19:13] Jens: emerson (~emerson@freenode/staff/emerson): :[Global Notice] Services are going to be rebooted for maintenance now, apologies for the inconvenience. [19:25] *** schbirid2 has quit IRC (Remote host closed the connection) [19:33] *** C4K3 has quit IRC (leaving) [19:34] *** C4K3 has joined #archiveteam-bs [19:54] *** dashcloud has joined #archiveteam-bs [20:02] *** Arctic has joined #archiveteam-bs [20:03] *** odemg has quit IRC (Ping timeout: 260 seconds) [20:03] *** Arctic has quit IRC (Client Quit) [20:16] *** odemg has joined #archiveteam-bs [20:19] *** Stilett0 is now known as Stiletto [20:39] *** VADemon_m has joined #archiveteam-bs [20:52] I was reading this: https://en.wikipedia.org/wiki/Hubert_Dreyfus#Webcasting_philosophy [20:53] "[in] 2006, a recording of Dreyfus teaching a course called 'Man, God, and Society in Western Literature - From Gods to God and Back' rose to 58th most popular webcast on iTunes." [20:54] Wikipedia's links are broken, but this is one of the courses saved by Archive Team; it remains popular 10+ years later. https://archive.org/details/ucberkeley_webcast_itunesu_461120619 [20:55] No action needed, just good vibes. [21:06] *** jsa_ has quit IRC (Read error: Operation timed out) [21:08] *** thejsa has joined #archiveteam-bs [21:16] *** thejsa has quit IRC (Read error: Operation timed out) [21:17] *** actually_ has quit IRC (Ping timeout: 240 seconds) [21:19] *** thejsa has joined #archiveteam-bs [21:20] *** obskyr has joined #archiveteam-bs [21:51] *** eLbot has quit IRC (Read error: Operation timed out) [21:54] *** dashcloud has quit IRC (Ping timeout: 480 seconds) [21:57] *** eLbot has joined #archiveteam-bs [23:26] *** Smiley has quit IRC (Ping timeout: 268 seconds) [23:31] *** Smiley has joined #archiveteam-bs [23:56] *** JAA has quit IRC (Read error: Operation timed out) [23:56] *** jspiros has quit IRC (Read error: Operation timed out) [23:56] *** zyphlar has quit IRC (Read error: Operation timed out) [23:56] *** wabu has quit IRC (Read error: Operation timed out) [23:57] *** eLbot has quit IRC (Read error: Operation timed out) [23:57] *** Petri152 has quit IRC (Ping timeout: 246 seconds) [23:58] *** c4rc4s has quit IRC (Read error: Operation timed out)