[00:01] I know one way to find out [00:04] *** fenn has quit IRC (Read error: Connection reset by peer) [00:07] [20:30:25] It's just 7z l, so it's not too expensive [00:07] [20:43:52] results are cached for 24h after last request too though [00:07] [20:43:59] so it's not as expensive if it gets popular [00:07] [20:45:28] we want it to be a first class citizen! [00:13] ah, so that 24h is why my tests were so slow [00:13] thanks [00:15] the most effective way to get IA to fix a bug is to give them a few hundred items that exhibit the problem [00:16] the devs there prioritize actual pain points [00:18] a bit like massage therapists [00:38] *** lytv has quit IRC (Max SendQ exceeded) [00:42] *** lytv has joined #archiveteam-bs [00:49] *** cbb2 has quit IRC (Read error: Operation timed out) [01:14] *** fenn has joined #archiveteam-bs [01:18] *** nico_32_ is now known as nico_32 [01:37] lelandbat: how about animated webp? [01:37] plus webpjs or something such [02:49] *** Jonimus has quit IRC (Read error: Operation timed out) [03:07] *** xmc is now known as chronomex [03:08] *** chronomex is now known as xmc [03:10] *** schbirid has quit IRC (Read error: Operation timed out) [03:10] *** schbirid has joined #archiveteam-bs [03:13] *** primus104 has quit IRC (Leaving.) [03:14] *** mistym has quit IRC (Remote host closed the connection) [04:06] https://www.youtube.com/watch?v=ATUANyWfqFM [04:06] http://www.bbc.co.uk/taster/projects/story-of-now [04:06] it's region-locked to the UK [04:06] and will apparently be removed in 2 months [04:06] and presumably "interactive" [04:06] can somebody work on grabbing this? [04:11] *** Jonimus has joined #archiveteam-bs [04:36] *** lelandbat has quit IRC (Quit: http://chat.efnet.org (Session timeout)) [04:40] ""DHS Might Shut Down on Friday: Should You Be Worried?"" heh [04:40] omgwereallgonnadie [04:42] who is DHS? [04:44] the agency in the usa that harasses people who are trying to get on a goddamn airplane [04:46] You guys should just get them all drunk [04:46] if they're passed out drunk then they can't harrass you [05:07] could a few people check if https://mysp.ac/11Ti works and what country? [05:09] redirects me to www.empirewatches.co.uk [05:09] i'm in the usa [05:10] northwest, aka seattle [05:10] western aust on firefox and windows 7, redirects to http://www.empirewatches.co.uk// [05:12] thanks. i think their load balancer must be broken then [05:12] USA, error page [05:18] *** aaaaaaaaa has quit IRC (Leaving) [05:22] i'm grabbing old BBC urls [05:22] they still have them and we don't [05:22] like this one: http://news.bbc.co.uk/2/hi/europe/536017.stm [05:23] i'm grabing the files first so i can make a index [05:23] the index will then be used for to make a web archive [05:35] so i found something interesting [05:35] it will at least make my job easier [05:35] all the number urls are the same with bbc [05:36] for example [05:36] http://news.bbc.co.uk/2/hi/europe/536017.stm and http://news.bbc.co.uk/2/hi/uk/536017.stm get you the same page [05:37] looks like http://news.bbc.co.uk/2/hi/536017.stm give you the same page [05:38] *** schbirid has quit IRC (Read error: Operation timed out) [05:43] just for people to know the number dates are all over the place [05:53] *** schbirid has joined #archiveteam-bs [06:03] *** mistym has joined #archiveteam-bs [06:10] fuckssake [06:10] so I assume everybody heard about IS destroying a museum and library? [06:10] bastards [06:11] can we do anything to stop this? [06:11] asswipes indeed [06:11] i'd put a tenner towards a napalm bombing run [06:12] or can we convince the middle eastern museums to take up more digitization? [06:12] some digitization will be just to document the item [06:13] like get every page of every book scanned [06:13] i would like that [06:14] how would we go about getting that started? [06:14] hold a fundraiser and get our arab members to do it? [06:14] one way is to get the stuff out of country [06:14] dunno about the middle east, there's one for timbuktu where they're also dealing with extremist fucksticks https://t160k.org/campaign/libraries-in-exile/ [06:16] can we send book scanners to these libraries? [06:16] ask librarians to scan rarest books first? [06:17] also some amount of manuscripts were similarly spirited out of mosul http://www.ncregister.com/daily-news/iraqi-priests-protect-historic-christian-writings-from-islamic-state/ [06:17] These books should have been copied years ago when they were discovered [06:18] it's a tremendously big job [06:18] and the technology to do it well hasn't existed for very long [06:20] :I well we need to get something going to start [06:20] how can i help out? [06:20] even big well-funded libraries in western countries are still working through their manuscript collections [06:21] just a thought [06:21] with the digital we should do high res prints of the digital on acid free paper [06:22] or something [06:22] cheaper to colocate isn't it? [06:23] the acid free paper thing is for stuff thats turning into dust [06:23] or has some sort of mold on it [06:23] like, the first timbuktu fundraiser was just to buy proper boxes to keep all the stuff from exposure to the elements [06:23] let alone buying cameras and computers and paying people to go through it all page by page [06:25] https://soundcloud.com/glennbeck/glenn-beck-presents-armageddon-the-rise-of-the-caliphate-22715 [06:26] he said something not crazy? [06:26] i think glenn tries to explains why they burn the libraries [06:26] now they're cataloguing it all because we don't even know which are the most rare ones when they're all just in unmarked piles [06:27] so I want it all digitized yesterday too but it's totally understandable [06:29] *** Control-S has joined #archiveteam-bs [06:30] I think iraqi and syrian christians are way more worried about not being shot and thrown in a ditch at this point so I don't know when there will be a project to contribute to [06:32] muslims like archiving too right? [06:33] i never really bothered with what religions like what [06:33] they certainly can, the new library of alexandria has done some good stuff lately [06:34] *** Ctrl-S has quit IRC (Read error: Operation timed out) [06:34] *** Control-S is now known as Ctrl-S [06:35] Ctrl-S: just know the caliphate was something glenn talk about in 2009 [06:35] then in 2014 every news outlet start talking about [06:35] he as also the one on NSA storing everything [06:36] if only the NSA had an archivist [06:37] alexandria reference http://laughingsquid.com/digital-amnesia-a-documentary-about-the-limited-shelf-life-of-digital-data/ [06:41] https://archive.org/details/tv?q=Caliphate [06:53] unrelated note: blogger is reverting their "no more nudity" policy [06:53] so we'll probably have a year of leeway before they try it again [06:53] :P [06:54] or it just happens without warning [06:54] or that [06:54] we shoudlarchive it all ASAP [06:55] put up instructions on how to save a blog pls, like what commands to run so that us idiots can do it too [08:04] a article that glenn says is from a left-wing magazine: http://www.theatlantic.com/features/archive/2015/02/what-isis-really-wants/384980/ [08:05] http://www.theatlantic.com/international/archive/2015/02/what-isis-really-wants-reader-response-atlantic/385710/ [08:35] http://techcrunch.com/2014/12/15/how-to-speak-startup/ [08:35] lol [08:44] *** primus104 has joined #archiveteam-bs [09:55] *** mst_ has joined #archiveteam-bs [10:07] *** mst_ has quit IRC (Quit: bye) [10:44] *** garyrh has quit IRC (hub.se irc.ac.za) [10:44] *** useretail has quit IRC (hub.se irc.ac.za) [10:47] *** marvinw_ has quit IRC (Read error: Operation timed out) [10:49] *** marvinw has joined #archiveteam-bs [10:51] *** S[h]O[r]T has quit IRC (Read error: Operation timed out) [10:54] *** useretai- has joined #archiveteam-bs [11:07] *** mistym has quit IRC (Remote host closed the connection) [11:15] *** schbirid has quit IRC (Read error: Operation timed out) [11:16] *** schbirid has joined #archiveteam-bs [12:08] *** primus104 has quit IRC (Leaving.) [12:19] *** Jonimus has quit IRC (Read error: Operation timed out) [12:44] *** underscor has quit IRC (Ping timeout: 370 seconds) [12:45] *** underscor has joined #archiveteam-bs [12:45] *** swebb sets mode: +o underscor [13:04] *** mistym has joined #archiveteam-bs [13:13] *** BlueMaxim has quit IRC (Read error: Connection reset by peer) [13:16] *** mistym has quit IRC (Read error: Operation timed out) [14:34] *** dashcloud has quit IRC (Remote host closed the connection) [14:37] *** dashcloud has joined #archiveteam-bs [15:40] *** primus104 has joined #archiveteam-bs [16:19] *** garyrh has joined #archiveteam-bs [16:23] *** primus104 has quit IRC (Leaving.) [16:29] *** garyrh_ has quit IRC (Remote host closed the connection) [16:31] *** garyrh_ has joined #archiveteam-bs [17:18] http://www.michaelgeist.ca/2015/02/rogers-executive-calls-canadian-government-shut-vpns/ [17:20] *** primus104 has joined #archiveteam-bs [17:30] *** Coderjoe has quit IRC (Read error: Connection reset by peer) [17:30] *** Coderjoe has joined #archiveteam-bs [17:54] shopping for a TLS cert CA is hard [17:55] you need to not only know prices but whether or not they're in most people's trusted roots [17:56] Let's Encrypt can't come online fast enough [17:57] how does it work if you're using Cloudflare? they're offering certs now- so do you get one for your site and one from cloudflare for hosting there? [17:58] CloudFlare might work but I don't want to try putting Jenkins behind it [17:58] too many POSTs [18:00] dashcloud: unless that sort of thing is known to work, heh [18:02] well, it's free, might as well try it out [18:03] yipdw: try wosign or startssl, both are free and trusted [18:04] mhazinsk: haven't heard of wosign before, will check that out [18:10] they allow 100 hosts per cert, which is amazing fo a free CA [18:38] *** primus104 has quit IRC (Leaving.) [18:44] *** Jonimus has joined #archiveteam-bs [19:09] *** mistym has joined #archiveteam-bs [19:29] *** S[h]O[r]T has joined #archiveteam-bs [19:29] *** GLaDOS has joined #archiveteam-bs [19:29] *** Muad-Dib has joined #archiveteam-bs [19:29] *** Danneh__ has joined #archiveteam-bs [19:29] *** Rickster has joined #archiveteam-bs [19:29] *** deathy has joined #archiveteam-bs [20:13] did anyone happen to grab computerandvideogames.com? [20:15] *** espes___ has quit IRC (Ping timeout: 265 seconds) [20:15] *** chazchaz has quit IRC (Read error: Operation timed out) [20:17] Looks like it: http://archive.fart.website/archivebot/viewer/?q=computerandvideogames.com [20:21] *** chazchaz has joined #archiveteam-bs [20:27] ok good that it grabbed something recent [20:27] as the site just went down [20:28] *** chazchaz has quit IRC (Read error: Operation timed out) [20:28] as in terminated as they are consolidating the content into gamesradar [20:34] *** chazchaz has joined #archiveteam-bs [20:38] *** espes__ has joined #archiveteam-bs [20:48] *** BlueMaxim has joined #archiveteam-bs [20:50] *** primus104 has joined #archiveteam-bs [21:11] https://i.imgur.com/JRjdoVf.gif [21:32] i'm uploading The Web Ahead podcast [21:32] all of them [21:32] https://archive.org/details/The_Web_Ahead_1 [22:21] *** yan has joined #archiveteam-bs [22:47] *** kyan has joined #archiveteam-bs [23:00] *** Rotab has joined #archiveteam-bs [23:33] *** yan has quit IRC (Quit: leaving) [23:54] i'm downloading the 1xxxx urls of bbc news [23:54] these got back to 1998/1999 [23:54] *go back [23:54] its also grabbing every image on the pages too [23:55] i take that back [23:56] the urls are from 1997