[01:07] *** BlueMax has quit IRC (Leaving) [02:03] *** Petri152 has quit IRC (Read error: Operation timed out) [02:04] *** jspiros has quit IRC (Read error: Operation timed out) [02:04] *** zyphlar has quit IRC (Read error: Operation timed out) [02:05] *** JAA has quit IRC (Read error: Operation timed out) [02:06] *** wabu has quit IRC (Ping timeout: 246 seconds) [02:07] *** wp494 has quit IRC (Read error: Operation timed out) [02:09] *** wp494 has joined #archiveteam-bs [02:57] *** Mateon1 has quit IRC (Ping timeout: 252 seconds) [02:58] *** Mateon1 has joined #archiveteam-bs [03:03] *** Petri152 has joined #archiveteam-bs [03:03] *** zyphlar has joined #archiveteam-bs [03:04] *** JAA has joined #archiveteam-bs [03:04] *** swebb sets mode: +o JAA [03:04] *** bakJAA sets mode: +o JAA [03:08] *** JAA has quit IRC (Read error: Operation timed out) [03:08] *** Petri152 has quit IRC (Read error: Operation timed out) [03:08] *** zyphlar has quit IRC (Read error: Operation timed out) [03:25] *** BlueMax has joined #archiveteam-bs [03:38] *** archodg__ has joined #archiveteam-bs [03:41] *** odemg has quit IRC (Ping timeout: 268 seconds) [03:41] *** archodg_ has quit IRC (Read error: Operation timed out) [03:53] *** odemg has joined #archiveteam-bs [04:08] *** JAA has joined #archiveteam-bs [04:08] *** swebb sets mode: +o JAA [04:08] *** bakJAA sets mode: +o JAA [04:09] *** zyphlar has joined #archiveteam-bs [04:09] *** Petri152 has joined #archiveteam-bs [04:14] *** Petri152 has quit IRC (Read error: Operation timed out) [04:14] *** JAA has quit IRC (Read error: Operation timed out) [04:15] *** zyphlar has quit IRC (Read error: Operation timed out) [04:25] *** achip has quit IRC (west.us.hub irc.Prison.NET) [04:38] *** Mateon1 has quit IRC (Remote host closed the connection) [04:39] *** Mateon1 has joined #archiveteam-bs [04:58] *** achip has joined #archiveteam-bs [05:10] *** achip has quit IRC (west.us.hub irc.Prison.NET) [05:13] *** zyphlar has joined #archiveteam-bs [05:14] *** wabu has joined #archiveteam-bs [05:14] *** Petri152 has joined #archiveteam-bs [05:14] *** JAA has joined #archiveteam-bs [05:14] *** swebb sets mode: +o JAA [05:14] *** bakJAA sets mode: +o JAA [05:17] *** jspiros has joined #archiveteam-bs [05:24] *** achip has joined #archiveteam-bs [05:39] *** Petri152 has quit IRC (Read error: Operation timed out) [05:39] *** wabu has quit IRC (Ping timeout: 246 seconds) [05:40] *** JAA has quit IRC (Ping timeout: 246 seconds) [05:41] *** zyphlar has quit IRC (Ping timeout: 246 seconds) [05:41] *** jspiros has quit IRC (Read error: Operation timed out) [06:38] *** zyphlar has joined #archiveteam-bs [06:38] *** jrwr has quit IRC (Read error: Operation timed out) [06:38] *** jspiros has joined #archiveteam-bs [06:39] *** Petri152 has joined #archiveteam-bs [06:39] *** jrwr has joined #archiveteam-bs [06:39] *** JAA has joined #archiveteam-bs [06:39] *** swebb sets mode: +o JAA [06:39] *** bakJAA sets mode: +o JAA [06:39] *** wabu has joined #archiveteam-bs [06:46] *** jrwr has quit IRC (Read error: Operation timed out) [06:46] *** jrwr has joined #archiveteam-bs [06:49] *** jrwr has quit IRC (Read error: Operation timed out) [06:50] *** jrwr has joined #archiveteam-bs [06:53] *** jrwr has quit IRC (Read error: Operation timed out) [06:53] *** jrwr has joined #archiveteam-bs [06:59] *** jrwr has quit IRC (Read error: Operation timed out) [07:00] *** jrwr has joined #archiveteam-bs [08:36] *** wp494 has quit IRC (Read error: Operation timed out) [08:37] *** wp494 has joined #archiveteam-bs [10:10] anyone here has paid for a Memory of Mankind project tablet? [10:38] Can’t afford it [10:56] *** BlueMax has quit IRC (Leaving) [11:23] *** icedice has joined #archiveteam-bs [12:49] *** icedice has quit IRC (Ping timeout: 252 seconds) [12:52] *** ta9le has joined #archiveteam-bs [13:00] https://archive.org/details/international_201807 [13:00] https://archive.org/details/concert_20180724 [13:00] https://archive.org/details/national_201807 [13:00] Have 25 and a half hours of NZ public access radio [13:01] Flashfire: what would you write on it? [14:56] *** DFJustin has quit IRC (Ping timeout: 260 seconds) [14:59] *** Arctic has joined #archiveteam-bs [14:59] *** Arctic has quit IRC (Quit: Page closed) [15:00] *** schbirid has joined #archiveteam-bs [15:12] *** svchfoo3 has quit IRC (Read error: Operation timed out) [15:13] *** svchfoo3 has joined #archiveteam-bs [15:13] *** svchfoo1 sets mode: +o svchfoo3 [15:29] *** Soni has quit IRC (Ping timeout: 264 seconds) [15:38] *** DFJustin has joined #archiveteam-bs [15:38] *** swebb sets mode: +o DFJustin [15:46] *** Soni has joined #archiveteam-bs [16:04] *** SketchCo1 is now known as SketchCow [16:06] *** K4k_ has quit IRC (Ping timeout: 260 seconds) [16:17] eientei95: regarding radio stations [16:17] cool :) [16:18] From #findeck: Regarding sprunge, I looked at the source code. The ID generation seems... weird: https://github.com/rupa/sprunge/blob/master/sprunge.py#L18-L24 [16:19] It uses A-Za-z0-9 as the character set but only randints between 0 and 35, i.e. not the entire set is used. [16:19] eientei95: we got a project running at IA to record radio stations. [16:19] arkiver: Yeah, I heard [16:20] Python's random.randint(a, b) is exclusive of b as well, I believe, so it's only 35^4 = about 1.5 million combinations. [16:20] JAA: interesting [16:20] Ah no, it's inclusive. randrange is the exclusive one. 36^4 = 1679616 combinations then. [16:21] still, very few URLs relatively [16:21] So it limits it to ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghij [16:21] eientei95: out of interest, any specific reason you´re recording these stations? [16:21] Oh, I was just recording them for ~24h to see if it was possible [16:22] Yep, A-Za-j. [16:22] Why are we talking about Sprunge again? [16:23] iirc someone was asking about archiving it [16:24] I think https://github.com/rupa/sprunge/issues/45#issuecomment-394846803 [16:24] not sure if there´s anything else [16:24] Ah, right. [16:33] eientei95: are you planning to do more recording? [16:34] I can, just not sure of a better way of recording than running wget in screen on my laptop [16:34] *** K4k has joined #archiveteam-bs [17:24] *** PurpleSym has quit IRC (Quit: *) [17:24] *** PurpleSym has joined #archiveteam-bs [17:49] *** slyphic has quit IRC (Remote host closed the connection) [18:01] *** SmileyG has joined #archiveteam-bs [18:03] *** Smiley has quit IRC (Ping timeout: 268 seconds) [18:12] The radio archive project is at https://archive.org/details/radio-archive [18:12] eientei95 ^ [18:13] Most recordings are not publicly available, but that might change soon [18:15] Oh, nice. [18:15] I hope it does. :-) [18:15] This is right now the only one that´s fully publicly available https://archive.org/details/Radio-VOA-Global-English [18:15] JAA: yes :) [18:16] We have a dashboard too, at http://researcher7.fnf.archive.org:30001/ [18:19] Neat [18:24] arkiver: Where/how can I suggest additional stations? I see that there's a lack of Swiss stations. [18:25] Oh, I'm happy to know that there is https://archive.org/details/Radio-WFMU-91-1-FM. They host their own archives but they purge the high-quality MP3s after a few weeks. [18:26] JAA: You can let me know, I might add some forms to the dashboard to take suggestions and improvements too [18:27] Alright, I'll compile a list sometime. :-) [18:27] *** JAA sets mode: +o arkiver [18:27] I´m currently going through every country in Europe, just done with Austria, next up is Belarus [18:27] Ah, I see. [18:27] we could make a channel, maybe #radio-archive ? [18:27] Speaking of radio archiving, there are at least a couple of onion services (Tor hidden services) with what look like 24/7 recordings of some Spanish-language radio stations. [18:27] That's why you threw those two radio map sites into ArchiveBot the other day. [18:28] yep, they are pretty great resources [18:28] lots of metadata and logos [18:28] http://www.g-radio.org/?page_id=9 leads to http://www.g-radio.org/wp-content/uploads/2015/09/grr-1.png and reading the URL from the screenshot leads to http://rbksxf6tw7gctk3a.onion/grabaciones.html (grabaciones = recordings) [18:28] Let´s discuss radio things at #radio-archive [18:30] *** jschwart has joined #archiveteam-bs [19:16] *** underscor has quit IRC (Remote host closed the connection) [19:17] *** underscor has joined #archiveteam-bs [19:17] *** swebb sets mode: +o underscor [19:18] *** underscor has quit IRC (Remote host closed the connection) [19:19] *** underscor has joined #archiveteam-bs [19:19] *** swebb sets mode: +o underscor [19:36] This Chromebot thing... [19:39] https://archive.org/details/archiveteam_chromebot [19:39] I've moved things to it, now, and they'll show up [19:39] But this project somehow missed things [19:42] Like... what is it, even [19:44] PurpleSym: ^ [19:45] i think it's an archivebot workalike that uses a captive chrome browser [19:45] much like --phantomjs used to do [19:46] Yep. One thing worth noting is that the WARCs do not contain the exact data as sent by the server because Chromium's APIs don't expose that. Transfer encoding is stripped, and headers are normalised, for example. [19:47] Source code's here: https://github.com/PromyLOPh/crocoite [19:53] *** Stilett0 has quit IRC (Read error: Operation timed out) [19:57] *** Stilett0 has joined #archiveteam-bs [20:02] https://archive.org/details/archiveteam_chromebot [20:35] I mean overall, thats not a huge deal, encoding might be, but the headers should be too big of a issue [20:53] Well, since WARCs are all about preserving the data sent by the server as accurately as possible, I think it's important to at least document it prominently. [20:56] yeah [20:57] maybe we should change mediatype for these to data so they don´t go in the wayback machine (for now) [21:15] *** TC01 has quit IRC (Read error: Operation timed out) [21:18] *** TC01 has joined #archiveteam-bs [21:34] *** dashcloud has quit IRC (Remote host closed the connection) [21:35] *** dashcloud has joined #archiveteam-bs [22:56] *** jschwart has quit IRC (Quit: Konversation terminated!) [23:15] *** BlueMax has joined #archiveteam-bs [23:47] *** m007a83 has quit IRC (Quit: Leaving)