[00:01] *** DiscantX has joined #archiveteam-bs [00:19] *** DoomTay has joined #archiveteam-bs [00:20] *** JesseW has quit IRC (Ping timeout: 370 seconds) [00:24] *** Sum has quit IRC (Ping timeout: 370 seconds) [00:25] *** Sum has joined #archiveteam-bs [00:40] *** rsanek has joined #archiveteam-bs [00:40] WHAT FORSOOTH, PRITHEE TELL ME THE SECRET WORD [00:41] *** rsanek has left [00:46] *** Sue_ has quit IRC (Read error: Operation timed out) [00:58] *** JesseW has joined #archiveteam-bs [01:01] *** BlueMaxim has joined #archiveteam-bs [01:17] *** JesseW has quit IRC (Quit: Leaving.) [01:17] *** JesseW has joined #archiveteam-bs [01:36] *** DiscantX has quit IRC (Ping timeout: 244 seconds) [02:00] *** Sum has quit IRC (Ping timeout: 370 seconds) [02:01] *** Sum has joined #archiveteam-bs [02:13] *** DiscantX has joined #archiveteam-bs [02:20] *** DiscantX has quit IRC (Ping timeout: 244 seconds) [02:39] *** Sum has quit IRC (Ping timeout: 370 seconds) [02:40] *** Sum has joined #archiveteam-bs [02:55] *** DiscantX has joined #archiveteam-bs [03:01] *** Sum has quit IRC (Ping timeout: 370 seconds) [03:02] *** Sum has joined #archiveteam-bs [03:04] *** DiscantX has quit IRC (Ping timeout: 244 seconds) [03:12] *** ravetcofx has quit IRC (Ping timeout: 506 seconds) [03:20] *** ravetcofx has joined #archiveteam-bs [03:22] *** Coderjoe has quit IRC (Read error: Connection reset by peer) [03:30] *** Coderjoe has joined #archiveteam-bs [04:06] *** Sum has quit IRC (Ping timeout: 370 seconds) [04:07] *** Sum has joined #archiveteam-bs [04:15] *** Sum has quit IRC (Ping timeout: 370 seconds) [04:16] *** Sum has joined #archiveteam-bs [04:24] *** RichardG has quit IRC (Ping timeout: 258 seconds) [04:28] *** ravetcofx has quit IRC (Ping timeout: 506 seconds) [04:38] *** Sum has quit IRC (Ping timeout: 370 seconds) [04:39] *** Sum has joined #archiveteam-bs [04:42] *** ravetcofx has joined #archiveteam-bs [04:54] *** ravetcofx has quit IRC (Read error: Operation timed out) [05:00] *** Sk1d has quit IRC (Ping timeout: 194 seconds) [05:00] well freenode just fucking fell apart [05:01] what happened? [05:01] [01:00:51] -- There are 1 users and 211 invisible on 1 servers [05:01] haha [05:02] ah, in a technical sense -- good. I was worried it was in a organization/social sense. [05:02] yeah [05:02] though I am a bit concerned about the latter, generally [05:03] they've had an internal schism of sorts over the past year [05:03] a bunch of people quit [05:03] yeah, that was what I heard [05:03] christel even stepped down [05:03] though she reversed that soon after [05:04] *** metalcamp has joined #archiveteam-bs [05:06] I know very very little about freenode interpersonal politics. I just appreciate the work they do [05:06] same, for the most part [05:06] *** ravetcofx has joined #archiveteam-bs [05:06] *** Sk1d has joined #archiveteam-bs [05:19] *** metal_cam has joined #archiveteam-bs [05:20] *** metalcamp has quit IRC (Ping timeout: 244 seconds) [05:21] *** ndiddy has quit IRC (Quit: Leaving) [05:44] *** Jeroen52 has quit IRC (Ping timeout: 260 seconds) [05:47] *** Sum has quit IRC (Ping timeout: 370 seconds) [05:48] *** Jeroen52 has joined #archiveteam-bs [05:52] *** tomwsmf-a has joined #archiveteam-bs [06:00] *** JesseW has quit IRC (Ping timeout: 370 seconds) [06:38] *** DoomTay has quit IRC (Quit: Page closed) [06:56] freenode still broken it looks like [07:00] *** anjacks0n has joined #archiveteam-bs [07:30] *** anjacks0n has quit IRC (anjacks0n) [07:33] *** ravetcofx has quit IRC (Read error: Operation timed out) [07:43] *** ravetcofx has joined #archiveteam-bs [07:48] *** Sum has joined #archiveteam-bs [07:53] *** tomwsmf-a has quit IRC (Read error: Operation timed out) [07:54] *** anjacks0n has joined #archiveteam-bs [07:57] *** anjacks0n has quit IRC (anjacks0n) [08:04] *** ravetcofx has quit IRC (Read error: Operation timed out) [08:17] *** ravetcofx has joined #archiveteam-bs [08:36] *** ravetcofx has quit IRC (Remote host closed the connection) [09:04] *** robink has quit IRC (Ping timeout: 633 seconds) [09:13] *** robink has joined #archiveteam-bs [10:26] *** whydomain has joined #archiveteam-bs [10:29] i'll link some of my findings that may help with google code: [10:29] https://github.com/icy/google-group-crawler [10:31] https://github.com/henryk/gggd (uses python last commit 11 months ago) [10:32] https://github.com/ssimpo/google-groups [10:32] https://github.com/bojieli/GoogleGroup-Archiver [10:33] Hi, Does anyone have any experience with EasyCap USB video capture cards for archiving VHS tapes? [10:33] I know they're cheap and are low quality, but VHS is also low quality so will it archive VHS in the highest quality that VHS can output? [10:33] (i.e: is there any advantage to using a more expensive HD capture card to capture non-HD VHS?) [10:33] https://github.com/himukr/google-grp-scraper [10:34] https://github.com/apowers313/google-groups-scraper [10:35] https://github.com/nruth/google-apps-groups-scraper (uses Page-scraping with Firefox) [10:37] https://github.com/hebelken/Google-Group-Scraper (scrapes RSS feed) [10:37] https://github.com/jrholliday/gg-scrape [10:42] https://github.com/clehner/ggscrape [10:45] https://github.com/clehner/gg_scraper [10:48] *** kristian_ has joined #archiveteam-bs [10:48] That's ll i can find of interest on github [10:50] 70 pages of results :P [10:51] the web UI for Google Groups certainly isn't pleasant [10:52] yeah [10:53] None of these is going to work for us, luckcolor. They all use either the RSS feeds (which are limited to 50? entries), a headless browser or parse the HTML. [10:53] sigh [10:53] There was a gist of a python script using GWT. [10:53] mmh [10:53] also is there a irc channel for the project? [10:54] I don’t think there’s a project yet. [10:54] ok [11:01] Yeah, looks like it is screen scraping /forum/message/raw [11:02] i'll re-investigate the results [11:11] https://github.com/henryk/gggd https://github.com/jrholliday/gg-scrape https://github.com/clehner/ggscrape are looking promising [11:11] also it seems that most people when making those scripts like MBOX format [11:11] * luckcolor googles MBOX [11:15] *** whydomain has quit IRC (http://www.kiwiirc.com/ - A hand crafted IRC client) [11:57] *** signius has quit IRC (Ping timeout: 260 seconds) [12:11] *** signius has joined #archiveteam-bs [12:18] *** Sum has quit IRC (Quit: Leaving) [12:56] *** anjacks0n has joined #archiveteam-bs [13:07] *** anjacks0n has quit IRC (anjacks0n) [13:14] *** dashcloud has quit IRC (Read error: Connection reset by peer) [13:15] *** dashcloud has joined #archiveteam-bs [13:52] *** anjacks0n has joined #archiveteam-bs [13:57] *** anjacks0n has quit IRC (anjacks0n) [14:07] *** VADemon has joined #archiveteam-bs [14:28] *** ndiddy has joined #archiveteam-bs [14:53] *** anjacks0n has joined #archiveteam-bs [15:14] *** BlueMaxim has quit IRC (Quit: Leaving) [15:17] *** JesseW has joined #archiveteam-bs [15:18] *** RichardG has joined #archiveteam-bs [15:24] luckcolor: This is the shortest script I could come up to fetch all messages of *known* Google Group: https://6xq.net/paste/forbyogi.html [15:25] Discovery is a different story though. [15:52] *** ravetcofx has joined #archiveteam-bs [15:52] *** RichardG has quit IRC (Read error: Operation timed out) [15:53] *** RichardG has joined #archiveteam-bs [16:00] *** anjacks0n has quit IRC (anjacks0n) [16:01] *** JesseW has quit IRC (Ping timeout: 370 seconds) [16:25] *** anjacks0n has joined #archiveteam-bs [16:36] *** anjacks0n has quit IRC (anjacks0n) [16:48] *** DoomTay has joined #archiveteam-bs [17:04] *** VADemon has quit IRC (Quit: left4dead) [17:05] *** kristian_ has quit IRC (Leaving) [17:29] *** tomwsmf-a has joined #archiveteam-bs [17:29] *** schbirid has joined #archiveteam-bs [17:38] *** anjacks0n has joined #archiveteam-bs [17:52] *** anjacks0n has quit IRC (anjacks0n) [17:59] *** anjacks0n has joined #archiveteam-bs [18:36] *** VADemon has joined #archiveteam-bs [18:53] *** DiscantX has joined #archiveteam-bs [19:00] *** DiscantX has quit IRC (Ping timeout: 244 seconds) [19:01] *** JesseW has joined #archiveteam-bs [19:14] *** JesseW has quit IRC (Ping timeout: 370 seconds) [19:16] *** DiscantX has joined #archiveteam-bs [19:28] *** dashcloud has quit IRC (Read error: Operation timed out) [19:29] so [19:29] I ran into a variant of Browlock (fake ransomware that runs in browser) [19:29] that -successfully- prevents a Chrome tab from being closed [19:29] with some bizarre confirmation dialog / redirect hack [19:29] that also slides right past the Safe Browsing warning screens [19:33] *** dashcloud has joined #archiveteam-bs [19:36] *** REiN^ has quit IRC () [19:51] *** tomwsmf-a has quit IRC (Read error: Operation timed out) [19:52] i'm at 757k items now [20:06] *** metal_cam has quit IRC (Ping timeout: 250 seconds) [20:07] *** metalcamp has joined #archiveteam-bs [20:10] Yikes [20:12] *** schbirid has quit IRC (Quit: Leaving) [20:15] i'm going to start going after washingtonpost.com [20:16] turns out they have xml archives of old news going back to 1977 [20:17] there maybe over 1600k urls i can grab [20:29] *** DiscantX has quit IRC (Ping timeout: 244 seconds) [20:31] *** DoomTay has quit IRC (Quit: Page closed) [20:39] *** xXx_ndidd has joined #archiveteam-bs [20:40] *** ndiddy has quit IRC (Ping timeout: 244 seconds) [21:02] *** REiN^ has joined #archiveteam-bs [21:27] *** JesseW has joined #archiveteam-bs [21:27] *** metalcamp has quit IRC (Ping timeout: 244 seconds) [21:36] *** VADemon has quit IRC (Quit: left4dead) [21:57] *** Start_ has joined #archiveteam-bs [21:57] *** Start has quit IRC (Read error: Connection reset by peer) [22:30] *** dashcloud has quit IRC (Read error: Connection reset by peer) [22:32] *** dashcloud has joined #archiveteam-bs [23:22] *** DoomTay has joined #archiveteam-bs [23:36] *** divingk has joined #archiveteam-bs [23:37] I hadn't heard about source code accidentally left in games before -- but I don't focus much in that area. [23:37] I don't think many people here have, [23:37] since video games are not something normally mined through [23:38] I wouldn't be surprised if some of the people more focused on old games have. [23:38] *** RichardG has quit IRC (Read error: Connection reset by peer) [23:38] and it's quite tough to find these normally without having a giant ROM archive or two laying around. [23:38] Well, we have that. :-) [23:38] *** RichardG has joined #archiveteam-bs [23:38] (previous conversation: http://archive.fart.website/bin/irclogger_log/archiveteam?date=2016-07-10,Sun&sel=180#l176 [23:39] Since many of these either have missing or lost source code, [23:39] and seeing that loads of games are missing on TCRF, [23:40] that also factored into me digging through these games. [23:40] I've even found a couple of tech documents in the process. [23:40] I know Machine Lighting for the ZX Spectrum has a bunch of macro definitions, [23:41] plus another game has part of a manual for some debugging thing. [23:42] I'd also like to ask how many other people have done this, [23:43] since I know that there are many more games that could be added. [23:43] Well, if anyone has, they'll likely speak up eventually. You can check the logs tomorrow and see. [23:45] I'll probably check in during the weekends. [23:47] *** BlueMaxim has joined #archiveteam-bs [23:47] so i have uploaded over 43k nasa docs [23:48] Oh, yes, before I forget... [23:49] Zeppelin Games. [23:49] There are several master disks of their games, [23:49] two of which have a lot of code. [23:49] Kick Box Vigilante for the ZX Spectrum [23:49] and Mazie for the Amstrad CPC. [23:52] You will need a hex editor to see these.