[03:52] *** thuban2 has joined #archiveteam-bs [03:55] *** thuban1 has quit IRC (Ping timeout: 258 seconds) [04:01] *** DogsRNice has quit IRC (Read error: Connection reset by peer) [04:27] *** odemgi has joined #archiveteam-bs [04:30] *** odemgi_ has quit IRC (Read error: Operation timed out) [04:31] *** qw3rty_ has joined #archiveteam-bs [04:34] *** BlueMaxim has joined #archiveteam-bs [04:39] *** qw3rty__ has quit IRC (Read error: Operation timed out) [04:47] *** BlueMax has quit IRC (Ping timeout: 745 seconds) [05:15] *** nicolas17 has quit IRC (Ping timeout: 360 seconds) [07:43] *** HP_Archiv has joined #archiveteam-bs [07:43] *** HP_Archiv has quit IRC (Client Quit) [07:56] *** Ctrl has quit IRC (Read error: Operation timed out) [08:01] *** HP_Archiv has joined #archiveteam-bs [08:01] *** HP_Archiv has quit IRC (Read error: Connection reset by peer) [08:34] *** luckcolor has quit IRC (Read error: Operation timed out) [08:44] *** luckcolor has joined #archiveteam-bs [09:34] *** Stiletto has quit IRC () [09:44] *** Stiletto has joined #archiveteam-bs [09:50] *** SmileyG has joined #archiveteam-bs [09:51] *** Smiley has quit IRC (Read error: Operation timed out) [09:55] *** Stiletto has quit IRC () [10:01] *** BlueMaxim has quit IRC (Read error: Connection reset by peer) [10:01] *** Stiletto has joined #archiveteam-bs [10:01] *** Stiletto has quit IRC (Read error: Connection reset by peer) [10:05] *** Stiletto has joined #archiveteam-bs [10:35] *** bsmith093 has quit IRC (Ping timeout: 255 seconds) [10:43] *** bsmith093 has joined #archiveteam-bs [10:46] *** foureyes_ has joined #archiveteam-bs [10:46] *** pew has quit IRC (Ping timeout: 276 seconds) [10:46] *** purplebot has quit IRC (Ping timeout: 276 seconds) [10:47] *** foureyes has quit IRC (Ping timeout: 276 seconds) [10:49] *** purplebot has joined #archiveteam-bs [10:58] *** pew has joined #archiveteam-bs [11:53] *** pew has quit IRC (Ping timeout: 276 seconds) [12:25] *** pew has joined #archiveteam-bs [14:13] *** Dallas has joined #archiveteam-bs [14:28] Hey I have wpull installed from https://pypi.org/project/wpull/ but it doesn't seem to have the phantomjs switches available https://wpull.readthedocs.io/en/master/usage.html#phantomjs-integration do I have to like build it with the support activated ? [14:43] Dallas: Firstly, you should either explicitly install version 1.2.3 or install from the git repository. The latest version on PyPI (2.0.1) is pretty much unusable. The most serious bugs are fixed on git (v2.0.3), but that's never been uploaded to PyPI. [14:43] Not sure why you're not seeing the PhantomJS options though; nothing special should be required for that. [14:44] Huh let me try installing 1.2.3 now then ty [14:44] (As in, wpull doesn't only make them available if you have phantomjs in your PATH, for example. They're always there.) [14:50] okay it's working woo , idk why it wasn't before :/ [14:54] *** DogsRNice has joined #archiveteam-bs [15:09] *** nicolas17 has joined #archiveteam-bs [15:19] *** Mayonaise has quit IRC (Read error: Operation timed out) [15:56] DogsRNice: Yeah, looks like it should work. No JS, captchas, or similar bullshit involved. [15:56] its a site that hosts custom huds and hitsounds for tf2 as well as a small forum [15:56] ah good [15:56] apparently its been arounf for 6 years [16:00] *** brayden has quit IRC (Read error: Operation timed out) [16:00] Soo, I have a list of ~149k galeon.com subdomains. That sounds fun. [16:01] You still have that dns data I gave you right? [16:01] Don't think so, but I could redownload it obviously (even if not the newest dataset). If you want to search through that, feel free! [16:02] Unfortunately, the two most powerful AB pipelines are blocked from Galeon, so not sure we'll be able to archive this in time. [16:05] *** brayden has joined #archiveteam-bs [16:10] what a good time for steam to be down right as a site with lots of links to it gets archived [16:20] *** alex73 has joined #archiveteam-bs [16:35] *** underscor has quit IRC (Read error: Connection reset by peer) [16:36] *** underscor has joined #archiveteam-bs [16:37] *** Raccoon has quit IRC (Ping timeout: 360 seconds) [16:37] *** Raccoon has joined #archiveteam-bs [16:37] *** Yurume has joined #archiveteam-bs [16:37] *** equant has quit IRC (Read error: Connection reset by peer) [16:43] *** ShellyRol has quit IRC (Read error: Operation timed out) [16:43] *** marti has joined #archiveteam-bs [16:43] *** ShellyRol has joined #archiveteam-bs [16:43] *** Yurume_ has quit IRC (Read error: Operation timed out) [16:44] *** equant has joined #archiveteam-bs [16:50] *** wyatt8740 has quit IRC (Ping timeout: 246 seconds) [16:53] *** steveastr has joined #archiveteam-bs [16:54] *** steveastr has quit IRC (Client Quit) [17:05] I'm impressed. Galeon is still running an HTTP 0.9 server for some subdomains. WTF? [17:05] E.g. http://animalesbellos.galeon.com/ [17:06] *** godane has quit IRC (Ping timeout: 258 seconds) [17:11] Do we have any tools capable of archiving HTTP 0.9? wpull clearly doesn't like it. [17:21] wget seems to handle it fine, but I wonder what IA/the WBM does with that then. [17:22] *** godane has joined #archiveteam-bs [17:23] ... but it's only HTTP 0.9 on the homepage. Goddammit. [17:24] JAA: did you use sublist3r to get all the domains? [17:24] No [17:24] what did you use? [17:25] have you ever heard of sublist3r? it uses multiple methods to find the subdomains [17:25] The site's sitemap plus a little directory I found at http://galeon.com/todostodos/ [17:25] It's much more efficient to use available lists than bruteforcing. [17:32] *** Raccoon` has joined #archiveteam-bs [17:34] *** Raccoon has quit IRC (Ping timeout: 276 seconds) [17:34] *** Raccoon` is now known as Raccoon [17:51] oh [17:54] *** Datechnom has quit IRC (Quit: Ping timeout (120 seconds)) [17:55] *** slyphic has quit IRC (Read error: Operation timed out) [17:55] *** slyphic has joined #archiveteam-bs [17:55] *** Datechnom has joined #archiveteam-bs [18:04] tldr: dont worry about warcraft 3 maps. they arent gone, but the support for them in the new official client is... quite broken atm [18:16] *** Cheryl has joined #archiveteam-bs [18:17] After you archive my group's data, who can access it? [18:39] ...And how will I be able to access it? [19:09] *** TC01_ has joined #archiveteam-bs [19:15] *** TC01 has quit IRC (Ping timeout: 745 seconds) [19:35] *** godane has quit IRC (Read error: Connection reset by peer) [19:40] *** alex73 has quit IRC (Quit: Connection closed for inactivity) [19:56] *** BlueMax has joined #archiveteam-bs [20:08] *** TC01 has joined #archiveteam-bs [20:16] *** TC01_ has quit IRC (Ping timeout: 745 seconds) [20:21] *** thuban3 has joined #archiveteam-bs [20:22] *** thuban2 has quit IRC (Read error: Connection reset by peer) [20:22] *** thuban3 has quit IRC (Read error: Connection reset by peer) [20:25] *** thuban3 has joined #archiveteam-bs [20:29] *** DLoader_ has joined #archiveteam-bs [20:34] *** Cheryl has quit IRC (Quit: Page closed) [20:34] *** Cheryl has joined #archiveteam-bs [20:35] Are private archived groups still private? Who can access them and how? [20:40] *** DLoader has quit IRC (Ping timeout: 745 seconds) [20:41] *** DLoader_ is now known as DLoader [20:48] *** Cheryl has quit IRC (Quit: Page closed) [20:58] *** TC01 has quit IRC (Read error: Operation timed out) [21:12] *** Ctrl has joined #archiveteam-bs [21:13] *** BlueMax has quit IRC (Read error: Connection reset by peer) [21:21] *** thuban4 has joined #archiveteam-bs [21:29] *** TC01 has joined #archiveteam-bs [21:31] *** thuban3 has quit IRC (Ping timeout: 745 seconds) [21:34] *** godane has joined #archiveteam-bs [21:48] i believe private yahoo groups aren't being archived by us [21:49] but real answers would come from the people working on the project, and they are all in the channel #yahoosucks [21:49] (many of them are here but not all) [21:50] #pythons-attack-y! actually, the other was only for the DPoS project. [21:50] ah sorry [21:50] do you mean #python-attacks-y! ? [21:50] No, attack. And the exclamation mark is part of the channel name. [21:51] ah ok! [21:51] Er, pythons-attack, not python-attacks, yeah. [21:52] we archived a small number of private groups by request of the moderators; betamax would know more [21:52] *** thuban4 is now known as thuban [21:55] yes, we had a Yahoo account that mods of private groups could invite if they wanted to be archived [21:56] and in some cases (particularly fandom groups, which were being worked on by another group) we "cold called" groups, asking if they wanted to be archived [22:11] *** foureyes_ is now known as foureyes [22:33] *** DLoader_ has joined #archiveteam-bs [22:44] *** DLoader has quit IRC (Ping timeout: 745 seconds) [22:44] *** DLoader_ is now known as DLoader [22:49] *** DLoader_ has joined #archiveteam-bs [22:57] *** DLoader has quit IRC (Ping timeout: 745 seconds) [22:57] *** DLoader_ is now known as DLoader [22:58] *** DLoader_ has joined #archiveteam-bs [23:01] *** VerifiedJ has joined #archiveteam-bs [23:10] *** DLoader has quit IRC (Ping timeout: 745 seconds) [23:10] *** DLoader_ is now known as DLoader