[00:01] those definitely seem to be _some_ of the homedirs, but i wouldn't be surprised if the rest were just users manually created on astro's ftp server... [00:01] here is an example: https://astro.temple.edu/~krunal/acb/new.html [00:01] looks like an old form email address: spopoff@vm.temple.edu is [00:01] https://astro.temple.edu/~spopoff/ [00:02] 1997 Temple University School of Medicine. All rights reserved - This page last updated on08/13/02 [00:03] what's up with SPN? It's throwing Bummer errors. [00:07] Any idea if IA is having troubles right now? thuban3, does http://web.archive.org/web/form-submit.jsp?type=prefixquery&url=https://astro.temple.edu/ work for you? [00:07] " [00:07] This page is not available on the web [00:07] because of server error [00:07] " [00:09] same here [00:49] *** godane has joined #archiveteam-bs [00:52] *** Pixi` has joined #archiveteam-bs [00:53] *** Pixi has quit IRC (Read error: Operation timed out) [00:53] *** Pixi` has quit IRC (Read error: Connection reset by peer) [00:58] *** britmob has quit IRC (Read error: Connection reset by peer) [01:04] *** britmob has joined #archiveteam-bs [01:34] *** phuzion has joined #archiveteam-bs [01:35] Is anyone aware that the admin interface for urlteam is super unstable right now? [01:36] admin interface or tracker interface? [01:39] admin interface [01:40] *** ShellyRol has quit IRC (Remote host closed the connection) [01:41] *** ShellyRol has joined #archiveteam-bs [02:51] *** Larsenv has quit IRC (Max SendQ exceeded) [02:52] *** Pixi has joined #archiveteam-bs [02:54] *** Darkstar has quit IRC (Read error: Connection reset by peer) [02:54] *** VADemon__ has joined #archiveteam-bs [02:56] *** VADemon__ has quit IRC (Read error: Connection reset by peer) [02:57] *** Darkstar has joined #archiveteam-bs [02:57] *** fredgido has joined #archiveteam-bs [02:58] *** VADemon has joined #archiveteam-bs [02:58] *** fredgido_ has quit IRC (Ping timeout: 360 seconds) [03:01] *** Pixi` has joined #archiveteam-bs [03:02] *** VADemon_ has quit IRC (Read error: Operation timed out) [03:03] *** Larsenv has joined #archiveteam-bs [03:06] *** Stiletto has quit IRC (Ping timeout: 360 seconds) [03:06] *** Stiletto has joined #archiveteam-bs [03:07] *** Pixi has quit IRC (Read error: Operation timed out) [03:14] *** step has quit IRC (Quit: ZNC 1.7.5 - https://znc.in) [03:15] *** step has joined #archiveteam-bs [03:36] *** bsmith093 has quit IRC (Quit: Leaving.) [03:36] *** bsmith093 has joined #archiveteam-bs [03:52] *** bsmith093 has quit IRC (Quit: Leaving.) [03:52] *** bsmith093 has joined #archiveteam-bs [04:05] *** BlueMax has joined #archiveteam-bs [04:19] *** OrIdow6 has quit IRC (Ping timeout: 276 seconds) [04:21] *** OrIdow6 has joined #archiveteam-bs [04:21] *** OrIdow6 has quit IRC (Client Quit) [04:24] *** thuban3 is now known as thuban [04:32] *** OrIdow6 has joined #archiveteam-bs [04:32] *** qw3rty__ has joined #archiveteam-bs [04:32] *** OrIdow6 has quit IRC (Client Quit) [04:33] *** OrIdow6 has joined #archiveteam-bs [04:36] *** qw3rty_ has quit IRC (Ping timeout: 276 seconds) [05:07] SketchCow: i'm gruessing this has mostly public domain movies : https://www.youtube.com/channel/UCHrz6qtqlly9bxWpIvHFlFg/videos [05:08] i'm only guessing cause the Sherlock Holmes : Woman in Green is on archive.org here: https://archive.org/details/TheWomanInGreen1945 [05:23] *** thuban1 has joined #archiveteam-bs [05:28] *** thuban has quit IRC (Read error: Operation timed out) [05:32] *** thuban1 is now known as thuban [05:32] *** RichardG has quit IRC (Ping timeout: 745 seconds) [06:19] *** wyatt8740 has quit IRC (Read error: Operation timed out) [06:29] *** wyatt8740 has joined #archiveteam-bs [06:33] *** BlueMax has quit IRC (Read error: Connection reset by peer) [06:34] *** BlueMax has joined #archiveteam-bs [06:34] *** Laverne has joined #archiveteam-bs [07:18] *** step has quit IRC (Quit: ZNC 1.7.5 - https://znc.in) [07:20] *** step has joined #archiveteam-bs [08:47] *** obskyr has quit IRC (Ping timeout: 745 seconds) [08:47] *** obskyr has joined #archiveteam-bs [08:48] *** gandalf has quit IRC (Ping timeout: 745 seconds) [08:48] *** ephemer0l has quit IRC (Ping timeout: 745 seconds) [08:49] *** gandalf has joined #archiveteam-bs [09:03] *** Meroje_ has joined #archiveteam-bs [09:04] *** Meroje has quit IRC (Read error: Connection reset by peer) [09:50] Is there any limitation on archivebot downloaded files that uploaded to Wayback Machine ? Archivebot downloaded site litaratura.org some days ago, and this data already in WBM: https://web.archive.org/web/20200221193534/http://litaratura.org/starazhytnya?artid=121. Good. But there is link on this page, that was downloaded by logs(https://archive.fart.website/archivebot/viewer/job/dw9q2), but WBM says 404: https://web.archive.org/web/20200221193534/h [09:50] ttp://litaratura.org/download.php?item=9bbf7eba5c4304ea85b79db45b5621a8.pdf&title=Ernst%20Teador%20Amadej%20Hofman,%20Kurdupyel%20Cakhyes,%20yakoha%20zvali%20Cynober%20(PDF) [09:51] Can it happens because url contains spaces ? [09:54] alex73_: https://web.archive.org/web/20200221193733/http://litaratura.org/download.php?item=9bbf7eba5c4304ea85b79db45b5621a8.pdf&title=Ernst+Teador+Amadej+Hofman,+Kurdupyel+Cakhyes,+yakoha+zvali+Cynober+(PDF) [09:54] So yes, spaces. [09:56] This smells like a bug in wpull or urlparse. [10:02] *** RichardG has joined #archiveteam-bs [10:02] *** Smiley has quit IRC (Ping timeout: 276 seconds) [10:05] *** Smiley has joined #archiveteam-bs [10:11] *** SmileyG has joined #archiveteam-bs [10:21] *** Smiley has quit IRC (Ping timeout: 745 seconds) [10:24] WARC file contains record with field: [10:24] WARC-Target-URI: http://litaratura.org/download.php?item=9bbf7eba5c4304ea85b79db45b5621a8.pdf&title=Ernst+Teador+Amadej+Hofman,+Kurdupyel+Cakhyes,+yakoha+zvali+Cynober+(PDF) [10:24] But ISO 28500 says: The URI in this value shall be properly escaped according to [RFC3986] and written with no internal whitespace. [10:24] RFC3986 says about %20 replacement for spaces, not '+'. [10:24] So, does it mean WARC-Target-URI should contain %20 instead plus char ? [10:37] Kind of. wpull should have normalised the URL to contain %20 rather than + in the query string before retrieval. And then that's also what would've been written to WARC. [10:57] *** ephemer0l has joined #archiveteam-bs [11:17] *** wyatt8740 has quit IRC (Ping timeout: 276 seconds) [11:22] *** bitbit has joined #archiveteam-bs [11:25] *** wyatt8740 has joined #archiveteam-bs [12:04] *** atluxity has joined #archiveteam-bs [12:05] well it sure has been a while [12:10] *** BlueMax has quit IRC (Read error: Connection reset by peer) [13:13] alex73_: Filed as https://github.com/ArchiveTeam/wpull/issues/445 [15:18] *** superkuh_ is now known as superkuh [15:55] *** thuban has quit IRC (Ping timeout: 276 seconds) [16:10] *** thuban has joined #archiveteam-bs [17:51] *** balrog has quit IRC (Read error: Operation timed out) [17:51] *** balrog has joined #archiveteam-bs [17:53] *** twigfoot has quit IRC (Ping timeout: 360 seconds) [17:54] *** twigfoot has joined #archiveteam-bs [18:02] *** eythian_ has joined #archiveteam-bs [18:02] *** pie_[bnc] has quit IRC (Read error: Connection reset by peer) [18:03] *** pie_[bnc] has joined #archiveteam-bs [18:03] *** Mateon1 has quit IRC (Remote host closed the connection) [18:03] *** Mateon1 has joined #archiveteam-bs [18:04] *** superkuh has quit IRC (Excess Flood) [18:05] *** Mayonaise has quit IRC (Ping timeout: 360 seconds) [18:06] *** superkuh has joined #archiveteam-bs [18:06] *** Mayonaise has joined #archiveteam-bs [18:15] *** eythian has quit IRC (Read error: Operation timed out) [18:17] *** SmileyG has quit IRC (Remote host closed the connection) [18:17] *** Smiley has joined #archiveteam-bs [18:19] *** antomati_ has joined #archiveteam-bs [18:21] *** unlobito has quit IRC (Ping timeout: 360 seconds) [18:21] *** Maylay_ has joined #archiveteam-bs [18:22] *** unlobito has joined #archiveteam-bs [18:28] *** eythian has joined #archiveteam-bs [18:29] *** Mateon1 has quit IRC (Remote host closed the connection) [18:29] *** antomatic has quit IRC (Read error: Operation timed out) [18:30] *** superkuh has quit IRC (Excess Flood) [18:31] *** Mateon1 has joined #archiveteam-bs [18:31] *** superkuh has joined #archiveteam-bs [18:34] *** Maylay has quit IRC (Read error: Operation timed out) [18:40] *** eythian_ has quit IRC (Read error: Operation timed out) [19:19] *** underscor has quit IRC (Ping timeout: 276 seconds) [19:40] *** SmileyG has joined #archiveteam-bs [19:43] *** Smiley has quit IRC (Read error: Operation timed out) [19:49] *** Smiley has joined #archiveteam-bs [19:57] *** SmileyG_ has joined #archiveteam-bs [19:59] *** SmileyG has quit IRC (Ping timeout: 745 seconds) [20:04] *** SmileyG has joined #archiveteam-bs [20:04] *** godane has quit IRC (Read error: Operation timed out) [20:05] *** Smiley has quit IRC (Ping timeout: 610 seconds) [20:07] *** godane has joined #archiveteam-bs [20:10] SketchCow: i'm starting to upload issues of MIT Technology Review [20:14] *** SmileyG_ has quit IRC (Ping timeout: 745 seconds) [20:30] *** halt has joined #archiveteam-bs [20:34] *** halt_ has quit IRC (Ping timeout: 610 seconds) [21:29] *** godane has quit IRC (Read error: Connection reset by peer) [22:14] *** thuban1 has joined #archiveteam-bs [22:15] *** thuban has quit IRC (Read error: Operation timed out) [22:40] *** scorche` has joined #archiveteam-bs [22:42] *** scorche has quit IRC (Read error: Operation timed out) [22:42] *** scorche` is now known as scorche [22:58] *** Ryz has quit IRC (Remote host closed the connection) [22:58] *** kiska18 has quit IRC (Remote host closed the connection) [22:59] *** kiska18 has joined #archiveteam-bs [23:00] *** Ryz has joined #archiveteam-bs [23:22] *** BlueMax has joined #archiveteam-bs