[00:07] *** dewdropaw has joined #archiveteam-bs [00:07] *** nyany has quit IRC (Read error: Operation timed out) [00:07] *** nyany has joined #archiveteam-bs [00:07] *** Igloo has quit IRC (Read error: Operation timed out) [00:07] *** Larsenv has quit IRC (Read error: Operation timed out) [00:07] *** cppchrisc has joined #archiveteam-bs [00:07] *** cppchrisc has quit IRC (Connection closed) [00:07] *** Igloo has joined #archiveteam-bs [00:08] *** cppchrisc has joined #archiveteam-bs [00:08] *** Larsenv has joined #archiveteam-bs [00:08] *** svchfoo1 sets mode: +o Igloo [00:08] *** svchfoo3 sets mode: +o Igloo [00:10] *** dewdrop has quit IRC (Ping timeout: 360 seconds) [00:21] @betamax and @markedL, sorry for the delayed response. Work priorities and all... [00:22] I figured as much. The membership though, is that Archive-It you're talking about? [00:31] I mean, there are ways to check if something is in the WBM. how many URLs do you need to check? [00:45] I believe there were 55 links in, 'https://transfer.notkiska.pw/PvcO6/ModDB_Potter_Downloads_URLs_11.2019.txt' ' @betamax pulled them for me last night [00:46] Also, I'd like to know for future reference/be able to do it on a whim [00:46] but how do I manage to archive the downloads at the end of a GDrive link, instead of just archiving the URL ? [00:47] HP_Archiv: rclone can grab those [00:47] assuming you can save the folder/file to your gdrive [01:21] *** Video has joined #archiveteam-bs [01:34] @ivan, It's not my Google Drive those files are hosted on. And what I'd like to do is save them as part of the overal capture for HP-Game.net in the WBM, and then on IA [01:34] Is that possible?> [01:35] *** LowLevelM has joined #archiveteam-bs [01:58] *** LowLevelM has quit IRC (Ping timeout: 262 seconds) [02:17] *** omglolba- has joined #archiveteam-bs [02:22] *** pew has quit IRC (Ping timeout: 252 seconds) [02:26] *** dd33cc has quit IRC (Ping timeout: 260 seconds) [02:27] *** omglolbah has quit IRC (Ping timeout: 745 seconds) [02:35] *** IAmbience has quit IRC (Quit: Connection closed for inactivity) [02:36] *** pew has joined #archiveteam-bs [02:41] *** DogsRNice has quit IRC (Read error: Connection reset by peer) [03:22] *** manjaro-u has quit IRC (Read error: Operation timed out) [04:39] *** qw3rty2 has joined #archiveteam-bs [04:48] *** qw3rty has quit IRC (Ping timeout: 745 seconds) [05:31] *** systwi has quit IRC (Read error: Connection reset by peer) [05:32] *** systwi has joined #archiveteam-bs [05:59] *** Stilettoo has joined #archiveteam-bs [05:59] *** Stiletto has quit IRC (Ping timeout: 246 seconds) [06:00] *** ShellyRol has quit IRC (Read error: Connection reset by peer) [06:02] *** ShellyRol has joined #archiveteam-bs [06:13] *** HP_Archiv has quit IRC (Quit: Page closed) [06:13] *** HP_Archiv has joined #archiveteam-bs [06:13] Does anyone know? [07:42] *** is- has joined #archiveteam-bs [08:03] *** Ivy has quit IRC (Quit: Connection closed for inactivity) [08:13] *** purplebot has quit IRC (Remote host closed the connection) [08:14] *** purplebot has joined #archiveteam-bs [08:26] *** Flashfire has quit IRC (Remote host closed the connection) [08:26] *** kiska has quit IRC (Remote host closed the connection) [08:27] *** Flashfire has joined #archiveteam-bs [08:27] *** kiska has joined #archiveteam-bs [08:27] *** Fusl__ sets mode: +o kiska [08:27] *** Fusl_ sets mode: +o kiska [08:27] *** Fusl sets mode: +o kiska [09:44] Anyone around? [09:45] Hi HP_Archiv [09:45] You can check in bulk with the CDX API [09:46] I have no idea what that is, I'm new around here [09:46] Then make a list, provide them in #archivebot and it will go into WBM when the job is done + a period of time [09:46] https://github.com/internetarchive/wayback/tree/master/wayback-cdx-server [09:48] Okay thank you ^^ But that wasn't what I was asking right now. So earlier I was in here trying to get help for how to get AB to archive a file that's from a URL on a specific URL I'm going to capture/submit. [09:48] I want to archive those files, hosted on a Google Drive account, into WBM [09:48] Any way to do this? [09:49] I can't scroll back from earlier this afternoon, but basically this - https://hp-games.net/343 [09:49] So, WBM only works if the files are in their original location [09:49] That's a mod entry on an Potter game site. The mod file itself, creator by a different person other than the site owner, has the mod file hosted in a Google Dirve and on Yandex. I can submit that URL no problem. I've already done this. However, how do I get archive bot to archive that particular file? [09:50] https://drive.google.com/open?id=0BxEt9eREFkhlaUZrQ3lKME9LWDg [09:50] These links? [09:50] Yup, correct [09:50] Ok, Leave it with me. ArchiveBot may not do it [09:50] I need to step away for a few minutes, But I can look for you. Only certain trusted people can upload to the Archive and have it in the WBM [09:51] Although anyone can upload to IA. [09:51] Well I hate to keep having to rely on others to fulfill my requests... [09:51] Hm, okay. But there's a variety of links, just like that page, which contain other links pointing to Google Drive files. [09:51] This page: https://hp-games.net/343 [09:51] Oops [09:52] https://hp-games.net/all-mods [09:52] That page ^^ [09:52] I want to archive all of those pod pages & associated page elements, and then archive the hosted files that either link out to Google Drive/Yandex. [09:53] If you could do that, that would be awesome. But it seems time consuming (unless you're using a script I'm unaware of.) Either way, take your time. [09:53] mod pages* [09:56] One last thing, some of those mod pages, ex: this page, https://hp-games.net/mods-dl-downloads, link out to a separate page with many direct links to Google/Yandex. I have no idea how you're going to get all of this links/sub-links, etc. in an easy fashion. But if you need any help, let me know [10:15] *** SmileyG has joined #archiveteam-bs [10:19] *** schbirid has joined #archiveteam-bs [10:21] The problem is that ArchiveBot can't get that those files [10:22] @Igloo, there's no workaround? [10:25] Oh there are workarounds, Just looking at options :) [10:25] *** Smiley has quit IRC (Ping timeout: 745 seconds) [10:25] Heh, okay. Let me know what you come up with :) [10:28] Also, this is completely unrelated, but has ArchiveTeam considered the implications of when Myspace goes away, have they archived Myspace already? [10:29] Myspace was done a while back I am sure [10:30] https://www.archiveteam.org/index.php?title=Myspace [10:31] Thanks for the link ^^ apparently because of zero heads up, they were unable to archive a lot, sadly [10:32] I was just now mulling over what sites out there might need focus and for whatever reason I thought of Myspace, heh [10:33] focus = attention [10:33] Yeah, There is a huge list of shit that needs to be looked at [10:34] Yeah, I actually just thought of one - Urban Dictionary [10:34] That is a goldmine for future linguistics [10:35] And it appears that they haven't gotten to that yet [10:37] Do you have a solution for my HP-Games dilemma? [10:38] http://shiva3dengine.com/legacy_forum/index.php Can someone chuck this in for achiving, uses a session ID in the URL and I don't know what to do about it [10:38] It's a legacy forum for an old 3D game engine [10:41] Igloo: Cheers, guess it was that simple [10:41] Should be :) [10:41] Monitoring it [11:01] *** BlueMax has quit IRC (Quit: Leaving) [11:28] *** Damme has quit IRC (Read error: Connection reset by peer) [11:31] I forget, what are the parameters for entering a link into archivebot? It's !ao < or something, I think [11:34] Never mind, got it [11:35] for future reference: https://archivebot.readthedocs.io/en/latest/ [11:35] although some of those commands require voice / ops, which you'll need to ask for in #archivebot before being able to use [11:39] @betamax thank you [11:44] Hm, I can't seem to find the one I was using before [11:45] Isn't it this, ' !ao < ' ? [11:49] I got it. [11:50] @betamax, but using the command I just did is the right way to properly archive an entire site? That's the default, correct? [11:57] !ao < takes the list of urls, and individually archives each of those URLs [11:57] there isn't really a "default" [11:58] however the way to archive an entire site would be !a (which needs voice / ops), this recursively archives a single site (ie: archives the page you give it, then all links to the same domain on that page, then all links from those links, etc..) [11:59] Oh, I see. So I still need to ask for assistance if I want something done thoroughly? [12:00] probably best just to ask for voice / ops, so you can do it youself [12:00] but you will need to ask the first time, yes [12:00] Okay understood. I'll ask later on at a more appropriate time. It's 4 am where I am, heh [12:00] Thanks :) [12:28] *** wyatt8740 has quit IRC (Read error: Operation timed out) [12:55] SketchCow: so your getting some Canon japanese manuals from 2001 [12:55] for there printers at the time [13:23] *** jleclanch has quit IRC (Quit: Connection closed for inactivity) [13:28] *** Video_ has joined #archiveteam-bs [13:30] *** Stiletto has joined #archiveteam-bs [13:30] *** Stilettoo has quit IRC (Read error: Operation timed out) [13:34] *** Video has quit IRC (Read error: Operation timed out) [14:01] *** Damme has joined #archiveteam-bs [14:02] *** Ivy has joined #archiveteam-bs [14:13] *** mls_ has quit IRC (Remote host closed the connection) [14:19] *** mls_ has joined #archiveteam-bs [14:29] *** britmob has quit IRC (Read error: Connection reset by peer) [14:44] *** Zerote has joined #archiveteam-bs [15:20] *** britmob has joined #archiveteam-bs [15:44] *** JH8813269 has quit IRC (Quit: The Lounge - https://thelounge.chat) [16:17] in case I feel like reviving this tweet but for tiktok, can I just create a project page in the wiki? [16:26] Sire [16:26] Sure [16:31] *** X-Scale` has joined #archiveteam-bs [16:32] *** X-Scale has quit IRC (Ping timeout: 252 seconds) [16:32] *** X-Scale` is now known as X-Scale [16:49] *** meltir has joined #archiveteam-bs [17:07] *** SmileyG has quit IRC (Read error: Operation timed out) [17:08] *** Smiley has joined #archiveteam-bs [17:20] *** SmileyG has joined #archiveteam-bs [17:21] *** Smiley has quit IRC (Read error: Operation timed out) [17:25] *** SmileyG has quit IRC (Ping timeout: 258 seconds) [17:25] *** Smiley has joined #archiveteam-bs [17:39] feedback welcome, in case there are any obvious fuckups on my end [17:44] *** Smiley has quit IRC (Read error: Operation timed out) [17:46] *** Smiley has joined #archiveteam-bs [17:55] is it ok to create irc channels in advance, with no imminent shutdown/deletion? I have opinions on irc networks.. [17:56] *** icedice has joined #archiveteam-bs [18:31] *** X-Scale` has joined #archiveteam-bs [18:32] *** X-Scale has quit IRC (Ping timeout: 252 seconds) [18:32] *** X-Scale` is now known as X-Scale [18:38] *** HP_Archiv has quit IRC (Ping timeout: 260 seconds) [18:49] *** twigfoot has quit IRC (Read error: Operation timed out) [18:49] *** HashbangI has quit IRC (Read error: Operation timed out) [18:49] *** anarcat has quit IRC (Read error: Operation timed out) [18:49] *** Video has joined #archiveteam-bs [18:49] *** anarcat has joined #archiveteam-bs [18:49] *** twigfoot has joined #archiveteam-bs [18:49] *** closure has quit IRC (Read error: Operation timed out) [18:49] *** kiskabak has quit IRC (Read error: Operation timed out) [18:50] *** jake_test has quit IRC (Read error: Operation timed out) [18:50] *** closure has joined #archiveteam-bs [18:51] *** balrog has quit IRC (Read error: Operation timed out) [18:51] *** balrog has joined #archiveteam-bs [18:51] *** dewdrop has joined #archiveteam-bs [18:51] *** Dj-Wawa has quit IRC (Read error: Operation timed out) [18:52] *** Dj-Wawa has joined #archiveteam-bs [18:53] *** Zerote has quit IRC (Read error: Operation timed out) [18:54] *** Video_ has quit IRC (Read error: Operation timed out) [18:54] *** Zerote has joined #archiveteam-bs [18:55] *** dewdropaw has quit IRC (Read error: Operation timed out) [18:55] *** legoktm has joined #archiveteam-bs [18:55] *** ugh has quit IRC (Read error: Connection reset by peer) [18:55] *** ShellyRol has quit IRC (Read error: Operation timed out) [18:55] *** systwi_ has joined #archiveteam-bs [18:55] *** PhrackD has quit IRC (Read error: Connection reset by peer) [18:55] *** HashbangI has joined #archiveteam-bs [18:57] *** systwi has quit IRC (Read error: Operation timed out) [18:57] *** godane has quit IRC (Ping timeout: 612 seconds) [18:58] *** PhrackD has joined #archiveteam-bs [18:58] *** ShellyRol has joined #archiveteam-bs [18:59] *** godane has joined #archiveteam-bs [19:09] *** jake_test has joined #archiveteam-bs [19:09] *** ShellyRol has quit IRC (Read error: Connection reset by peer) [19:12] *** ShellyRol has joined #archiveteam-bs [19:38] *** manjaro-u has joined #archiveteam-bs [19:42] SketchCow: Just noticed that there are a lot of PyeongChang Olympics items in the AB collection. I suspect those should have their own collection instead; they definitely weren't retrieved through AB. https://archive.org/details/archivebot?and%5B%5D=pyeongchang&sin=&sort=-publicdate [20:12] *** Ivy has quit IRC (Quit: Connection closed for inactivity) [20:20] *** britmob has quit IRC (Ping timeout: 252 seconds) [20:24] *** britmob has joined #archiveteam-bs [21:22] *** Zerote has quit IRC (Quit: Leaving) [21:22] *** Zerote_ has quit IRC (Quit: Leaving) [21:22] *** Zerote has joined #archiveteam-bs [22:32] *** schbirid has quit IRC (Quit: Leaving) [22:32] *** bluefoo has quit IRC (Ping timeout: 246 seconds) [23:00] *** BlueMax has joined #archiveteam-bs [23:00] *** BlueMaxim has joined #archiveteam-bs [23:04] *** DogsRNice has joined #archiveteam-bs [23:06] *** bluefoo has joined #archiveteam-bs [23:14] *** killsushi has joined #archiveteam-bs [23:19] *** BlueMaxim has quit IRC (Quit: Leaving) [23:44] *** britmob has quit IRC (Read error: Operation timed out)