#archiveteam-bs 2019-11-09,Sat

↑back Search

Time Nickname Message
00:07 🔗 dewdropaw has joined #archiveteam-bs
00:07 🔗 nyany has quit IRC (Read error: Operation timed out)
00:07 🔗 nyany has joined #archiveteam-bs
00:07 🔗 Igloo has quit IRC (Read error: Operation timed out)
00:07 🔗 Larsenv has quit IRC (Read error: Operation timed out)
00:07 🔗 cppchrisc has joined #archiveteam-bs
00:07 🔗 cppchrisc has quit IRC (Connection closed)
00:07 🔗 Igloo has joined #archiveteam-bs
00:08 🔗 cppchrisc has joined #archiveteam-bs
00:08 🔗 Larsenv has joined #archiveteam-bs
00:08 🔗 svchfoo1 sets mode: +o Igloo
00:08 🔗 svchfoo3 sets mode: +o Igloo
00:10 🔗 dewdrop has quit IRC (Ping timeout: 360 seconds)
00:21 🔗 HP_Archiv @betamax and @markedL, sorry for the delayed response. Work priorities and all...
00:22 🔗 HP_Archiv I figured as much. The membership though, is that Archive-It you're talking about?
00:31 🔗 markedL I mean, there are ways to check if something is in the WBM. how many URLs do you need to check?
00:45 🔗 HP_Archiv I believe there were 55 links in, 'https://transfer.notkiska.pw/PvcO6/ModDB_Potter_Downloads_URLs_11.2019.txt' ' @betamax pulled them for me last night
00:46 🔗 HP_Archiv Also, I'd like to know for future reference/be able to do it on a whim
00:46 🔗 HP_Archiv but how do I manage to archive the downloads at the end of a GDrive link, instead of just archiving the URL ?
00:47 🔗 ivan HP_Archiv: rclone can grab those
00:47 🔗 ivan assuming you can save the folder/file to your gdrive
01:21 🔗 Video has joined #archiveteam-bs
01:34 🔗 HP_Archiv @ivan, It's not my Google Drive those files are hosted on. And what I'd like to do is save them as part of the overal capture for HP-Game.net in the WBM, and then on IA
01:34 🔗 HP_Archiv Is that possible?>
01:35 🔗 LowLevelM has joined #archiveteam-bs
01:58 🔗 LowLevelM has quit IRC (Ping timeout: 262 seconds)
02:17 🔗 omglolba- has joined #archiveteam-bs
02:22 🔗 pew has quit IRC (Ping timeout: 252 seconds)
02:26 🔗 dd33cc has quit IRC (Ping timeout: 260 seconds)
02:27 🔗 omglolbah has quit IRC (Ping timeout: 745 seconds)
02:35 🔗 IAmbience has quit IRC (Quit: Connection closed for inactivity)
02:36 🔗 pew has joined #archiveteam-bs
02:41 🔗 DogsRNice has quit IRC (Read error: Connection reset by peer)
03:22 🔗 manjaro-u has quit IRC (Read error: Operation timed out)
04:39 🔗 qw3rty2 has joined #archiveteam-bs
04:48 🔗 qw3rty has quit IRC (Ping timeout: 745 seconds)
05:31 🔗 systwi has quit IRC (Read error: Connection reset by peer)
05:32 🔗 systwi has joined #archiveteam-bs
05:59 🔗 Stilettoo has joined #archiveteam-bs
05:59 🔗 Stiletto has quit IRC (Ping timeout: 246 seconds)
06:00 🔗 ShellyRol has quit IRC (Read error: Connection reset by peer)
06:02 🔗 ShellyRol has joined #archiveteam-bs
06:13 🔗 HP_Archiv has quit IRC (Quit: Page closed)
06:13 🔗 HP_Archiv has joined #archiveteam-bs
06:13 🔗 HP_Archiv Does anyone know?
07:42 🔗 is- has joined #archiveteam-bs
08:03 🔗 Ivy has quit IRC (Quit: Connection closed for inactivity)
08:13 🔗 purplebot has quit IRC (Remote host closed the connection)
08:14 🔗 purplebot has joined #archiveteam-bs
08:26 🔗 Flashfire has quit IRC (Remote host closed the connection)
08:26 🔗 kiska has quit IRC (Remote host closed the connection)
08:27 🔗 Flashfire has joined #archiveteam-bs
08:27 🔗 kiska has joined #archiveteam-bs
08:27 🔗 Fusl__ sets mode: +o kiska
08:27 🔗 Fusl_ sets mode: +o kiska
08:27 🔗 Fusl sets mode: +o kiska
09:44 🔗 HP_Archiv Anyone around?
09:45 🔗 Igloo Hi HP_Archiv
09:45 🔗 Igloo You can check in bulk with the CDX API
09:46 🔗 HP_Archiv I have no idea what that is, I'm new around here
09:46 🔗 Igloo Then make a list, provide them in #archivebot and it will go into WBM when the job is done + a period of time
09:46 🔗 Igloo https://github.com/internetarchive/wayback/tree/master/wayback-cdx-server
09:48 🔗 HP_Archiv Okay thank you ^^ But that wasn't what I was asking right now. So earlier I was in here trying to get help for how to get AB to archive a file that's from a URL on a specific URL I'm going to capture/submit.
09:48 🔗 HP_Archiv I want to archive those files, hosted on a Google Drive account, into WBM
09:48 🔗 HP_Archiv Any way to do this?
09:49 🔗 HP_Archiv I can't scroll back from earlier this afternoon, but basically this - https://hp-games.net/343
09:49 🔗 Igloo So, WBM only works if the files are in their original location
09:49 🔗 HP_Archiv That's a mod entry on an Potter game site. The mod file itself, creator by a different person other than the site owner, has the mod file hosted in a Google Dirve and on Yandex. I can submit that URL no problem. I've already done this. However, how do I get archive bot to archive that particular file?
09:50 🔗 Igloo https://drive.google.com/open?id=0BxEt9eREFkhlaUZrQ3lKME9LWDg
09:50 🔗 Igloo These links?
09:50 🔗 HP_Archiv Yup, correct
09:50 🔗 Igloo Ok, Leave it with me. ArchiveBot may not do it
09:50 🔗 Igloo I need to step away for a few minutes, But I can look for you. Only certain trusted people can upload to the Archive and have it in the WBM
09:51 🔗 Igloo Although anyone can upload to IA.
09:51 🔗 HP_Archiv Well I hate to keep having to rely on others to fulfill my requests...
09:51 🔗 HP_Archiv Hm, okay. But there's a variety of links, just like that page, which contain other links pointing to Google Drive files.
09:51 🔗 HP_Archiv This page: https://hp-games.net/343
09:51 🔗 HP_Archiv Oops
09:52 🔗 HP_Archiv https://hp-games.net/all-mods
09:52 🔗 HP_Archiv That page ^^
09:52 🔗 HP_Archiv I want to archive all of those pod pages & associated page elements, and then archive the hosted files that either link out to Google Drive/Yandex.
09:53 🔗 HP_Archiv If you could do that, that would be awesome. But it seems time consuming (unless you're using a script I'm unaware of.) Either way, take your time.
09:53 🔗 HP_Archiv mod pages*
09:56 🔗 HP_Archiv One last thing, some of those mod pages, ex: this page, https://hp-games.net/mods-dl-downloads, link out to a separate page with many direct links to Google/Yandex. I have no idea how you're going to get all of this links/sub-links, etc. in an easy fashion. But if you need any help, let me know
10:15 🔗 SmileyG has joined #archiveteam-bs
10:19 🔗 schbirid has joined #archiveteam-bs
10:21 🔗 Igloo The problem is that ArchiveBot can't get that those files
10:22 🔗 HP_Archiv @Igloo, there's no workaround?
10:25 🔗 Igloo Oh there are workarounds, Just looking at options :)
10:25 🔗 Smiley has quit IRC (Ping timeout: 745 seconds)
10:25 🔗 HP_Archiv Heh, okay. Let me know what you come up with :)
10:28 🔗 HP_Archiv Also, this is completely unrelated, but has ArchiveTeam considered the implications of when Myspace goes away, have they archived Myspace already?
10:29 🔗 Igloo Myspace was done a while back I am sure
10:30 🔗 Igloo https://www.archiveteam.org/index.php?title=Myspace
10:31 🔗 HP_Archiv Thanks for the link ^^ apparently because of zero heads up, they were unable to archive a lot, sadly
10:32 🔗 HP_Archiv I was just now mulling over what sites out there might need focus and for whatever reason I thought of Myspace, heh
10:33 🔗 HP_Archiv focus = attention
10:33 🔗 Igloo Yeah, There is a huge list of shit that needs to be looked at
10:34 🔗 HP_Archiv Yeah, I actually just thought of one - Urban Dictionary
10:34 🔗 HP_Archiv That is a goldmine for future linguistics
10:35 🔗 HP_Archiv And it appears that they haven't gotten to that yet
10:37 🔗 HP_Archiv Do you have a solution for my HP-Games dilemma?
10:38 🔗 eientei95 http://shiva3dengine.com/legacy_forum/index.php Can someone chuck this in for achiving, uses a session ID in the URL and I don't know what to do about it
10:38 🔗 eientei95 It's a legacy forum for an old 3D game engine
10:41 🔗 eientei95 Igloo: Cheers, guess it was that simple
10:41 🔗 Igloo Should be :)
10:41 🔗 Igloo Monitoring it
11:01 🔗 BlueMax has quit IRC (Quit: Leaving)
11:28 🔗 Damme has quit IRC (Read error: Connection reset by peer)
11:31 🔗 HP_Archiv I forget, what are the parameters for entering a link into archivebot? It's !ao < or something, I think
11:34 🔗 HP_Archiv Never mind, got it
11:35 🔗 betamax for future reference: https://archivebot.readthedocs.io/en/latest/
11:35 🔗 betamax although some of those commands require voice / ops, which you'll need to ask for in #archivebot before being able to use
11:39 🔗 HP_Archiv @betamax thank you
11:44 🔗 HP_Archiv Hm, I can't seem to find the one I was using before
11:45 🔗 HP_Archiv Isn't it this, ' !ao < ' ?
11:49 🔗 HP_Archiv I got it.
11:50 🔗 HP_Archiv @betamax, but using the command I just did is the right way to properly archive an entire site? That's the default, correct?
11:57 🔗 betamax !ao < takes the list of urls, and individually archives each of those URLs
11:57 🔗 betamax there isn't really a "default"
11:58 🔗 betamax however the way to archive an entire site would be !a (which needs voice / ops), this recursively archives a single site (ie: archives the page you give it, then all links to the same domain on that page, then all links from those links, etc..)
11:59 🔗 HP_Archiv Oh, I see. So I still need to ask for assistance if I want something done thoroughly?
12:00 🔗 betamax probably best just to ask for voice / ops, so you can do it youself
12:00 🔗 betamax but you will need to ask the first time, yes
12:00 🔗 HP_Archiv Okay understood. I'll ask later on at a more appropriate time. It's 4 am where I am, heh
12:00 🔗 HP_Archiv Thanks :)
12:28 🔗 wyatt8740 has quit IRC (Read error: Operation timed out)
12:55 🔗 godane SketchCow: so your getting some Canon japanese manuals from 2001
12:55 🔗 godane for there printers at the time
13:23 🔗 jleclanch has quit IRC (Quit: Connection closed for inactivity)
13:28 🔗 Video_ has joined #archiveteam-bs
13:30 🔗 Stiletto has joined #archiveteam-bs
13:30 🔗 Stilettoo has quit IRC (Read error: Operation timed out)
13:34 🔗 Video has quit IRC (Read error: Operation timed out)
14:01 🔗 Damme has joined #archiveteam-bs
14:02 🔗 Ivy has joined #archiveteam-bs
14:13 🔗 mls_ has quit IRC (Remote host closed the connection)
14:19 🔗 mls_ has joined #archiveteam-bs
14:29 🔗 britmob has quit IRC (Read error: Connection reset by peer)
14:44 🔗 Zerote has joined #archiveteam-bs
15:20 🔗 britmob has joined #archiveteam-bs
15:44 🔗 JH8813269 has quit IRC (Quit: The Lounge - https://thelounge.chat)
16:17 🔗 kpcyrd in case I feel like reviving this tweet but for tiktok, can I just create a project page in the wiki?
16:26 🔗 Igloo Sire
16:26 🔗 Igloo Sure
16:31 🔗 X-Scale` has joined #archiveteam-bs
16:32 🔗 X-Scale has quit IRC (Ping timeout: 252 seconds)
16:32 🔗 X-Scale` is now known as X-Scale
16:49 🔗 meltir has joined #archiveteam-bs
17:07 🔗 SmileyG has quit IRC (Read error: Operation timed out)
17:08 🔗 Smiley has joined #archiveteam-bs
17:20 🔗 SmileyG has joined #archiveteam-bs
17:21 🔗 Smiley has quit IRC (Read error: Operation timed out)
17:25 🔗 SmileyG has quit IRC (Ping timeout: 258 seconds)
17:25 🔗 Smiley has joined #archiveteam-bs
17:39 🔗 kpcyrd feedback welcome, in case there are any obvious fuckups on my end
17:44 🔗 Smiley has quit IRC (Read error: Operation timed out)
17:46 🔗 Smiley has joined #archiveteam-bs
17:55 🔗 kpcyrd is it ok to create irc channels in advance, with no imminent shutdown/deletion? I have opinions on irc networks..
17:56 🔗 icedice has joined #archiveteam-bs
18:31 🔗 X-Scale` has joined #archiveteam-bs
18:32 🔗 X-Scale has quit IRC (Ping timeout: 252 seconds)
18:32 🔗 X-Scale` is now known as X-Scale
18:38 🔗 HP_Archiv has quit IRC (Ping timeout: 260 seconds)
18:49 🔗 twigfoot has quit IRC (Read error: Operation timed out)
18:49 🔗 HashbangI has quit IRC (Read error: Operation timed out)
18:49 🔗 anarcat has quit IRC (Read error: Operation timed out)
18:49 🔗 Video has joined #archiveteam-bs
18:49 🔗 anarcat has joined #archiveteam-bs
18:49 🔗 twigfoot has joined #archiveteam-bs
18:49 🔗 closure has quit IRC (Read error: Operation timed out)
18:49 🔗 kiskabak has quit IRC (Read error: Operation timed out)
18:50 🔗 jake_test has quit IRC (Read error: Operation timed out)
18:50 🔗 closure has joined #archiveteam-bs
18:51 🔗 balrog has quit IRC (Read error: Operation timed out)
18:51 🔗 balrog has joined #archiveteam-bs
18:51 🔗 dewdrop has joined #archiveteam-bs
18:51 🔗 Dj-Wawa has quit IRC (Read error: Operation timed out)
18:52 🔗 Dj-Wawa has joined #archiveteam-bs
18:53 🔗 Zerote has quit IRC (Read error: Operation timed out)
18:54 🔗 Video_ has quit IRC (Read error: Operation timed out)
18:54 🔗 Zerote has joined #archiveteam-bs
18:55 🔗 dewdropaw has quit IRC (Read error: Operation timed out)
18:55 🔗 legoktm has joined #archiveteam-bs
18:55 🔗 ugh has quit IRC (Read error: Connection reset by peer)
18:55 🔗 ShellyRol has quit IRC (Read error: Operation timed out)
18:55 🔗 systwi_ has joined #archiveteam-bs
18:55 🔗 PhrackD has quit IRC (Read error: Connection reset by peer)
18:55 🔗 HashbangI has joined #archiveteam-bs
18:57 🔗 systwi has quit IRC (Read error: Operation timed out)
18:57 🔗 godane has quit IRC (Ping timeout: 612 seconds)
18:58 🔗 PhrackD has joined #archiveteam-bs
18:58 🔗 ShellyRol has joined #archiveteam-bs
18:59 🔗 godane has joined #archiveteam-bs
19:09 🔗 jake_test has joined #archiveteam-bs
19:09 🔗 ShellyRol has quit IRC (Read error: Connection reset by peer)
19:12 🔗 ShellyRol has joined #archiveteam-bs
19:38 🔗 manjaro-u has joined #archiveteam-bs
19:42 🔗 JAA SketchCow: Just noticed that there are a lot of PyeongChang Olympics items in the AB collection. I suspect those should have their own collection instead; they definitely weren't retrieved through AB. https://archive.org/details/archivebot?and%5B%5D=pyeongchang&sin=&sort=-publicdate
20:12 🔗 Ivy has quit IRC (Quit: Connection closed for inactivity)
20:20 🔗 britmob has quit IRC (Ping timeout: 252 seconds)
20:24 🔗 britmob has joined #archiveteam-bs
21:22 🔗 Zerote has quit IRC (Quit: Leaving)
21:22 🔗 Zerote_ has quit IRC (Quit: Leaving)
21:22 🔗 Zerote has joined #archiveteam-bs
22:32 🔗 schbirid has quit IRC (Quit: Leaving)
22:32 🔗 bluefoo has quit IRC (Ping timeout: 246 seconds)
23:00 🔗 BlueMax has joined #archiveteam-bs
23:00 🔗 BlueMaxim has joined #archiveteam-bs
23:04 🔗 DogsRNice has joined #archiveteam-bs
23:06 🔗 bluefoo has joined #archiveteam-bs
23:14 🔗 killsushi has joined #archiveteam-bs
23:19 🔗 BlueMaxim has quit IRC (Quit: Leaving)
23:44 🔗 britmob has quit IRC (Read error: Operation timed out)

irclogger-viewer