Time |
Nickname |
Message |
00:07
🔗
|
|
dewdropaw has joined #archiveteam-bs |
00:07
🔗
|
|
nyany has quit IRC (Read error: Operation timed out) |
00:07
🔗
|
|
nyany has joined #archiveteam-bs |
00:07
🔗
|
|
Igloo has quit IRC (Read error: Operation timed out) |
00:07
🔗
|
|
Larsenv has quit IRC (Read error: Operation timed out) |
00:07
🔗
|
|
cppchrisc has joined #archiveteam-bs |
00:07
🔗
|
|
cppchrisc has quit IRC (Connection closed) |
00:07
🔗
|
|
Igloo has joined #archiveteam-bs |
00:08
🔗
|
|
cppchrisc has joined #archiveteam-bs |
00:08
🔗
|
|
Larsenv has joined #archiveteam-bs |
00:08
🔗
|
|
svchfoo1 sets mode: +o Igloo |
00:08
🔗
|
|
svchfoo3 sets mode: +o Igloo |
00:10
🔗
|
|
dewdrop has quit IRC (Ping timeout: 360 seconds) |
00:21
🔗
|
HP_Archiv |
@betamax and @markedL, sorry for the delayed response. Work priorities and all... |
00:22
🔗
|
HP_Archiv |
I figured as much. The membership though, is that Archive-It you're talking about? |
00:31
🔗
|
markedL |
I mean, there are ways to check if something is in the WBM. how many URLs do you need to check? |
00:45
🔗
|
HP_Archiv |
I believe there were 55 links in, 'https://transfer.notkiska.pw/PvcO6/ModDB_Potter_Downloads_URLs_11.2019.txt' ' @betamax pulled them for me last night |
00:46
🔗
|
HP_Archiv |
Also, I'd like to know for future reference/be able to do it on a whim |
00:46
🔗
|
HP_Archiv |
but how do I manage to archive the downloads at the end of a GDrive link, instead of just archiving the URL ? |
00:47
🔗
|
ivan |
HP_Archiv: rclone can grab those |
00:47
🔗
|
ivan |
assuming you can save the folder/file to your gdrive |
01:21
🔗
|
|
Video has joined #archiveteam-bs |
01:34
🔗
|
HP_Archiv |
@ivan, It's not my Google Drive those files are hosted on. And what I'd like to do is save them as part of the overal capture for HP-Game.net in the WBM, and then on IA |
01:34
🔗
|
HP_Archiv |
Is that possible?> |
01:35
🔗
|
|
LowLevelM has joined #archiveteam-bs |
01:58
🔗
|
|
LowLevelM has quit IRC (Ping timeout: 262 seconds) |
02:17
🔗
|
|
omglolba- has joined #archiveteam-bs |
02:22
🔗
|
|
pew has quit IRC (Ping timeout: 252 seconds) |
02:26
🔗
|
|
dd33cc has quit IRC (Ping timeout: 260 seconds) |
02:27
🔗
|
|
omglolbah has quit IRC (Ping timeout: 745 seconds) |
02:35
🔗
|
|
IAmbience has quit IRC (Quit: Connection closed for inactivity) |
02:36
🔗
|
|
pew has joined #archiveteam-bs |
02:41
🔗
|
|
DogsRNice has quit IRC (Read error: Connection reset by peer) |
03:22
🔗
|
|
manjaro-u has quit IRC (Read error: Operation timed out) |
04:39
🔗
|
|
qw3rty2 has joined #archiveteam-bs |
04:48
🔗
|
|
qw3rty has quit IRC (Ping timeout: 745 seconds) |
05:31
🔗
|
|
systwi has quit IRC (Read error: Connection reset by peer) |
05:32
🔗
|
|
systwi has joined #archiveteam-bs |
05:59
🔗
|
|
Stilettoo has joined #archiveteam-bs |
05:59
🔗
|
|
Stiletto has quit IRC (Ping timeout: 246 seconds) |
06:00
🔗
|
|
ShellyRol has quit IRC (Read error: Connection reset by peer) |
06:02
🔗
|
|
ShellyRol has joined #archiveteam-bs |
06:13
🔗
|
|
HP_Archiv has quit IRC (Quit: Page closed) |
06:13
🔗
|
|
HP_Archiv has joined #archiveteam-bs |
06:13
🔗
|
HP_Archiv |
Does anyone know? |
07:42
🔗
|
|
is- has joined #archiveteam-bs |
08:03
🔗
|
|
Ivy has quit IRC (Quit: Connection closed for inactivity) |
08:13
🔗
|
|
purplebot has quit IRC (Remote host closed the connection) |
08:14
🔗
|
|
purplebot has joined #archiveteam-bs |
08:26
🔗
|
|
Flashfire has quit IRC (Remote host closed the connection) |
08:26
🔗
|
|
kiska has quit IRC (Remote host closed the connection) |
08:27
🔗
|
|
Flashfire has joined #archiveteam-bs |
08:27
🔗
|
|
kiska has joined #archiveteam-bs |
08:27
🔗
|
|
Fusl__ sets mode: +o kiska |
08:27
🔗
|
|
Fusl_ sets mode: +o kiska |
08:27
🔗
|
|
Fusl sets mode: +o kiska |
09:44
🔗
|
HP_Archiv |
Anyone around? |
09:45
🔗
|
Igloo |
Hi HP_Archiv |
09:45
🔗
|
Igloo |
You can check in bulk with the CDX API |
09:46
🔗
|
HP_Archiv |
I have no idea what that is, I'm new around here |
09:46
🔗
|
Igloo |
Then make a list, provide them in #archivebot and it will go into WBM when the job is done + a period of time |
09:46
🔗
|
Igloo |
https://github.com/internetarchive/wayback/tree/master/wayback-cdx-server |
09:48
🔗
|
HP_Archiv |
Okay thank you ^^ But that wasn't what I was asking right now. So earlier I was in here trying to get help for how to get AB to archive a file that's from a URL on a specific URL I'm going to capture/submit. |
09:48
🔗
|
HP_Archiv |
I want to archive those files, hosted on a Google Drive account, into WBM |
09:48
🔗
|
HP_Archiv |
Any way to do this? |
09:49
🔗
|
HP_Archiv |
I can't scroll back from earlier this afternoon, but basically this - https://hp-games.net/343 |
09:49
🔗
|
Igloo |
So, WBM only works if the files are in their original location |
09:49
🔗
|
HP_Archiv |
That's a mod entry on an Potter game site. The mod file itself, creator by a different person other than the site owner, has the mod file hosted in a Google Dirve and on Yandex. I can submit that URL no problem. I've already done this. However, how do I get archive bot to archive that particular file? |
09:50
🔗
|
Igloo |
https://drive.google.com/open?id=0BxEt9eREFkhlaUZrQ3lKME9LWDg |
09:50
🔗
|
Igloo |
These links? |
09:50
🔗
|
HP_Archiv |
Yup, correct |
09:50
🔗
|
Igloo |
Ok, Leave it with me. ArchiveBot may not do it |
09:50
🔗
|
Igloo |
I need to step away for a few minutes, But I can look for you. Only certain trusted people can upload to the Archive and have it in the WBM |
09:51
🔗
|
Igloo |
Although anyone can upload to IA. |
09:51
🔗
|
HP_Archiv |
Well I hate to keep having to rely on others to fulfill my requests... |
09:51
🔗
|
HP_Archiv |
Hm, okay. But there's a variety of links, just like that page, which contain other links pointing to Google Drive files. |
09:51
🔗
|
HP_Archiv |
This page: https://hp-games.net/343 |
09:51
🔗
|
HP_Archiv |
Oops |
09:52
🔗
|
HP_Archiv |
https://hp-games.net/all-mods |
09:52
🔗
|
HP_Archiv |
That page ^^ |
09:52
🔗
|
HP_Archiv |
I want to archive all of those pod pages & associated page elements, and then archive the hosted files that either link out to Google Drive/Yandex. |
09:53
🔗
|
HP_Archiv |
If you could do that, that would be awesome. But it seems time consuming (unless you're using a script I'm unaware of.) Either way, take your time. |
09:53
🔗
|
HP_Archiv |
mod pages* |
09:56
🔗
|
HP_Archiv |
One last thing, some of those mod pages, ex: this page, https://hp-games.net/mods-dl-downloads, link out to a separate page with many direct links to Google/Yandex. I have no idea how you're going to get all of this links/sub-links, etc. in an easy fashion. But if you need any help, let me know |
10:15
🔗
|
|
SmileyG has joined #archiveteam-bs |
10:19
🔗
|
|
schbirid has joined #archiveteam-bs |
10:21
🔗
|
Igloo |
The problem is that ArchiveBot can't get that those files |
10:22
🔗
|
HP_Archiv |
@Igloo, there's no workaround? |
10:25
🔗
|
Igloo |
Oh there are workarounds, Just looking at options :) |
10:25
🔗
|
|
Smiley has quit IRC (Ping timeout: 745 seconds) |
10:25
🔗
|
HP_Archiv |
Heh, okay. Let me know what you come up with :) |
10:28
🔗
|
HP_Archiv |
Also, this is completely unrelated, but has ArchiveTeam considered the implications of when Myspace goes away, have they archived Myspace already? |
10:29
🔗
|
Igloo |
Myspace was done a while back I am sure |
10:30
🔗
|
Igloo |
https://www.archiveteam.org/index.php?title=Myspace |
10:31
🔗
|
HP_Archiv |
Thanks for the link ^^ apparently because of zero heads up, they were unable to archive a lot, sadly |
10:32
🔗
|
HP_Archiv |
I was just now mulling over what sites out there might need focus and for whatever reason I thought of Myspace, heh |
10:33
🔗
|
HP_Archiv |
focus = attention |
10:33
🔗
|
Igloo |
Yeah, There is a huge list of shit that needs to be looked at |
10:34
🔗
|
HP_Archiv |
Yeah, I actually just thought of one - Urban Dictionary |
10:34
🔗
|
HP_Archiv |
That is a goldmine for future linguistics |
10:35
🔗
|
HP_Archiv |
And it appears that they haven't gotten to that yet |
10:37
🔗
|
HP_Archiv |
Do you have a solution for my HP-Games dilemma? |
10:38
🔗
|
eientei95 |
http://shiva3dengine.com/legacy_forum/index.php Can someone chuck this in for achiving, uses a session ID in the URL and I don't know what to do about it |
10:38
🔗
|
eientei95 |
It's a legacy forum for an old 3D game engine |
10:41
🔗
|
eientei95 |
Igloo: Cheers, guess it was that simple |
10:41
🔗
|
Igloo |
Should be :) |
10:41
🔗
|
Igloo |
Monitoring it |
11:01
🔗
|
|
BlueMax has quit IRC (Quit: Leaving) |
11:28
🔗
|
|
Damme has quit IRC (Read error: Connection reset by peer) |
11:31
🔗
|
HP_Archiv |
I forget, what are the parameters for entering a link into archivebot? It's !ao < or something, I think |
11:34
🔗
|
HP_Archiv |
Never mind, got it |
11:35
🔗
|
betamax |
for future reference: https://archivebot.readthedocs.io/en/latest/ |
11:35
🔗
|
betamax |
although some of those commands require voice / ops, which you'll need to ask for in #archivebot before being able to use |
11:39
🔗
|
HP_Archiv |
@betamax thank you |
11:44
🔗
|
HP_Archiv |
Hm, I can't seem to find the one I was using before |
11:45
🔗
|
HP_Archiv |
Isn't it this, ' !ao < ' ? |
11:49
🔗
|
HP_Archiv |
I got it. |
11:50
🔗
|
HP_Archiv |
@betamax, but using the command I just did is the right way to properly archive an entire site? That's the default, correct? |
11:57
🔗
|
betamax |
!ao < takes the list of urls, and individually archives each of those URLs |
11:57
🔗
|
betamax |
there isn't really a "default" |
11:58
🔗
|
betamax |
however the way to archive an entire site would be !a (which needs voice / ops), this recursively archives a single site (ie: archives the page you give it, then all links to the same domain on that page, then all links from those links, etc..) |
11:59
🔗
|
HP_Archiv |
Oh, I see. So I still need to ask for assistance if I want something done thoroughly? |
12:00
🔗
|
betamax |
probably best just to ask for voice / ops, so you can do it youself |
12:00
🔗
|
betamax |
but you will need to ask the first time, yes |
12:00
🔗
|
HP_Archiv |
Okay understood. I'll ask later on at a more appropriate time. It's 4 am where I am, heh |
12:00
🔗
|
HP_Archiv |
Thanks :) |
12:28
🔗
|
|
wyatt8740 has quit IRC (Read error: Operation timed out) |
12:55
🔗
|
godane |
SketchCow: so your getting some Canon japanese manuals from 2001 |
12:55
🔗
|
godane |
for there printers at the time |
13:23
🔗
|
|
jleclanch has quit IRC (Quit: Connection closed for inactivity) |
13:28
🔗
|
|
Video_ has joined #archiveteam-bs |
13:30
🔗
|
|
Stiletto has joined #archiveteam-bs |
13:30
🔗
|
|
Stilettoo has quit IRC (Read error: Operation timed out) |
13:34
🔗
|
|
Video has quit IRC (Read error: Operation timed out) |
14:01
🔗
|
|
Damme has joined #archiveteam-bs |
14:02
🔗
|
|
Ivy has joined #archiveteam-bs |
14:13
🔗
|
|
mls_ has quit IRC (Remote host closed the connection) |
14:19
🔗
|
|
mls_ has joined #archiveteam-bs |
14:29
🔗
|
|
britmob has quit IRC (Read error: Connection reset by peer) |
14:44
🔗
|
|
Zerote has joined #archiveteam-bs |
15:20
🔗
|
|
britmob has joined #archiveteam-bs |
15:44
🔗
|
|
JH8813269 has quit IRC (Quit: The Lounge - https://thelounge.chat) |
16:17
🔗
|
kpcyrd |
in case I feel like reviving this tweet but for tiktok, can I just create a project page in the wiki? |
16:26
🔗
|
Igloo |
Sire |
16:26
🔗
|
Igloo |
Sure |
16:31
🔗
|
|
X-Scale` has joined #archiveteam-bs |
16:32
🔗
|
|
X-Scale has quit IRC (Ping timeout: 252 seconds) |
16:32
🔗
|
|
X-Scale` is now known as X-Scale |
16:49
🔗
|
|
meltir has joined #archiveteam-bs |
17:07
🔗
|
|
SmileyG has quit IRC (Read error: Operation timed out) |
17:08
🔗
|
|
Smiley has joined #archiveteam-bs |
17:20
🔗
|
|
SmileyG has joined #archiveteam-bs |
17:21
🔗
|
|
Smiley has quit IRC (Read error: Operation timed out) |
17:25
🔗
|
|
SmileyG has quit IRC (Ping timeout: 258 seconds) |
17:25
🔗
|
|
Smiley has joined #archiveteam-bs |
17:39
🔗
|
kpcyrd |
feedback welcome, in case there are any obvious fuckups on my end |
17:44
🔗
|
|
Smiley has quit IRC (Read error: Operation timed out) |
17:46
🔗
|
|
Smiley has joined #archiveteam-bs |
17:55
🔗
|
kpcyrd |
is it ok to create irc channels in advance, with no imminent shutdown/deletion? I have opinions on irc networks.. |
17:56
🔗
|
|
icedice has joined #archiveteam-bs |
18:31
🔗
|
|
X-Scale` has joined #archiveteam-bs |
18:32
🔗
|
|
X-Scale has quit IRC (Ping timeout: 252 seconds) |
18:32
🔗
|
|
X-Scale` is now known as X-Scale |
18:38
🔗
|
|
HP_Archiv has quit IRC (Ping timeout: 260 seconds) |
18:49
🔗
|
|
twigfoot has quit IRC (Read error: Operation timed out) |
18:49
🔗
|
|
HashbangI has quit IRC (Read error: Operation timed out) |
18:49
🔗
|
|
anarcat has quit IRC (Read error: Operation timed out) |
18:49
🔗
|
|
Video has joined #archiveteam-bs |
18:49
🔗
|
|
anarcat has joined #archiveteam-bs |
18:49
🔗
|
|
twigfoot has joined #archiveteam-bs |
18:49
🔗
|
|
closure has quit IRC (Read error: Operation timed out) |
18:49
🔗
|
|
kiskabak has quit IRC (Read error: Operation timed out) |
18:50
🔗
|
|
jake_test has quit IRC (Read error: Operation timed out) |
18:50
🔗
|
|
closure has joined #archiveteam-bs |
18:51
🔗
|
|
balrog has quit IRC (Read error: Operation timed out) |
18:51
🔗
|
|
balrog has joined #archiveteam-bs |
18:51
🔗
|
|
dewdrop has joined #archiveteam-bs |
18:51
🔗
|
|
Dj-Wawa has quit IRC (Read error: Operation timed out) |
18:52
🔗
|
|
Dj-Wawa has joined #archiveteam-bs |
18:53
🔗
|
|
Zerote has quit IRC (Read error: Operation timed out) |
18:54
🔗
|
|
Video_ has quit IRC (Read error: Operation timed out) |
18:54
🔗
|
|
Zerote has joined #archiveteam-bs |
18:55
🔗
|
|
dewdropaw has quit IRC (Read error: Operation timed out) |
18:55
🔗
|
|
legoktm has joined #archiveteam-bs |
18:55
🔗
|
|
ugh has quit IRC (Read error: Connection reset by peer) |
18:55
🔗
|
|
ShellyRol has quit IRC (Read error: Operation timed out) |
18:55
🔗
|
|
systwi_ has joined #archiveteam-bs |
18:55
🔗
|
|
PhrackD has quit IRC (Read error: Connection reset by peer) |
18:55
🔗
|
|
HashbangI has joined #archiveteam-bs |
18:57
🔗
|
|
systwi has quit IRC (Read error: Operation timed out) |
18:57
🔗
|
|
godane has quit IRC (Ping timeout: 612 seconds) |
18:58
🔗
|
|
PhrackD has joined #archiveteam-bs |
18:58
🔗
|
|
ShellyRol has joined #archiveteam-bs |
18:59
🔗
|
|
godane has joined #archiveteam-bs |
19:09
🔗
|
|
jake_test has joined #archiveteam-bs |
19:09
🔗
|
|
ShellyRol has quit IRC (Read error: Connection reset by peer) |
19:12
🔗
|
|
ShellyRol has joined #archiveteam-bs |
19:38
🔗
|
|
manjaro-u has joined #archiveteam-bs |
19:42
🔗
|
JAA |
SketchCow: Just noticed that there are a lot of PyeongChang Olympics items in the AB collection. I suspect those should have their own collection instead; they definitely weren't retrieved through AB. https://archive.org/details/archivebot?and%5B%5D=pyeongchang&sin=&sort=-publicdate |
20:12
🔗
|
|
Ivy has quit IRC (Quit: Connection closed for inactivity) |
20:20
🔗
|
|
britmob has quit IRC (Ping timeout: 252 seconds) |
20:24
🔗
|
|
britmob has joined #archiveteam-bs |
21:22
🔗
|
|
Zerote has quit IRC (Quit: Leaving) |
21:22
🔗
|
|
Zerote_ has quit IRC (Quit: Leaving) |
21:22
🔗
|
|
Zerote has joined #archiveteam-bs |
22:32
🔗
|
|
schbirid has quit IRC (Quit: Leaving) |
22:32
🔗
|
|
bluefoo has quit IRC (Ping timeout: 246 seconds) |
23:00
🔗
|
|
BlueMax has joined #archiveteam-bs |
23:00
🔗
|
|
BlueMaxim has joined #archiveteam-bs |
23:04
🔗
|
|
DogsRNice has joined #archiveteam-bs |
23:06
🔗
|
|
bluefoo has joined #archiveteam-bs |
23:14
🔗
|
|
killsushi has joined #archiveteam-bs |
23:19
🔗
|
|
BlueMaxim has quit IRC (Quit: Leaving) |
23:44
🔗
|
|
britmob has quit IRC (Read error: Operation timed out) |