| Time |
Nickname |
Message |
|
00:00
🔗
|
|
enowaldo has quit IRC (Ping timeout: 492 seconds) |
|
00:09
🔗
|
|
apache2 has quit IRC (Remote host closed the connection) |
|
00:09
🔗
|
|
apache2 has joined #archiveteam-bs |
|
00:12
🔗
|
JAA |
Hrm, testing something to try and fix that, but I can't seem to hammer them very hard. |
|
00:13
🔗
|
JAA |
Getting some "Bad Gateway" errors now as well, ew. |
|
00:17
🔗
|
JAA |
Well, I guess I'm going to stop here. Feel free to ignore the /posts/ URLs on the AB jobs if you think that's better. I'm not sure I'll be around again before the deadline. |
|
00:18
🔗
|
|
bitBaron has quit IRC (Quit: Bye.) |
|
00:22
🔗
|
godane |
so i threw out most of the pc computing magazines that i have scanned |
|
00:23
🔗
|
godane |
this was just so i can have space for more magazines to scan later |
|
00:24
🔗
|
godane |
a lot of them were water damaged so i don't feel that bad about it |
|
00:34
🔗
|
|
Jopik has quit IRC (Remote host closed the connection) |
|
00:34
🔗
|
|
Jopik has joined #archiveteam-bs |
|
00:34
🔗
|
|
Zerote has quit IRC (Ping timeout: 260 seconds) |
|
00:55
🔗
|
|
BlueMax has joined #archiveteam-bs |
|
01:01
🔗
|
|
jut has quit IRC (Read error: Connection reset by peer) |
|
01:02
🔗
|
|
jut has joined #archiveteam-bs |
|
01:08
🔗
|
|
enowaldo has joined #archiveteam-bs |
|
01:20
🔗
|
|
enowaldo has quit IRC (Read error: Operation timed out) |
|
01:50
🔗
|
|
Despatche has quit IRC (Quit: Read error: Connection reset by deer) |
|
02:08
🔗
|
|
enowaldo has joined #archiveteam-bs |
|
02:17
🔗
|
|
enowaldo has quit IRC (Ping timeout: 492 seconds) |
|
02:27
🔗
|
|
dashcloud has joined #archiveteam-bs |
|
02:56
🔗
|
|
Dimtree has quit IRC () |
|
03:00
🔗
|
|
m007a83_ has joined #archiveteam-bs |
|
03:00
🔗
|
|
drcd_ has joined #archiveteam-bs |
|
03:02
🔗
|
|
deevious has quit IRC (Ping timeout: 252 seconds) |
|
03:02
🔗
|
|
coderobe has quit IRC (Read error: Connection reset by peer) |
|
03:02
🔗
|
|
Flashfire has quit IRC (Read error: Connection reset by peer) |
|
03:02
🔗
|
|
ColdIce has quit IRC (Quit: Ping timeout (120 seconds)) |
|
03:02
🔗
|
|
Terbium has quit IRC (Ping timeout: 252 seconds) |
|
03:02
🔗
|
|
deevious has joined #archiveteam-bs |
|
03:02
🔗
|
|
coderobe has joined #archiveteam-bs |
|
03:02
🔗
|
|
jut has quit IRC (Ping timeout: 252 seconds) |
|
03:02
🔗
|
|
odemgi_ has quit IRC (Ping timeout: 252 seconds) |
|
03:02
🔗
|
|
m007a83 has quit IRC (Ping timeout: 252 seconds) |
|
03:02
🔗
|
|
odemgi_ has joined #archiveteam-bs |
|
03:03
🔗
|
|
Flashfire has joined #archiveteam-bs |
|
03:03
🔗
|
|
ColdIce has joined #archiveteam-bs |
|
03:03
🔗
|
|
kiska has quit IRC (Ping timeout: 252 seconds) |
|
03:03
🔗
|
|
drcd has quit IRC (Ping timeout: 252 seconds) |
|
03:03
🔗
|
|
kiska has joined #archiveteam-bs |
|
03:04
🔗
|
|
svchfoo3 sets mode: +o kiska |
|
03:04
🔗
|
|
svchfoo1 sets mode: +o kiska |
|
03:04
🔗
|
|
jut has joined #archiveteam-bs |
|
03:07
🔗
|
|
Terbium has joined #archiveteam-bs |
|
03:08
🔗
|
|
Dimtree has joined #archiveteam-bs |
|
03:15
🔗
|
|
odemgi has joined #archiveteam-bs |
|
03:17
🔗
|
|
odemgi_ has quit IRC (Ping timeout: 252 seconds) |
|
03:24
🔗
|
|
odemg has quit IRC (Ping timeout: 615 seconds) |
|
03:30
🔗
|
|
odemg has joined #archiveteam-bs |
|
03:32
🔗
|
|
qw3rty119 has joined #archiveteam-bs |
|
03:36
🔗
|
|
qw3rty118 has quit IRC (Read error: Operation timed out) |
|
03:44
🔗
|
|
BlueMax has quit IRC (Read error: Connection reset by peer) |
|
03:53
🔗
|
|
drcd_ is now known as drcd |
|
04:45
🔗
|
|
drcd has quit IRC (Read error: Connection reset by peer) |
|
05:01
🔗
|
|
m007a83_ is now known as m007a83 |
|
05:12
🔗
|
|
enowaldo has joined #archiveteam-bs |
|
05:24
🔗
|
|
Frogging has quit IRC (Read error: Operation timed out) |
|
05:24
🔗
|
|
Frogging has joined #archiveteam-bs |
|
05:24
🔗
|
|
balrog has quit IRC (Read error: Operation timed out) |
|
05:24
🔗
|
|
closure has quit IRC (Read error: Operation timed out) |
|
05:24
🔗
|
|
ivan has quit IRC (Read error: Operation timed out) |
|
05:24
🔗
|
|
JAA has quit IRC (Read error: Operation timed out) |
|
05:24
🔗
|
|
closure has joined #archiveteam-bs |
|
05:25
🔗
|
|
wabu has quit IRC (Read error: Operation timed out) |
|
05:25
🔗
|
|
balrog has joined #archiveteam-bs |
|
05:25
🔗
|
|
ivan has joined #archiveteam-bs |
|
05:25
🔗
|
|
simon816 has quit IRC (Ping timeout: 246 seconds) |
|
05:25
🔗
|
|
svchfoo1 has quit IRC (Read error: Operation timed out) |
|
05:25
🔗
|
|
enowaldo has quit IRC (Read error: Operation timed out) |
|
05:25
🔗
|
|
Exairnous has quit IRC (Read error: Operation timed out) |
|
05:25
🔗
|
|
SynMonger has quit IRC (Read error: Operation timed out) |
|
05:26
🔗
|
|
Exairnous has joined #archiveteam-bs |
|
05:26
🔗
|
|
fredgido has quit IRC (Ping timeout: 600 seconds) |
|
05:26
🔗
|
|
c4rc4s has quit IRC (Read error: Operation timed out) |
|
05:26
🔗
|
|
swebb has quit IRC (Read error: Operation timed out) |
|
05:26
🔗
|
|
SynMonger has joined #archiveteam-bs |
|
05:27
🔗
|
|
Hintswen has quit IRC (Ping timeout: 246 seconds) |
|
05:27
🔗
|
|
Hintswen has joined #archiveteam-bs |
|
05:28
🔗
|
|
wp494 has quit IRC (Read error: Operation timed out) |
|
05:28
🔗
|
|
swebb has joined #archiveteam-bs |
|
05:29
🔗
|
|
tech234a has joined #archiveteam-bs |
|
05:32
🔗
|
|
wp494 has joined #archiveteam-bs |
|
05:35
🔗
|
|
c4rc4s has joined #archiveteam-bs |
|
05:35
🔗
|
|
simon816 has joined #archiveteam-bs |
|
05:35
🔗
|
|
svchfoo1 has joined #archiveteam-bs |
|
05:35
🔗
|
|
Fusl sets mode: +o svchfoo1 |
|
05:38
🔗
|
|
JAA has joined #archiveteam-bs |
|
05:38
🔗
|
|
Fusl sets mode: +o JAA |
|
05:39
🔗
|
|
bakJAA sets mode: +o JAA |
|
05:39
🔗
|
|
wabu has joined #archiveteam-bs |
|
05:53
🔗
|
|
JAA has quit IRC (Read error: Operation timed out) |
|
05:54
🔗
|
|
wabu has quit IRC (Read error: Operation timed out) |
|
05:55
🔗
|
|
svchfoo1 has quit IRC (Read error: Operation timed out) |
|
05:55
🔗
|
|
simon816 has quit IRC (Read error: Operation timed out) |
|
05:56
🔗
|
|
c4rc4s has quit IRC (Read error: Operation timed out) |
|
05:58
🔗
|
|
killsushi has quit IRC (Quit: Leaving) |
|
05:58
🔗
|
|
simon816 has joined #archiveteam-bs |
|
05:58
🔗
|
|
c4rc4s has joined #archiveteam-bs |
|
05:59
🔗
|
|
svchfoo1 has joined #archiveteam-bs |
|
06:01
🔗
|
|
JAA has joined #archiveteam-bs |
|
06:02
🔗
|
|
wabu has joined #archiveteam-bs |
|
06:09
🔗
|
|
d5f4a3622 has quit IRC (Read error: Connection reset by peer) |
|
06:12
🔗
|
|
d5f4a3622 has joined #archiveteam-bs |
|
06:18
🔗
|
kiska |
JAA: arkiver: This is what I have, https://github.com/kiska3/sola-grab |
|
06:44
🔗
|
|
BlueMax has joined #archiveteam-bs |
|
06:52
🔗
|
|
Exairnous has quit IRC (Ping timeout: 265 seconds) |
|
07:06
🔗
|
|
Mata has quit IRC (Ping timeout: 600 seconds) |
|
07:13
🔗
|
|
enowaldo has joined #archiveteam-bs |
|
07:22
🔗
|
|
enowaldo has quit IRC (Ping timeout: 492 seconds) |
|
07:22
🔗
|
godane |
SketchCow: i think you need to fix this cause they're russian magazines not english ones: https://archive.org/details/magazines_russian?and[]=languageSorter%3A%22English%22 |
|
07:38
🔗
|
godane |
Also someone may have put the rest of Byte Magazine here: https://vintageapple.org/byte/ |
|
07:39
🔗
|
|
tech234a has quit IRC (Quit: Connection closed for inactivity) |
|
08:18
🔗
|
kiska |
Hrm... I'll run the tracker on my domain, and I can start a crawl, hopefully of sola.ai. I am still trying to write the damn thing |
|
08:19
🔗
|
|
Reventlov has quit IRC (Quit: WeeChat 2.4) |
|
08:34
🔗
|
|
jesso has quit IRC (Quit: jesso) |
|
08:39
🔗
|
|
godane1 has joined #archiveteam-bs |
|
08:40
🔗
|
|
godane has quit IRC (Ping timeout: 615 seconds) |
|
08:43
🔗
|
|
jesso has joined #archiveteam-bs |
|
08:45
🔗
|
|
JAA has quit IRC (Reconnecting) |
|
08:45
🔗
|
|
JAA has joined #archiveteam-bs |
|
08:45
🔗
|
|
Fusl sets mode: +o JAA |
|
08:45
🔗
|
|
bakJAA sets mode: +o JAA |
|
09:01
🔗
|
|
RichardG has quit IRC (Quit: Keyboard not found, press F1 to continue) |
|
09:03
🔗
|
|
RichardG has joined #archiveteam-bs |
|
09:07
🔗
|
|
BlueMax has quit IRC (Quit: Leaving) |
|
09:19
🔗
|
|
icedice has joined #archiveteam-bs |
|
09:20
🔗
|
|
PhrackD has quit IRC (Read error: Operation timed out) |
|
09:20
🔗
|
|
PhrackD has joined #archiveteam-bs |
|
09:21
🔗
|
|
icedice has quit IRC (Client Quit) |
|
09:24
🔗
|
kiska |
JAA: I've put all the code I've written into my repo |
|
09:35
🔗
|
godane1 |
SketchCow: btw i think the macworld pdfs got redone/restored by vintageapple.org |
|
09:35
🔗
|
godane1 |
there is like an index in the pdfs now |
|
09:35
🔗
|
godane1 |
and they're smaller too |
|
09:39
🔗
|
|
netsound has joined #archiveteam-bs |
|
09:41
🔗
|
|
icedice has joined #archiveteam-bs |
|
09:44
🔗
|
|
Odd0002_ has joined #archiveteam-bs |
|
09:45
🔗
|
|
Odd0002 has quit IRC (Ping timeout: 252 seconds) |
|
09:45
🔗
|
|
Odd0002_ is now known as Odd0002 |
|
09:48
🔗
|
|
deevious has quit IRC (Read error: Connection reset by peer) |
|
10:30
🔗
|
JAA |
kiska: Aye, but we only have 1.5 hours left... |
|
10:37
🔗
|
JAA |
At least my API job grabbed about 1.6 GB of data: https://archive.fart.website/archivebot/viewer/job/c46az |
|
10:37
🔗
|
JAA |
And the others are currently grabbing images and stuff. |
|
10:38
🔗
|
VADemon |
L133: newurl = "https://api.solacore.net/users/" .. uuid .. "/posts/?limit=30&offset=30" |
|
10:38
🔗
|
VADemon |
kiska: don't you want to iterate through the offset to get the rest there? |
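A minimal sketch of the offset iteration asked about here, assuming every page URL has the form quoted from L133 and that the next page is reached by adding `limit` to `offset`. The helper name `next_page_url` is hypothetical and not taken from sola.lua; deciding when to stop (for example, on an empty response) is left to the caller.

```lua
-- Hypothetical helper: derive the next page URL from the current one.
-- Assumption: the API paginates by adding `limit` to `offset`.
local function next_page_url(url)
  local prefix, limit, offset = string.match(
    url, "^(https?://api%.solacore%.net/users/[^/]+/posts/)%?limit=(%d+)&offset=(%d+)$")
  if not prefix then
    return nil -- not a paginated posts URL
  end
  local next_offset = tonumber(offset) + tonumber(limit)
  return string.format("%s?limit=%s&offset=%d", prefix, limit, next_offset)
end

-- "some-uuid" is a placeholder, not a real user id.
print(next_page_url("https://api.solacore.net/users/some-uuid/posts/?limit=30&offset=30"))
```

Calling the helper on each fetched page URL yields offsets 60, 90, and so on, instead of the single hard-coded `offset=30`.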
|
10:44
🔗
|
VADemon |
SketchCow: and those russian magazines have garbage OCR text because of lang=ENG |
|
10:55
🔗
|
kiska |
VADemon: yes I do |
|
10:56
🔗
|
VADemon |
is it done elsewhere or do you need help? |
|
10:59
🔗
|
kiska |
I am coding that right now |
|
11:05
🔗
|
kiska |
I just pushed something, can you check if that looks ok? |
|
11:32
🔗
|
kiska |
I am running it locally and it looks like it's grabbing the expected contents |
|
11:32
🔗
|
kiska |
Can someone check this? |
|
11:35
🔗
|
|
tomaspark has quit IRC (Read error: Operation timed out) |
|
11:38
🔗
|
|
enowaldo has joined #archiveteam-bs |
|
11:40
🔗
|
|
wyatt8740 has quit IRC (Read error: Operation timed out) |
|
11:50
🔗
|
VADemon |
kiska: L142 -> to "local nextpage = string.match(html, "/users/[^/]+/posts/%?limit=(%d+)&offset=(%d+)")" |
|
11:51
🔗
|
kiska |
Changing |
|
11:51
🔗
|
VADemon |
[%d] would be a pattern of digits, same as just %d. If you needed to make a better pattern you could use [%dabcdf] |
|
11:51
🔗
|
VADemon |
Hold on, just remove the brackets on %d altogether kiska |
|
11:52
🔗
|
kiska |
I am on a train so can you give me that line again? |
|
11:53
🔗
|
VADemon |
https://github.com/kiska3/sola-grab/blob/master/sola.lua#L142 |
|
11:53
🔗
|
VADemon |
remove [] around %d |
|
11:53
🔗
|
VADemon |
[] is a pattern definition (a character set), -+?* don't work as quantifiers inside it. You define a set [abc] then apply -+?* to it: [abc]+ |
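The points about brackets and quantifiers can be checked with a few lines of plain Lua. This is only an illustration: the sample strings and the `some-uuid` placeholder are made up, and the last pattern is the one suggested above for L142.

```lua
-- A set in brackets matches one character; quantifiers go after the closing bracket.
local s = "offset=120&x"

-- [%d] and %d are equivalent: both match a single digit.
assert(string.match(s, "[%d]+") == string.match(s, "%d+"))
print(string.match(s, "%d+"))              -- 120

-- Inside a set, + is a literal character, so "[%d+]" matches ONE char: a digit or '+'.
print(string.match("1+2", "[%d+]"))        -- 1

-- Apply the quantifier to the whole set instead.
print(string.match("cabbage!", "[abc]+"))  -- cabba

-- The pattern suggested for L142 returns both captures as strings.
local limit, offset = string.match(
  "/users/some-uuid/posts/?limit=30&offset=60",
  "/users/[^/]+/posts/%?limit=(%d+)&offset=(%d+)")
print(limit, offset)
```

Because the pattern has two captures, `string.match` returns two values, so assigning the result to a single variable such as `nextpage` keeps only the `limit` capture.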
|
11:55
🔗
|
|
enowaldo has quit IRC (Read error: Operation timed out) |
|
11:59
🔗
|
|
kiska1 has quit IRC (Ping timeout (120 seconds)) |
|
11:59
🔗
|
|
kiska1 has joined #archiveteam-bs |
|
11:59
🔗
|
kiska |
Looks like they've killed the servers already |
|
11:59
🔗
|
|
svchfoo3 sets mode: +o kiska1 |
|
12:00
🔗
|
|
Zerote has joined #archiveteam-bs |
|
12:07
🔗
|
|
wyatt8740 has joined #archiveteam-bs |
|
12:15
🔗
|
|
wp494 has quit IRC (Quit: LOUD UNNECESSARY QUIT MESSAGES) |
|
12:20
🔗
|
|
VADemon has quit IRC (Quit: left4dead) |
|
12:21
🔗
|
|
Despatche has joined #archiveteam-bs |
|
12:22
🔗
|
|
wp494 has joined #archiveteam-bs |
|
12:22
🔗
|
JAA |
F |
|
12:27
🔗
|
kiska |
I started grabbing about 2 hrs ago, but I found many mistakes in my code |
|
12:28
🔗
|
kiska |
Apparently it's not down. Can someone access sola.ai and see if the content is still there? |
|
12:29
🔗
|
kiska |
That being said, wget isn't getting data |
|
12:29
🔗
|
JAA |
Uhm yeah, was just about to say that. |
|
12:29
🔗
|
JAA |
Still working for me. |
|
12:29
🔗
|
kiska |
I see |
|
12:30
🔗
|
kiska |
Can you clone my repo and run the warrior pipeline on the machines I gave you? |
|
12:31
🔗
|
kiska |
I haven't specified an rsync target in the test tracker so rsync errors are expected. Hetzner instances don't ship with rsync installed so apt install that |
|
12:31
🔗
|
kiska |
I've pre-packaged wget lua so there is no need to build it |
|
12:33
🔗
|
|
wyatt8740 has quit IRC (Read error: Operation timed out) |
|
12:38
🔗
|
JAA |
I fucking love tmux synchronize-panes. |
|
12:38
🔗
|
kiska |
xD |
|
12:40
🔗
|
|
wyatt8740 has joined #archiveteam-bs |
|
12:41
🔗
|
JAA |
Did you do something on one of the machines? They behave differently. |
|
12:41
🔗
|
JAA |
.88 has a pip installed in /usr/local? |
|
12:42
🔗
|
kiska |
Huh? |
|
12:42
🔗
|
kiska |
I chucked an ao pipeline on it to clear the AB backlog |
|
12:43
🔗
|
JAA |
Oh |
|
12:44
🔗
|
JAA |
Up and running |
|
12:47
🔗
|
JAA |
Actually, taking it down again because I forgot something. |
|
12:48
🔗
|
JAA |
Ok, up again for real now. |
|
12:49
🔗
|
kiska |
Huh? |
|
12:50
🔗
|
JAA |
Had it running directly in SSH instead of in a tmux session. |
|
12:53
🔗
|
|
odemgi has quit IRC (Read error: Connection reset by peer) |
|
12:53
🔗
|
|
odemgi has joined #archiveteam-bs |
|
12:58
🔗
|
kiska |
Ah I see |
|
13:05
🔗
|
godane1 |
SketchCow: i noticed the infoworld magazines i uploaded are dark now |
|
13:06
🔗
|
kiska |
JAA: Is it grabbing anything? |
|
13:06
🔗
|
JAA |
kiska: Nope, tracker rate limited. |
|
13:07
🔗
|
godane1 |
what i find funny is that they're from google books and the IA/American Libraries collection has their files up: https://archive.org/details/bub_gb_yjAEAAAAMBAJ |
|
13:07
🔗
|
kiska |
I am trying, but wget isn't grabbing anything |
|
13:08
🔗
|
godane1 |
SketchCow: so my question is why can the google books rips like those still be up but my rips that put metadata and fixes into the rips be taken down |
|
13:15
🔗
|
JAA |
kiska: Same here, got some jobs now but it isn't grabbing anything. |
|
13:16
🔗
|
JAA |
Ah no, it is grabbing stuff, just not printing anything to the console. |
|
13:16
🔗
|
JAA |
It's very slow though. |
|
13:17
🔗
|
kiska |
... what have I done... |
|
13:19
🔗
|
kiska |
Can you check the code and see if there are any issues with it? Otherwise it'll be down to parsing the html |
|
13:20
🔗
|
JAA |
I'll have a look if I see anything obvious. |
|
13:22
🔗
|
JAA |
kiska: There's a syntax error in the Lua script. |
|
13:22
🔗
|
JAA |
lua: sola.lua:195: '=' expected near 'end' |
|
13:22
🔗
|
JAA |
Missing return on line 194 |
|
13:22
🔗
|
kiska |
I typed this on a train.... |
|
13:23
🔗
|
JAA |
Also, what's the matter with api%/solacore%.net? Is that slash supposed to be a period? |
|
13:23
🔗
|
kiska |
Yeah... |
|
13:24
🔗
|
marked |
an item is a profile? |
|
13:24
🔗
|
|
icedice has quit IRC (Leaving) |
|
13:24
🔗
|
JAA |
Yes |
|
13:25
🔗
|
godane1 |
latest scan : https://archive.org/details/good-food-magazine-1987-07 |
|
13:31
🔗
|
godane1 |
latest scan : https://archive.org/details/enjoy-your-cockatiel-pet-library |
|
13:31
🔗
|
godane1 |
latest scan : https://archive.org/details/fin-facts-aquarium-handbook-1992-wardley |
|
13:33
🔗
|
kiska |
It looks like 30 connections from me is 502'ing their service xD |
|
13:34
🔗
|
JAA |
Yeah, that started happening yesterday evening. I hammered them with 32 connections when I was scraping user profiles. |
|
13:35
🔗
|
kiska |
So limit = ~60 connections |
|
13:37
🔗
|
kiska |
I don't get how it's producing this url... https://api.solacore.net/users/items/posts/?limit=30&offset=30 |
|
13:38
🔗
|
JAA |
Check the referrer in the WARC. |
|
13:42
🔗
|
|
ayanami_ has joined #archiveteam-bs |
|
13:42
🔗
|
ayanami_ |
https://www.flogao.com.br Check this out. Brazilian site shutting down in June |
|
13:43
🔗
|
kiska |
Ahhhhh |
|
13:44
🔗
|
ayanami_ |
..? sorry |
|
13:45
🔗
|
JAA |
Thanks for letting us know. I've put it on my list to investigate. kiska may be screaming about something unrelated. |
|
13:47
🔗
|
kiska |
... Referer: https://sola.ai/erenjager |
|
13:48
🔗
|
JAA |
:-/ |
|
13:48
🔗
|
JAA |
Yeah, saw those on the AB job before as well (including &) |
|
13:49
🔗
|
|
enowaldo has joined #archiveteam-bs |
|
13:49
🔗
|
JAA |
Away again for a bit. |
|
13:50
🔗
|
kiska |
I am just going to use the ignore-list on that url |
|
14:17
🔗
|
|
enowaldo has quit IRC (Read error: Operation timed out) |
|
14:24
🔗
|
kiska |
JAA: Can you update the machines I gave you with the latest commit? I think I fixed whatever was causing it to not function. I was basically recursing the entirety of sola.ai with this "^https?://sola.ai/" and not clamping down |
|
14:29
🔗
|
JAA |
kiska: Yup, up and running again. |
|
14:31
🔗
|
kiska |
I clamped it down to "^https?://sola.ai/" .. item_value so hopefully it'll grab things still |
|
14:34
🔗
|
JAA |
How are the post URLs treated? |
|
14:34
🔗
|
JAA |
Or rather, which post URLs are retrieved? |
|
14:34
🔗
|
JAA |
/user/$slug or /posts/$postid ? |
|
14:35
🔗
|
JAA |
I see some of the former, but I wonder if that's only those which are linked directly in the HTML of the profile page or also the pagination. |
|
14:36
🔗
|
kiska |
I am going to use this in the httploop_result https://pastebin.com/RstQFpdm and in the allowed function I'll include string.match(url, "^https?://sola.ai/posts") |
|
14:37
🔗
|
kiska |
But I haven't pushed out that change yet, since I don't know what will occur |
|
14:39
🔗
|
|
wyatt8740 has quit IRC (Read error: Operation timed out) |
|
14:42
🔗
|
kiska |
JAA: Can you check this commit and see if it does allow the /posts/ URLs https://github.com/kiska3/sola-grab/commit/80d3bc57d2062de1659574aea3bab91a17b67432 |
|
14:43
🔗
|
JAA |
This can never be true, can it? https://github.com/kiska3/sola-grab/blob/80d3bc57d2062de1659574aea3bab91a17b67432/sola.lua#L64-L68 |
|
14:43
🔗
|
kiska |
Yeah I am thinking about it |
|
14:44
🔗
|
JAA |
I'd just add another 'or string.match' for /posts. |
|
14:44
🔗
|
kiska |
So if I remove the inner if statement, it should become true, and grab the /posts/ url, but if it doesn't match the item_value it'll be rejected on the redirect |
|
14:46
🔗
|
kiska |
So the if statement should be "string.match(url, "^https?://sola%.ai/posts") or string.match(url, "^https?://sola%.ai/" .. item_value)" |
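A rough sketch of that check as a standalone function, not the code in sola.lua. Two things here are assumptions: the function name `allowed` and its two-argument signature are stand-ins, and `escape_pattern` is an added helper so that a slug containing characters such as `.` or `-` is matched literally. Note the escaped `%.` in the host: an unescaped `.` in a Lua pattern matches any character.

```lua
-- Escape Lua pattern magic characters so a slug is matched literally.
local function escape_pattern(text)
  return (string.gsub(text, "([%^%$%(%)%%%.%[%]%*%+%-%?])", "%%%1"))
end

-- Hypothetical stand-in for the allowed() check discussed above.
local function allowed(url, item_value)
  if string.match(url, "^https?://sola%.ai/posts") then
    return true
  end
  if string.match(url, "^https?://sola%.ai/" .. escape_pattern(item_value)) then
    return true
  end
  return false
end

print(allowed("https://sola.ai/posts/12345", "erenjager"))  -- true
print(allowed("https://sola.ai/erenjager", "erenjager"))    -- true
print(allowed("https://sola.ai/someoneelse", "erenjager"))  -- false
```

Both patterns are prefix matches, so a profile whose slug merely starts with `item_value` would also pass; anchoring the end of the slug would tighten that if it matters.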
|
14:48
🔗
|
kiska |
Yeah that should now fix the issue of /posts/ not being grabbed |
|
14:49
🔗
|
|
wabu has quit IRC (Read error: Operation timed out) |
|
14:49
🔗
|
JAA |
Yeah |
|
14:49
🔗
|
kiska |
Well let's resume the tracker with the change. I didn't increment the pipeline version, I should probably do that |
|
14:50
🔗
|
JAA |
Yes please |
|
14:51
🔗
|
kiska |
Now resuming with 20190410.02 |
|
14:52
🔗
|
JAA |
Broken again |
|
14:52
🔗
|
JAA |
lua: sola.lua:198: '=' expected near 'end' |
|
14:52
🔗
|
|
wabu has joined #archiveteam-bs |
|
14:53
🔗
|
kiska |
... I keep doing that |
|
14:54
🔗
|
|
icedice has joined #archiveteam-bs |
|
14:55
🔗
|
JAA |
Test it with 'lua sola.lua'. |
|
14:56
🔗
|
JAA |
If you get something like 'lua: sola.lua:6: bad argument #1 to 'gsub' (string expected, got nil)', at least the syntax's right. ;-) |
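An alternative to running the whole script is to compile it without executing it. The sketch below does that for a source string; `check_syntax` is a hypothetical helper, and the `broken` snippet is an invented example of a bare expression where a `return` was presumably intended. The exact wording of the parser message depends on the Lua version.

```lua
-- Compile a chunk without running it; returns true, or false plus the parser's message.
-- loadstring exists in Lua 5.1; load accepts a string from 5.2 onward.
local compile = loadstring or load

local function check_syntax(source, chunkname)
  local chunk, err = compile(source, chunkname)
  if chunk then
    return true
  end
  return false, err
end

-- A bare expression where a statement is expected, e.g. a missing `return`.
local broken = "local function f()\n  wget.actions.ABORT\nend\n"
print(check_syntax(broken, "=example"))
print(check_syntax("local function f()\n  return 1\nend\n", "=example"))
```

For a file on disk, `loadfile("sola.lua")` compiles in the same way and likewise returns `nil` plus a message on a syntax error, so it could be run before pushing a commit.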
|
14:56
🔗
|
kiska |
Should now work.... |
|
14:56
🔗
|
JAA |
Yup, doing something at least. |
|
14:57
🔗
|
kiska |
At least we want something, cause it'll grab something xD |
|
14:57
🔗
|
JAA |
And I do see /posts URLs in the output. |
|
14:57
🔗
|
JAA |
That's the spirit almost 3 hours after the deadline. |
|
14:58
🔗
|
kiska |
Well they did post their shutdown statement on April Fools'... |
|
14:58
🔗
|
JAA |
Yeah |
|
14:58
🔗
|
kiska |
Let me requeue the ones without the /posts/ url |
|
14:59
🔗
|
JAA |
But they didn't post any update afterwards. |
|
15:15
🔗
|
kiska |
*sigh* |
|
15:15
🔗
|
kiska |
What is it doing now... |
|
15:21
🔗
|
|
enowaldo has joined #archiveteam-bs |
|
15:25
🔗
|
|
Verified_ has quit IRC (Quit: Quit) |
|
15:25
🔗
|
|
Verified_ has joined #archiveteam-bs |
|
15:27
🔗
|
|
bitspill has quit IRC (Quit: Connection closed for inactivity) |
|
16:10
🔗
|
|
tech234a has joined #archiveteam-bs |
|
16:21
🔗
|
|
icedice2 has joined #archiveteam-bs |
|
16:25
🔗
|
|
icedice has quit IRC (Ping timeout: 252 seconds) |
|
16:42
🔗
|
|
enowaldo has quit IRC (Read error: Operation timed out) |
|
16:46
🔗
|
|
PhrackD- has joined #archiveteam-bs |
|
16:47
🔗
|
|
PhrackD has quit IRC (Read error: Operation timed out) |
|
16:47
🔗
|
|
PhrackD- is now known as PhrackD |
|
16:48
🔗
|
ayanami_ |
I wonder if that Sola shutdown is April Fools after all - let's hope not |
|
16:48
🔗
|
ayanami_ |
Might have just been bad timing |
|
16:52
🔗
|
|
icedice2 has quit IRC (Quit: Leaving) |
|
16:52
🔗
|
kiska |
it's past the deadline so xD |
|
16:53
🔗
|
kiska |
Also I have no clue what my script is doing now... |
|
16:53
🔗
|
|
icedice has joined #archiveteam-bs |
|
17:07
🔗
|
|
Hani111 has joined #archiveteam-bs |
|
17:07
🔗
|
|
Hani has quit IRC (Read error: Operation timed out) |
|
17:07
🔗
|
|
Hani111 is now known as Hani |
|
17:09
🔗
|
|
enowaldo has joined #archiveteam-bs |
|
17:14
🔗
|
|
Hani111 has joined #archiveteam-bs |
|
17:16
🔗
|
|
Hani has quit IRC (Ping timeout: 268 seconds) |
|
17:20
🔗
|
|
Hani111 has quit IRC (Read error: Operation timed out) |
|
17:21
🔗
|
|
enowaldo has quit IRC (Read error: Operation timed out) |
|
17:24
🔗
|
|
Hani has joined #archiveteam-bs |
|
17:29
🔗
|
|
Exairnous has joined #archiveteam-bs |
|
17:40
🔗
|
t3 |
VoynichCr: Hi. I wanted to make a wiki page using HadeanEon. How do I do that? |
|
17:41
🔗
|
t3 |
VoynichCr: I've already sent you a message on the other channel, but it would be easier to discuss it here. |
|
17:42
🔗
|
t3 |
I've made https://www.archiveteam.org/index.php?title=ArchiveBot/Educational_institutions |
|
17:42
🔗
|
t3 |
I will eventually make others too. |
|
17:43
🔗
|
VoynichCr |
bot just updated the table t3 |
|
17:43
🔗
|
VoynichCr |
sure, create all pages you need |
|
17:44
🔗
|
t3 |
VoynichCr: How do I make the bot create the page? |
|
17:45
🔗
|
VoynichCr |
you can't, you have to create the /list, and the mainpage like you did, and wait 1 day |
|
17:46
🔗
|
t3 |
Is there a way I can make the bot add more items? |
|
17:46
🔗
|
VoynichCr |
modify /list and add more links there |
|
17:46
🔗
|
t3 |
So there is no IRC bot to control it, I presume? |
|
17:46
🔗
|
VoynichCr |
no |
|
17:46
🔗
|
t3 |
And when I archive a website, the bot will automatically update the list? |
|
17:47
🔗
|
VoynichCr |
it updates the table, yeah |
|
17:47
🔗
|
t3 |
Oh okay! Thanks. |
|
17:48
🔗
|
VoynichCr |
you are welcome |
|
17:48
🔗
|
t3 |
Does it deduplicate the list? |
|
17:48
🔗
|
VoynichCr |
yes |
|
17:49
🔗
|
t3 |
Awesome! |
|
17:50
🔗
|
t3 |
So how is it added to the ArchiveBot template table on the bottom? |
|
17:51
🔗
|
VoynichCr |
t3: you have to add it by hand, click the [e] link |
|
18:18
🔗
|
|
Exairnous has quit IRC (Ping timeout: 252 seconds) |
|
18:20
🔗
|
|
tech234a has quit IRC (Quit: Connection closed for inactivity) |
|
18:33
🔗
|
|
Exairnous has joined #archiveteam-bs |
|
18:48
🔗
|
|
Oddly has quit IRC (Read error: Operation timed out) |
|
18:51
🔗
|
|
VADemon has joined #archiveteam-bs |
|
18:54
🔗
|
|
m007a83 has quit IRC (Read error: Connection reset by peer) |
|
18:58
🔗
|
|
PhrackD has quit IRC (Read error: Operation timed out) |
|
19:00
🔗
|
|
PhrackD has joined #archiveteam-bs |
|
19:00
🔗
|
|
enowaldo has joined #archiveteam-bs |
|
19:25
🔗
|
|
Exairnous has quit IRC (Read error: Operation timed out) |
|
19:28
🔗
|
|
killsushi has joined #archiveteam-bs |
|
19:41
🔗
|
t3 |
VoynichCr: Thanks. So I've added the link to the Educational institutions wiki page. Thanks for the help! I'm just going to have to wait for the HadeanEon bot to update the page. A whole day seems like a long time. |
|
19:59
🔗
|
|
killsushi has quit IRC (Quit: Leaving) |
|
20:07
🔗
|
|
Hani has quit IRC (Ping timeout: 255 seconds) |
|
20:07
🔗
|
marked |
kiska JAA : https://sola.ai/ is 503-ing |
|
20:09
🔗
|
|
PhrackD has quit IRC (Read error: Operation timed out) |
|
20:09
🔗
|
marked |
ERROR 503: Service Unavailable: Back-end server is at capacity. |
|
20:09
🔗
|
t3 |
marked: I'll slow it down. |
|
20:09
🔗
|
|
PhrackD has joined #archiveteam-bs |
|
20:11
🔗
|
|
Hani has joined #archiveteam-bs |
|
20:13
🔗
|
marked |
slow down what? the tracker's not moving |
|
20:14
🔗
|
marked |
oh wait, it's doing check-outs but not check-ins? |
|
20:15
🔗
|
|
Hani111 has joined #archiveteam-bs |
|
20:20
🔗
|
|
Hani has quit IRC (Read error: Operation timed out) |
|
20:20
🔗
|
|
Hani111 is now known as Hani |
|
21:21
🔗
|
|
wp494 has quit IRC (Ping timeout: 252 seconds) |
|
21:22
🔗
|
|
wp494 has joined #archiveteam-bs |
|
21:34
🔗
|
t3 |
marked: I made it go faster. |
|
21:35
🔗
|
t3 |
marked: It's supposed to shut down today. Maybe that's why it's sending out 503s. |
|
21:46
🔗
|
t3 |
JAA: For 753f9wxjswuxuz1n687khv2cf, it seems like archive.fo URLs are not loading on the pipeline. |
|
21:46
🔗
|
t3 |
But I don't think it should be archiving an archive. |
|
21:47
🔗
|
t3 |
I will add `!ig 753f9wxjswuxuz1n687khv2cf ^https?://archive\.fo/`. |
|
21:52
🔗
|
t3 |
JAA: I've increased the concurrency of the sola.ai jobs. |
|
22:19
🔗
|
|
enowaldo has quit IRC (Ping timeout: 265 seconds) |
|
22:28
🔗
|
JAA |
So I guess Sola shut down by now? |
|
22:32
🔗
|
ayanami_ |
Yup |
|
22:32
🔗
|
ayanami_ |
503 |
|
22:32
🔗
|
ayanami_ |
"sola.ai is currently unable to handle this request." |
|
22:33
🔗
|
|
tech234a has joined #archiveteam-bs |
|
22:34
🔗
|
|
mgrytbak has joined #archiveteam-bs |
|
22:35
🔗
|
|
BlueMax has joined #archiveteam-bs |
|
23:08
🔗
|
|
enowaldo has joined #archiveteam-bs |
|
23:21
🔗
|
|
ndiddy has joined #archiveteam-bs |
|
23:22
🔗
|
|
enowaldo has quit IRC (Ping timeout: 252 seconds) |
|
23:32
🔗
|
marked |
screen -d |