Time |
Nickname |
Message |
00:03
🔗
|
|
Jens has quit IRC (Remote host closed the connection) |
00:03
🔗
|
|
Jens has joined #archiveteam-bs |
00:34
🔗
|
|
Despatche has quit IRC (Quit: Read error: Connection reset by peer) |
00:34
🔗
|
|
Despatche has joined #archiveteam-bs |
01:00
🔗
|
|
dashcloud has quit IRC (Remote host closed the connection) |
01:12
🔗
|
arkiver |
JAA: how is giga.de doing? |
01:39
🔗
|
|
lindalap_ has joined #archiveteam-bs |
01:39
🔗
|
|
lindalap has quit IRC (Write error: Connection reset by peer) |
01:45
🔗
|
|
lindalap_ is now known as lindalap |
01:50
🔗
|
|
Lord_Nigh has quit IRC (Read error: Operation timed out) |
01:52
🔗
|
|
Lord_Nigh has joined #archiveteam-bs |
01:56
🔗
|
SketchCow |
18:30 < ola_norsk> are there any words about the 'wikifying' of IA item metadata? |
01:56
🔗
|
SketchCow |
Ha ha, so, would you like the ingredients for a true disaster |
02:05
🔗
|
|
BlueMaxim has joined #archiveteam-bs |
02:05
🔗
|
|
BlueMax has quit IRC (Read error: Connection reset by peer) |
02:06
🔗
|
|
HCross has quit IRC (Quit: Connection closed for inactivity) |
02:36
🔗
|
|
LordNigh2 has joined #archiveteam-bs |
02:38
🔗
|
|
Lord_Nigh has quit IRC (Ping timeout: 268 seconds) |
02:38
🔗
|
|
LordNigh2 is now known as Lord_Nigh |
02:42
🔗
|
|
LordNigh2 has joined #archiveteam-bs |
02:44
🔗
|
|
Lord_Nigh has quit IRC (Ping timeout: 252 seconds) |
02:44
🔗
|
|
LordNigh2 is now known as Lord_Nigh |
03:13
🔗
|
|
m007a83_ has joined #archiveteam-bs |
03:15
🔗
|
|
lindalap_ has joined #archiveteam-bs |
03:15
🔗
|
|
lindalap has quit IRC (Write error: Connection reset by peer) |
03:15
🔗
|
|
m007a83_ has quit IRC (Read error: Connection reset by peer) |
03:15
🔗
|
|
m007a83__ has joined #archiveteam-bs |
03:17
🔗
|
|
m007a83 has quit IRC (Ping timeout: 252 seconds) |
03:17
🔗
|
|
lindalap_ is now known as lindalap |
03:27
🔗
|
|
m007a83__ is now known as m007a83 |
03:30
🔗
|
|
wp494 has joined #archiveteam-bs |
03:30
🔗
|
|
svchfoo3 sets mode: +o wp494 |
03:32
🔗
|
eientei95 |
SketchCow: Yes |
03:34
🔗
|
|
wp494_ has quit IRC (Read error: Operation timed out) |
03:45
🔗
|
|
qw3rty118 has joined #archiveteam-bs |
03:48
🔗
|
|
Despatche has quit IRC (Quit: Read error: Connection reset by peer) |
03:52
🔗
|
|
qw3rty117 has quit IRC (Read error: Operation timed out) |
03:55
🔗
|
|
odemg has quit IRC (Read error: Operation timed out) |
04:06
🔗
|
|
ndiddy has quit IRC () |
04:07
🔗
|
|
odemg has joined #archiveteam-bs |
07:46
🔗
|
|
Lord_Nigh has quit IRC (Read error: Operation timed out) |
07:46
🔗
|
|
Lord_Nigh has joined #archiveteam-bs |
08:08
🔗
|
|
Lord_Nigh has quit IRC (Read error: Operation timed out) |
08:10
🔗
|
|
Lord_Nigh has joined #archiveteam-bs |
08:11
🔗
|
|
jschwart has joined #archiveteam-bs |
08:20
🔗
|
|
Jens has quit IRC (Remote host closed the connection) |
08:20
🔗
|
|
Jens has joined #archiveteam-bs |
08:39
🔗
|
|
Darkstar has quit IRC (Ping timeout: 260 seconds) |
08:43
🔗
|
|
Darkstar has joined #archiveteam-bs |
08:51
🔗
|
|
schbirid has joined #archiveteam-bs |
08:52
🔗
|
|
Darkstar has quit IRC (Ping timeout: 246 seconds) |
08:53
🔗
|
|
Darkstar has joined #archiveteam-bs |
08:55
🔗
|
JAA |
arkiver: It finished a few hours ago. Testing it now to see if everything's alright. |
09:02
🔗
|
|
Darkstar has quit IRC (Ping timeout: 246 seconds) |
09:03
🔗
|
|
m007a83_ has joined #archiveteam-bs |
09:04
🔗
|
|
C4K3 has quit IRC (Read error: Operation timed out) |
09:04
🔗
|
|
Mayonaise has quit IRC (Read error: Operation timed out) |
09:04
🔗
|
|
sep332 has quit IRC (Read error: Operation timed out) |
09:04
🔗
|
|
FireFly has quit IRC (Read error: Operation timed out) |
09:04
🔗
|
|
REiN^ has quit IRC (Read error: Operation timed out) |
09:04
🔗
|
|
Mayonaise has joined #archiveteam-bs |
09:04
🔗
|
|
beardicus has quit IRC (Read error: Operation timed out) |
09:05
🔗
|
|
PotcFdk has quit IRC (Read error: Operation timed out) |
09:06
🔗
|
|
beardicus has joined #archiveteam-bs |
09:06
🔗
|
|
m007a83 has quit IRC (Read error: Operation timed out) |
09:07
🔗
|
|
Darkstar has joined #archiveteam-bs |
09:07
🔗
|
|
FireFly has joined #archiveteam-bs |
09:07
🔗
|
|
REiN^ has joined #archiveteam-bs |
09:08
🔗
|
|
sep332 has joined #archiveteam-bs |
09:09
🔗
|
|
C4K3 has joined #archiveteam-bs |
09:10
🔗
|
|
Sk1d has joined #archiveteam-bs |
09:15
🔗
|
|
HCross has joined #archiveteam-bs |
09:16
🔗
|
|
svchfoo3 sets mode: +o HCross |
09:24
🔗
|
|
Despatche has joined #archiveteam-bs |
10:02
🔗
|
|
PotcFdk has joined #archiveteam-bs |
10:20
🔗
|
|
qwebirc40 has joined #archiveteam-bs |
10:21
🔗
|
qwebirc40 |
Is Miitomo being archived? |
10:23
🔗
|
|
BlueMaxim has quit IRC (Quit: Leaving) |
10:25
🔗
|
|
qwebirc40 has quit IRC (Ping timeout: 260 seconds) |
10:40
🔗
|
|
RIP_ has joined #archiveteam-bs |
10:41
🔗
|
|
RIP_ has quit IRC (Client Quit) |
11:30
🔗
|
JAA |
arkiver: Archives are looking good. :-) |
11:44
🔗
|
Aoede |
miitomo closes may 9th it seems |
12:28
🔗
|
|
RichardG has quit IRC (Ping timeout: 260 seconds) |
12:29
🔗
|
|
godane has quit IRC (Ping timeout: 252 seconds) |
12:38
🔗
|
|
RichardG has joined #archiveteam-bs |
12:52
🔗
|
|
schbirid has quit IRC (Quit: Leaving) |
13:10
🔗
|
odemg |
Starting March 30, 2018, we will be turning down support for goo.gl URL shortener. From April 13, 2018 only existing users will be able to create short links on the goo.gl console. You will be able to view your analytics data and download your short link information in csv format for up to one year, until March 30, 2019, when we will discontinue goo.gl. Previously created links will continue to redirect to their intended |
13:10
🔗
|
odemg |
destination. https://developers.googleblog.com/2018/03/transitioning-google-url-shortener.html |
13:10
🔗
|
eientei95 |
We know |
13:11
🔗
|
odemg |
eientei95, ohh you do do you? I must not have been cc on the minutes from our last meeting. |
13:12
🔗
|
odemg |
I suppose that #urlteam business anywho. |
13:12
🔗
|
eientei95 |
archiveteam.log:3344:Apr 03 07:12:28 <JAA> Yeah, we'll grab goo.gl after 2019-03-30 through URLTeam. |
13:13
🔗
|
odemg |
True. |
13:23
🔗
|
|
schbirid has joined #archiveteam-bs |
13:41
🔗
|
|
SmileyG_ has joined #archiveteam-bs |
13:42
🔗
|
|
SmileyG has quit IRC (Read error: Operation timed out) |
13:54
🔗
|
|
schbirid has quit IRC (Quit: Leaving) |
16:12
🔗
|
|
Coderjo has quit IRC (Remote host closed the connection) |
16:28
🔗
|
|
Stilett0- is now known as Stiletto |
16:46
🔗
|
|
Stiletto has quit IRC () |
17:36
🔗
|
|
Stilett0- has joined #archiveteam-bs |
17:38
🔗
|
|
bad_faith has quit IRC (Ping timeout: 244 seconds) |
17:42
🔗
|
|
godane has joined #archiveteam-bs |
17:43
🔗
|
|
svchfoo1 sets mode: +o godane |
17:46
🔗
|
JAA |
My GIGA grab is incomplete because their server was throwing 502s for a while. I'll run another grab for those. |
17:47
🔗
|
JAA |
I also found a number of broken threads. In particular, threads where more pages are displayed than exist. The later pages then redirect one by one back to the last existing page through awkward redirects that I didn't follow. |
17:47
🔗
|
JAA |
This means that the "last page" link in some threads won't work. But all accessible content should be there. |
17:48
🔗
|
JAA |
Users would just have to click through all pages or manipulate the URL to read the later content. |
17:48
🔗
|
JAA |
Not much I can do about that. |
17:48
🔗
|
JAA |
I guess I could fetch the redirects, but that gets a bit messy. |
18:39
🔗
|
HCross |
Anyone able to give the tracker a prod? It doesnt seem to be updating anymore |
19:50
🔗
|
|
Mateon1 has quit IRC (Read error: Operation timed out) |
19:50
🔗
|
|
Mateon1 has joined #archiveteam-bs |
21:06
🔗
|
godane |
SketchCow: any news from Mank? |
21:19
🔗
|
SketchCow |
Mank is getting married. |
21:19
🔗
|
SketchCow |
I'm going to see who's taking over his stuff |
21:20
🔗
|
godane |
ok |
21:21
🔗
|
godane |
so now its like 3 people that left the internet archive in month |
21:47
🔗
|
JAA |
I'm grabbing about 3.7k threads again which failed in the initial grab due to those 502s I mentioned (and connection refusals). |
21:48
🔗
|
JAA |
There are also almost 1k threads which had some other kind of issue. Most of those are probably those redirects. I'll just regrab them following infinite redirects. It's likely that no content is missing for almost all of these threads, but better safe than sorry. |
21:56
🔗
|
JAA |
TIL there is no way to follow infinite redirects in wpull (at least not in version 1.2.3). |
22:01
🔗
|
JAA |
Looks like they perform backups or something at 00:00 local time. I get 502s again, just like before around 22:00 UTC (= 00:00 CEST). |
22:03
🔗
|
|
BlueMax has joined #archiveteam-bs |
22:03
🔗
|
|
dxrt has quit IRC (Quit: ZNC - http://znc.sourceforge.net) |
22:05
🔗
|
|
godane has quit IRC (Ping timeout: 268 seconds) |
22:19
🔗
|
|
godane has joined #archiveteam-bs |
22:20
🔗
|
|
svchfoo3 sets mode: +o godane |
22:22
🔗
|
|
jschwart has quit IRC (Quit: Konversation terminated!) |
22:22
🔗
|
JAA |
Turns out it was a really good decision to grab the forums by bruteforcing all thread IDs: there are hidden forums that don't seem to appear anywhere in the forum list, e.g. http://forum.giga.de/gamescom-2010-%5Bread-only%5D/ . A simple recursive grab would almost certainly have missed that. |
22:23
🔗
|
astrid |
hm, nice work <3 |
22:25
🔗
|
JAA |
I'm preparing a job to grab the forum indices with daysprune=-1, by the way. I think then I should have pretty much everything there is. |
22:56
🔗
|
|
HCross has quit IRC (Quit: Connection closed for inactivity) |
23:49
🔗
|
JAA |
Lovely... When you access a forum's later page with ?daysprune=-1, the HTML contains a canonical <link>, which does not include the daysprune parameter, so it's not actually the same content. Great job, vBulletin... |
23:50
🔗
|
JAA |
Anyway, my index grab is running now. |
23:52
🔗
|
JAA |
It's always nice to be able to reuse code written for another archival. In this case, SPUF. |