#archiveteam-bs 2018-04-14,Sat

↑back Search

Time Nickname Message
00:03 🔗 Jens has quit IRC (Remote host closed the connection)
00:03 🔗 Jens has joined #archiveteam-bs
00:34 🔗 Despatche has quit IRC (Quit: Read error: Connection reset by peer)
00:34 🔗 Despatche has joined #archiveteam-bs
01:00 🔗 dashcloud has quit IRC (Remote host closed the connection)
01:12 🔗 arkiver JAA: how is giga.de doing?
01:39 🔗 lindalap_ has joined #archiveteam-bs
01:39 🔗 lindalap has quit IRC (Write error: Connection reset by peer)
01:45 🔗 lindalap_ is now known as lindalap
01:50 🔗 Lord_Nigh has quit IRC (Read error: Operation timed out)
01:52 🔗 Lord_Nigh has joined #archiveteam-bs
01:56 🔗 SketchCow 18:30 < ola_norsk> are there any words about the 'wikifying' of IA item metadata?
01:56 🔗 SketchCow Ha ha, so, would you like the ingredients for a true disaster
02:05 🔗 BlueMaxim has joined #archiveteam-bs
02:05 🔗 BlueMax has quit IRC (Read error: Connection reset by peer)
02:06 🔗 HCross has quit IRC (Quit: Connection closed for inactivity)
02:36 🔗 LordNigh2 has joined #archiveteam-bs
02:38 🔗 Lord_Nigh has quit IRC (Ping timeout: 268 seconds)
02:38 🔗 LordNigh2 is now known as Lord_Nigh
02:42 🔗 LordNigh2 has joined #archiveteam-bs
02:44 🔗 Lord_Nigh has quit IRC (Ping timeout: 252 seconds)
02:44 🔗 LordNigh2 is now known as Lord_Nigh
03:13 🔗 m007a83_ has joined #archiveteam-bs
03:15 🔗 lindalap_ has joined #archiveteam-bs
03:15 🔗 lindalap has quit IRC (Write error: Connection reset by peer)
03:15 🔗 m007a83_ has quit IRC (Read error: Connection reset by peer)
03:15 🔗 m007a83__ has joined #archiveteam-bs
03:17 🔗 m007a83 has quit IRC (Ping timeout: 252 seconds)
03:17 🔗 lindalap_ is now known as lindalap
03:27 🔗 m007a83__ is now known as m007a83
03:30 🔗 wp494 has joined #archiveteam-bs
03:30 🔗 svchfoo3 sets mode: +o wp494
03:32 🔗 eientei95 SketchCow: Yes
03:34 🔗 wp494_ has quit IRC (Read error: Operation timed out)
03:45 🔗 qw3rty118 has joined #archiveteam-bs
03:48 🔗 Despatche has quit IRC (Quit: Read error: Connection reset by peer)
03:52 🔗 qw3rty117 has quit IRC (Read error: Operation timed out)
03:55 🔗 odemg has quit IRC (Read error: Operation timed out)
04:06 🔗 ndiddy has quit IRC ()
04:07 🔗 odemg has joined #archiveteam-bs
07:46 🔗 Lord_Nigh has quit IRC (Read error: Operation timed out)
07:46 🔗 Lord_Nigh has joined #archiveteam-bs
08:08 🔗 Lord_Nigh has quit IRC (Read error: Operation timed out)
08:10 🔗 Lord_Nigh has joined #archiveteam-bs
08:11 🔗 jschwart has joined #archiveteam-bs
08:20 🔗 Jens has quit IRC (Remote host closed the connection)
08:20 🔗 Jens has joined #archiveteam-bs
08:39 🔗 Darkstar has quit IRC (Ping timeout: 260 seconds)
08:43 🔗 Darkstar has joined #archiveteam-bs
08:51 🔗 schbirid has joined #archiveteam-bs
08:52 🔗 Darkstar has quit IRC (Ping timeout: 246 seconds)
08:53 🔗 Darkstar has joined #archiveteam-bs
08:55 🔗 JAA arkiver: It finished a few hours ago. Testing it now to see if everything's alright.
09:02 🔗 Darkstar has quit IRC (Ping timeout: 246 seconds)
09:03 🔗 m007a83_ has joined #archiveteam-bs
09:04 🔗 C4K3 has quit IRC (Read error: Operation timed out)
09:04 🔗 Mayonaise has quit IRC (Read error: Operation timed out)
09:04 🔗 sep332 has quit IRC (Read error: Operation timed out)
09:04 🔗 FireFly has quit IRC (Read error: Operation timed out)
09:04 🔗 REiN^ has quit IRC (Read error: Operation timed out)
09:04 🔗 Mayonaise has joined #archiveteam-bs
09:04 🔗 beardicus has quit IRC (Read error: Operation timed out)
09:05 🔗 PotcFdk has quit IRC (Read error: Operation timed out)
09:06 🔗 beardicus has joined #archiveteam-bs
09:06 🔗 m007a83 has quit IRC (Read error: Operation timed out)
09:07 🔗 Darkstar has joined #archiveteam-bs
09:07 🔗 FireFly has joined #archiveteam-bs
09:07 🔗 REiN^ has joined #archiveteam-bs
09:08 🔗 sep332 has joined #archiveteam-bs
09:09 🔗 C4K3 has joined #archiveteam-bs
09:10 🔗 Sk1d has joined #archiveteam-bs
09:15 🔗 HCross has joined #archiveteam-bs
09:16 🔗 svchfoo3 sets mode: +o HCross
09:24 🔗 Despatche has joined #archiveteam-bs
10:02 🔗 PotcFdk has joined #archiveteam-bs
10:20 🔗 qwebirc40 has joined #archiveteam-bs
10:21 🔗 qwebirc40 Is Miitomo being archived?
10:23 🔗 BlueMaxim has quit IRC (Quit: Leaving)
10:25 🔗 qwebirc40 has quit IRC (Ping timeout: 260 seconds)
10:40 🔗 RIP_ has joined #archiveteam-bs
10:41 🔗 RIP_ has quit IRC (Client Quit)
11:30 🔗 JAA arkiver: Archives are looking good. :-)
11:44 🔗 Aoede miitomo closes may 9th it seems
12:28 🔗 RichardG has quit IRC (Ping timeout: 260 seconds)
12:29 🔗 godane has quit IRC (Ping timeout: 252 seconds)
12:38 🔗 RichardG has joined #archiveteam-bs
12:52 🔗 schbirid has quit IRC (Quit: Leaving)
13:10 🔗 odemg Starting March 30, 2018, we will be turning down support for goo.gl URL shortener. From April 13, 2018 only existing users will be able to create short links on the goo.gl console. You will be able to view your analytics data and download your short link information in csv format for up to one year, until March 30, 2019, when we will discontinue goo.gl. Previously created links will continue to redirect to their intended
13:10 🔗 odemg destination. https://developers.googleblog.com/2018/03/transitioning-google-url-shortener.html
13:10 🔗 eientei95 We know
13:11 🔗 odemg eientei95, ohh you do do you? I must not have been cc on the minutes from our last meeting.
13:12 🔗 odemg I suppose that #urlteam business anywho.
13:12 🔗 eientei95 archiveteam.log:3344:Apr 03 07:12:28 <JAA> Yeah, we'll grab goo.gl after 2019-03-30 through URLTeam.
13:13 🔗 odemg True.
13:23 🔗 schbirid has joined #archiveteam-bs
13:41 🔗 SmileyG_ has joined #archiveteam-bs
13:42 🔗 SmileyG has quit IRC (Read error: Operation timed out)
13:54 🔗 schbirid has quit IRC (Quit: Leaving)
16:12 🔗 Coderjo has quit IRC (Remote host closed the connection)
16:28 🔗 Stilett0- is now known as Stiletto
16:46 🔗 Stiletto has quit IRC ()
17:36 🔗 Stilett0- has joined #archiveteam-bs
17:38 🔗 bad_faith has quit IRC (Ping timeout: 244 seconds)
17:42 🔗 godane has joined #archiveteam-bs
17:43 🔗 svchfoo1 sets mode: +o godane
17:46 🔗 JAA My GIGA grab is incomplete because their server was throwing 502s for a while. I'll run another grab for those.
17:47 🔗 JAA I also found a number of broken threads. In particular, threads where more pages are displayed than exist. The later pages then redirect one by one back to the last existing page through awkward redirects that I didn't follow.
17:47 🔗 JAA This means that the "last page" link in some threads won't work. But all accessible content should be there.
17:48 🔗 JAA Users would just have to click through all pages or manipulate the URL to read the later content.
17:48 🔗 JAA Not much I can do about that.
17:48 🔗 JAA I guess I could fetch the redirects, but that gets a bit messy.
18:39 🔗 HCross Anyone able to give the tracker a prod? It doesnt seem to be updating anymore
19:50 🔗 Mateon1 has quit IRC (Read error: Operation timed out)
19:50 🔗 Mateon1 has joined #archiveteam-bs
21:06 🔗 godane SketchCow: any news from Mank?
21:19 🔗 SketchCow Mank is getting married.
21:19 🔗 SketchCow I'm going to see who's taking over his stuff
21:20 🔗 godane ok
21:21 🔗 godane so now its like 3 people that left the internet archive in month
21:47 🔗 JAA I'm grabbing about 3.7k threads again which failed in the initial grab due to those 502s I mentioned (and connection refusals).
21:48 🔗 JAA There are also almost 1k threads which had some other kind of issue. Most of those are probably those redirects. I'll just regrab them following infinite redirects. It's likely that no content is missing for almost all of these threads, but better safe than sorry.
21:56 🔗 JAA TIL there is no way to follow infinite redirects in wpull (at least not in version 1.2.3).
22:01 🔗 JAA Looks like they perform backups or something at 00:00 local time. I get 502s again, just like before around 22:00 UTC (= 00:00 CEST).
22:03 🔗 BlueMax has joined #archiveteam-bs
22:03 🔗 dxrt has quit IRC (Quit: ZNC - http://znc.sourceforge.net)
22:05 🔗 godane has quit IRC (Ping timeout: 268 seconds)
22:19 🔗 godane has joined #archiveteam-bs
22:20 🔗 svchfoo3 sets mode: +o godane
22:22 🔗 jschwart has quit IRC (Quit: Konversation terminated!)
22:22 🔗 JAA Turns out it was a really good decision to grab the forums by bruteforcing all thread IDs: there are hidden forums that don't seem to appear anywhere in the forum list, e.g. http://forum.giga.de/gamescom-2010-%5Bread-only%5D/ . A simple recursive grab would almost certainly have missed that.
22:23 🔗 astrid hm, nice work <3
22:25 🔗 JAA I'm preparing a job to grab the forum indices with daysprune=-1, by the way. I think then I should have pretty much everything there is.
22:56 🔗 HCross has quit IRC (Quit: Connection closed for inactivity)
23:49 🔗 JAA Lovely... When you access a forum's later page with ?daysprune=-1, the HTML contains a canonical <link>, which does not include the daysprune parameter, so it's not actually the same content. Great job, vBulletin...
23:50 🔗 JAA Anyway, my index grab is running now.
23:52 🔗 JAA It's always nice to be able to reuse code written for another archival. In this case, SPUF.

irclogger-viewer