#archiveteam-bs 2019-04-26,Fri

↑back Search

Time Nickname Message
00:17 πŸ”— bitBaron has quit IRC (Quit: My computer has gone to sleep. 😴πŸ˜ͺZZZzzz…)
00:43 πŸ”— MR9K4 has joined #archiveteam-bs
01:30 πŸ”— drcd has quit IRC (Read error: Connection reset by peer)
01:32 πŸ”— d5f4a3622 has quit IRC (Quit: WeeChat 2.4)
01:34 πŸ”— d5f4a3622 has joined #archiveteam-bs
02:03 πŸ”— bitBaron has joined #archiveteam-bs
02:15 πŸ”— ayanami_ has quit IRC (Quit: Leaving)
02:19 πŸ”— bitBaron has quit IRC (Quit: My computer has gone to sleep. 😴πŸ˜ͺZZZzzz…)
02:21 πŸ”— enowaldo has joined #archiveteam-bs
02:26 πŸ”— enowaldo has quit IRC (Ping timeout: 268 seconds)
02:42 πŸ”— benjins has joined #archiveteam-bs
02:42 πŸ”— BlueMax has joined #archiveteam-bs
03:20 πŸ”— odemgi has joined #archiveteam-bs
03:23 πŸ”— odemgi_ has quit IRC (Ping timeout: 252 seconds)
03:32 πŸ”— GuysFree has quit IRC (Quit: Connection closed for inactivity)
03:33 πŸ”— godane i'm now archiving The Mike Rosen Show
03:34 πŸ”— godane he retired in 2015 then i think maybe doing a at the movies radio show up to 2017
03:35 πŸ”— godane what bothers me is that there mp3s going back to 2008 but only from 2011-09-12 on i'm able to grab
03:36 πŸ”— godane thats cause older urls look like this : http://a1135.g.akamai.net/f/1135/18227/1h/cchannel.download.akamai.com/18227/podcast/DENVER-CO/KOA-AM/Rosen08-3-09-11AM.mp3
03:40 πŸ”— godane later urls are like this : http://media.ccomrcdn.com/media/station_content/668/Rosen11-15-11-09AM_1321905631_28012.mp3
04:43 πŸ”— TC01 has joined #archiveteam-bs
04:49 πŸ”— TC01_ has quit IRC (Ping timeout: 615 seconds)
04:52 πŸ”— ndiddy has quit IRC ()
05:09 πŸ”— kpcyrd is https://archive.org/details/github_narabot_mirror part of archiveteam? couldn't find anything about it
05:26 πŸ”— BlueMax has quit IRC (Quit: Leaving)
05:50 πŸ”— BlueMax has joined #archiveteam-bs
06:13 πŸ”— Zerote has quit IRC (Ping timeout: 600 seconds)
07:18 πŸ”— MrRadar2 has quit IRC (Read error: Operation timed out)
07:18 πŸ”— BnAboyZ has quit IRC (Read error: Operation timed out)
07:22 πŸ”— colona has quit IRC (Ping timeout: 265 seconds)
07:24 πŸ”— colona has joined #archiveteam-bs
07:27 πŸ”— BnAboyZ has joined #archiveteam-bs
07:28 πŸ”— MrRadar2 has joined #archiveteam-bs
07:29 πŸ”— svchfoo3 sets mode: +o MrRadar2
07:35 πŸ”— Zerote has joined #archiveteam-bs
08:53 πŸ”— PurpleSym Wrt Sony’s sketch: There seem to be ~200M sketches and it seems I can retrive ids (UUID) for all of them easily, but it takes some time.
08:54 πŸ”— Zerote has quit IRC (Read error: Operation timed out)
08:54 πŸ”— JAA Sweet. Fortunately, we have 5 months.
08:57 πŸ”— Zerote has joined #archiveteam-bs
08:58 πŸ”— benjinsmi has joined #archiveteam-bs
09:01 πŸ”— benjins has quit IRC (Read error: Operation timed out)
09:02 πŸ”— Odd0002_ has joined #archiveteam-bs
09:07 πŸ”— Odd0002 has quit IRC (Ping timeout: 615 seconds)
09:07 πŸ”— Odd0002_ is now known as Odd0002
10:26 πŸ”— enowaldo has joined #archiveteam-bs
10:27 πŸ”— Verified_ has quit IRC (Remote host closed the connection)
11:01 πŸ”— enowaldo has quit IRC (Ping timeout: 265 seconds)
11:32 πŸ”— enowaldo has joined #archiveteam-bs
11:46 πŸ”— enowaldo has quit IRC (Ping timeout: 268 seconds)
11:51 πŸ”— kiska1 has quit IRC (Read error: Connection reset by peer)
11:52 πŸ”— kiska1 has joined #archiveteam-bs
11:52 πŸ”— svchfoo3 sets mode: +o kiska1
12:00 πŸ”— deathy has quit IRC (Read error: Connection reset by peer)
12:01 πŸ”— diggan has quit IRC (Read error: Connection reset by peer)
12:03 πŸ”— diggan has joined #archiveteam-bs
12:04 πŸ”— deathy has joined #archiveteam-bs
12:08 πŸ”— bitBaron has joined #archiveteam-bs
12:19 πŸ”— enowaldo has joined #archiveteam-bs
12:25 πŸ”— BlueMax has quit IRC (Quit: Leaving)
12:31 πŸ”— icedice has joined #archiveteam-bs
12:45 πŸ”— enowaldo has quit IRC (Ping timeout: 492 seconds)
12:47 πŸ”— bitBaron has quit IRC (Quit: My computer has gone to sleep. 😴πŸ˜ͺZZZzzz…)
12:59 πŸ”— enowaldo has joined #archiveteam-bs
13:05 πŸ”— cfarquhar has quit IRC (Read error: Operation timed out)
13:10 πŸ”— Odd0002_ has joined #archiveteam-bs
13:13 πŸ”— cfarquhar has joined #archiveteam-bs
13:16 πŸ”— Odd0002 has quit IRC (Read error: Operation timed out)
13:16 πŸ”— Odd0002_ is now known as Odd0002
13:17 πŸ”— VerifiedJ has joined #archiveteam-bs
13:21 πŸ”— enowaldo has quit IRC (Read error: Operation timed out)
13:43 πŸ”— cfarquhar has quit IRC (Read error: Operation timed out)
13:48 πŸ”— enowaldo has joined #archiveteam-bs
13:51 πŸ”— cfarquhar has joined #archiveteam-bs
13:58 πŸ”— JAA I have no idea what my bot will do with that edit, but we'll see in a minute or so.
14:11 πŸ”— hook54321 JAA: disallowing ia_archiver no longer prevents sites from being viewed
14:12 πŸ”— JAA hook54321: Is that definite now? I know it happened in the past due to bugs in the robots parser or something. But please feel free to correct that (I didn't write it, btw).
14:19 πŸ”— hook54321 ah ok, I assumed you did, my bad. I haven't seen any official documentation saying that it's like that for all sites now (however lots of IA's policies aren't documented), but I haven't seen the robots.txt error message in a long time, even on sites that specifically disallow it.
14:19 πŸ”— hook54321 Save Page Now ignores robots.txt as well
14:20 πŸ”— hook54321 https://twitter.com/MarkGraham/status/1113503847228395521
14:26 πŸ”— enowaldo has quit IRC (Read error: Operation timed out)
14:27 πŸ”— JAA That's great news.
14:37 πŸ”— hook54321 Yeah.
14:40 πŸ”— hook54321 I kinda understand why their other crawlers (and Alexa's) still pay attention to it in most cases. I'm guessing it keeps crawls more sane since they might not be monitored all the time. For stuff like grabbing pages linked to on Wikipedia though I hope they ignore it.
14:41 πŸ”— hook54321 And they would likely get widely blocked if they didn't.
14:48 πŸ”— JAA "List of unreliable URL shorteners" - "Avoid using them. Use TinyURL.com instead." Someone clearly doesn't know about URLTeam.
14:49 πŸ”— JAA hook54321: Yeah, probably makes sense there. Although their web-wide crawls are only recursing to a limited depth usually. Bans could definitely be an issue though.
14:50 πŸ”— ATrescue has joined #archiveteam-bs
14:50 πŸ”— ATrescue Got invited by JAA.
14:50 πŸ”— JAA ATrescue: Regarding your "List of unreliable URL shorteners", I guess you haven't yet read about URLTeam, have you?
14:51 πŸ”— ATrescue JAA: Not yet.
14:51 πŸ”— JAA I suggest you do then. That list would be a duplicate basically.
14:51 πŸ”— JAA Also, all URL shorteners are bad, not just the ones you consider unreliable (by whatever measure).
14:52 πŸ”— JAA The only exception are service-internal shorteners, like git.io for GitHub. Chances are that those will survive as long as the service exists, and they'd be useless if the service collapses.
14:54 πŸ”— ATrescue @JAA I took a look. That's a very extensive, impressive list. git.io is like t.co (t.co shorts all URL's posted in tweets, even after the original tweet is unavailable).
14:55 πŸ”— hook54321 eh
14:56 πŸ”— hook54321 They link to external sites though, and if twitter ever shut down those links would then be useless.
14:58 πŸ”— ATrescue hook54321: Twitter isn't likely to shutdown anytime soon, but who knows? I have archived many t.co links yesterday.
14:58 πŸ”— enowaldo has joined #archiveteam-bs
15:00 πŸ”— ATrescue TinyURL *seems* reliable. I also don't think they are going to shut down anytime soon, but better safe than sorry.
15:08 πŸ”— enowaldo has quit IRC (Ping timeout: 252 seconds)
15:13 πŸ”— JAA ATrescue: This is far from surprising. TinyCC lets you edit and delete links if you use an account. Anyway, I'm not sure we need a separate page for each URL shortener. Not sure what others think on this though.
15:14 πŸ”— icedice2 has joined #archiveteam-bs
15:15 πŸ”— ATrescue JAA: Surprisingly many URL's in the past (probably when they redesigned their website at some point) became unuseable.
15:15 πŸ”— Zerote has quit IRC (Read error: Operation timed out)
15:16 πŸ”— svchfoo1 has joined #archiveteam-bs
15:16 πŸ”— PurpleSym sets mode: +o svchfoo1
15:22 πŸ”— icedice has quit IRC (Read error: Operation timed out)
15:39 πŸ”— Somebody2 Yeah, please don't make separate pages for URL shorteners.
15:46 πŸ”— bitBaron has joined #archiveteam-bs
15:51 πŸ”— icedice2 has quit IRC (Ping timeout: 252 seconds)
15:52 πŸ”— Zerote has joined #archiveteam-bs
15:57 πŸ”— icedice has joined #archiveteam-bs
16:27 πŸ”— enowaldo has joined #archiveteam-bs
16:46 πŸ”— bitBaron has quit IRC (Quit: My computer has gone to sleep. 😴πŸ˜ͺZZZzzz…)
16:49 πŸ”— bitBaron has joined #archiveteam-bs
16:55 πŸ”— Dj-Wawa has joined #archiveteam-bs
16:56 πŸ”— enowaldo has quit IRC (Ping timeout: 265 seconds)
17:14 πŸ”— enowaldo has joined #archiveteam-bs
17:27 πŸ”— deathy has quit IRC ()
17:27 πŸ”— deathy has joined #archiveteam-bs
17:50 πŸ”— diggan has quit IRC ()
17:50 πŸ”— diggan has joined #archiveteam-bs
18:04 πŸ”— Terbium has quit IRC (Quit: Terbium)
18:11 πŸ”— Terbium has joined #archiveteam-bs
18:31 πŸ”— enowaldo has quit IRC (Ping timeout: 252 seconds)
18:43 πŸ”— enowaldo has joined #archiveteam-bs
18:46 πŸ”— bitBaron has quit IRC (Quit: My computer has gone to sleep. 😴πŸ˜ͺZZZzzz…)
18:55 πŸ”— godane has quit IRC (Quit: Leaving.)
19:03 πŸ”— godane has joined #archiveteam-bs
19:05 πŸ”— ndiddy has joined #archiveteam-bs
19:19 πŸ”— bitBaron has joined #archiveteam-bs
19:53 πŸ”— wyatt8740 has quit IRC (Read error: Operation timed out)
20:07 πŸ”— killsushi has joined #archiveteam-bs
20:08 πŸ”— enowaldo has quit IRC (Read error: Operation timed out)
20:12 πŸ”— tsp__ has quit IRC (Remote host closed the connection)
20:26 πŸ”— godane has quit IRC (Ping timeout: 615 seconds)
20:26 πŸ”— tsp__ has joined #archiveteam-bs
20:33 πŸ”— godane has joined #archiveteam-bs
20:35 πŸ”— enowaldo has joined #archiveteam-bs
20:39 πŸ”— ndiddy has quit IRC (Ping timeout: 615 seconds)
20:48 πŸ”— godane so i got another post by that crazy guy : https://archive.org/details/the-mike-rosen-show-2011-12-30
20:48 πŸ”— revi has quit IRC ()
20:48 πŸ”— revi has joined #archiveteam-bs
20:50 πŸ”— enowaldo has quit IRC (Read error: Operation timed out)
21:04 πŸ”— enowaldo has joined #archiveteam-bs
21:28 πŸ”— icedice has quit IRC (Read error: Operation timed out)
21:38 πŸ”— wyatt8740 has joined #archiveteam-bs
21:51 πŸ”— ndiddy has joined #archiveteam-bs
22:07 πŸ”— enowaldo has quit IRC (Read error: Operation timed out)
22:08 πŸ”— godane so i'm uploading 3 things at once today
22:09 πŸ”— godane The Mike Rosen Show, libuow issuu cbz files, and The Joe Piscopo Show
22:34 πŸ”— BlueMax has joined #archiveteam-bs
23:07 πŸ”— VerifiedJ has quit IRC (Quit: Leaving)
23:52 πŸ”— Rome_Silv has joined #archiveteam-bs
23:58 πŸ”— Rome has quit IRC (Read error: Operation timed out)

irclogger-viewer