#archiveteam-bs 2019-12-25,Wed

↑back Search

Time Nickname Message
00:05 πŸ”— Stilett0 has joined #archiveteam-bs
00:08 πŸ”— Stiletto has quit IRC (Read error: Operation timed out)
00:29 πŸ”— HP_Archiv has joined #archiveteam-bs
00:30 πŸ”— HP_Archiv I have another request for an Ops - http://narniansky.onlinewebshop.net/girder/
00:30 πŸ”— HP_Archiv this site is a resource for the vintage 'Girders and Panels' educational construction toy sets
00:30 πŸ”— HP_Archiv Can someone submit this into AB please?
00:40 πŸ”— markedL HP_Archiv is this incomplete http://web.archive.org/web/20191112053112/http://narniansky.onlinewebshop.net/girder/
00:43 πŸ”— markedL main thing I see missing is embedded youtube videos
00:43 πŸ”— HP_Archiv I'm not sure, honestly. I was reading the Wikipedia entry, https://en.wikipedia.org/wiki/Girder_and_Panel_building_sets, and some of the nariansky links to examples of the sets on this site had URLs that 404'ed
00:44 πŸ”— HP_Archiv If it's already on WBM I think whatever is there is all there is
00:44 πŸ”— BlueMax has quit IRC (Quit: Leaving)
00:45 πŸ”— HP_Archiv From what I'm reading, a lot of older/vintage toy company information is pretty ephemeral in that the information, unless already indexed somewhere online, is pretty much at-risk
00:45 πŸ”— HP_Archiv Thought I would ask for archiving of this site though if it wasn't already :)
00:47 πŸ”— markedL if you can get the webmaster to fix the 404s it would be worth doing again
00:49 πŸ”— HP_Archiv Sure, I'll shoot him an email
00:53 πŸ”— markedL Oops, yeah a few links are missing in WBM but are online. Safer to AB it now
00:53 πŸ”— m007a83_ has quit IRC (Fuck you Comcast)
00:54 πŸ”— HP_Archiv I just emailed the site owner with a brief summary asking if he can take a look at the links that 404, np
00:55 πŸ”— HP_Archiv I grew up with some of these sets that were re-released in the early 90's and it's very much a throwback, heh
00:56 πŸ”— HP_Archiv Did you put throw into AB just now?
01:07 πŸ”— markedL No, I don't have voice. Ryz seems active. AB? http://narniansky.onlinewebshop.net/girder/
01:08 πŸ”— Ryz This link in particular? Or just http://narniansky.onlinewebshop.net/ ?
01:09 πŸ”— markedL the girder subtree is the interesting part, but his tree store is harmless as well.
01:10 πŸ”— HP_Archiv Thank you markedL and Ryz, appreciate both your help
01:10 πŸ”— markedL I'd guess http://narniansky.onlinewebshop.net/girder/ recursive, no off site.
01:14 πŸ”— Ryz Any specific why using --no-offsite-links ?
01:14 πŸ”— Ryz *specific reason
01:15 πŸ”— killsushi has joined #archiveteam-bs
01:15 πŸ”— markedL I didn't see any offsite links besides youtube, but feel free to do whichever you think is best.
01:16 πŸ”— HP_Archiv I'll let you know if I hear back from the webmaster FYI
01:17 πŸ”— Ryz Do you want me to stand by on the archiving until the 404s are fixed?
01:20 πŸ”— HP_Archiv I don't know if he'll be able to fix them, maybe they were intentionally taken down, etc
01:20 πŸ”— HP_Archiv I would go ahead with it anyway, unless it would cause problems to submit into AB now, find out he fixes them in the future, and then try to put into AB again?
01:20 πŸ”— markedL yeah it's small enough to do twice
01:23 πŸ”— HP_Archiv Okay sounds good, yeah please submit for archiving now. Thank you
01:23 πŸ”— Ryz Running~
01:23 πŸ”— HP_Archiv ^^ Awesome
01:34 πŸ”— Ryz It's done, even with concurrency 1
01:36 πŸ”— HP_Archiv Awesome - What does concurrency 1 mean?
01:43 πŸ”— Ryz So when running either an individual link, a section of the website, or a whole website by default, it would start out with concurrency 3, so basically 3 things are being grabbed at a time, such as either URL (which would scan even more links to go through too in a queue) or a file like an image or ZIP file, the time in-between finished grabs is di
01:43 πŸ”— Ryz ctated by the delay
01:44 πŸ”— Ryz Concurrency 1 is like the minimum of grabbing
01:46 πŸ”— HP_Archiv Oh okay makes sense. So it' grabbed everything in the minimum time it takes to grab something eg; super easy site, right?
01:48 πŸ”— Ryz Yeah, whenever I put run it with concurrency 1 - it's more or less of a gut feeling reaction, like how slow the website is even browsing in general, or how fragile the website could be by how old-looking it's perceived to be
01:49 πŸ”— HP_Archiv I figured, based on the age/how the website looks, heh
01:49 πŸ”— HP_Archiv Alright, Ryz thanks again, you too markedL
02:03 πŸ”— Nick-PC has joined #archiveteam-bs
02:18 πŸ”— HP_Archiv has quit IRC (se.hub efnet.deic.eu)
02:18 πŸ”— X-Scale has quit IRC (se.hub efnet.deic.eu)
02:18 πŸ”— kiskaWee has quit IRC (se.hub efnet.deic.eu)
02:18 πŸ”— halt_ has quit IRC (se.hub efnet.deic.eu)
02:18 πŸ”— ctrl_ has quit IRC (se.hub efnet.deic.eu)
02:29 πŸ”— DigiDigi has quit IRC (Remote host closed the connection)
02:37 πŸ”— af10b3e5e has joined #archiveteam-bs
02:37 πŸ”— d5f4a3622 has quit IRC (Read error: Connection reset by peer)
02:58 πŸ”— X-Scale has joined #archiveteam-bs
03:09 πŸ”— britmob has joined #archiveteam-bs
03:11 πŸ”— britmob has quit IRC (Read error: Connection reset by peer)
03:15 πŸ”— britmob has joined #archiveteam-bs
03:17 πŸ”— britmob_ has joined #archiveteam-bs
04:07 πŸ”— DigiDigi has joined #archiveteam-bs
04:08 πŸ”— britmob_ has quit IRC (Read error: Operation timed out)
04:08 πŸ”— britmob has quit IRC (Read error: Operation timed out)
04:12 πŸ”— qw3rty2 has joined #archiveteam-bs
04:20 πŸ”— qw3rty has quit IRC (Ping timeout: 745 seconds)
04:20 πŸ”— britmob_ has joined #archiveteam-bs
04:20 πŸ”— britmob has joined #archiveteam-bs
04:29 πŸ”— BlueMax has joined #archiveteam-bs
05:13 πŸ”— DogsRNice has quit IRC (Read error: Connection reset by peer)
05:42 πŸ”— ripdog has joined #archiveteam-bs
06:18 πŸ”— Flashfire has quit IRC (Remote host closed the connection)
06:18 πŸ”— kiska has quit IRC (Remote host closed the connection)
06:19 πŸ”— kiska has joined #archiveteam-bs
06:19 πŸ”— Flashfire has joined #archiveteam-bs
06:19 πŸ”— svchfoo1 sets mode: +o kiska
06:19 πŸ”— svchfoo3 sets mode: +o kiska
07:08 πŸ”— Muad-Dib has quit IRC (Ping timeout: 745 seconds)
07:11 πŸ”— Muad-Dib has joined #archiveteam-bs
07:22 πŸ”— godane has quit IRC (Ping timeout: 360 seconds)
07:36 πŸ”— Flashfire has quit IRC (Remote host closed the connection)
07:36 πŸ”— kiska has quit IRC (Remote host closed the connection)
07:36 πŸ”— kiska has joined #archiveteam-bs
07:36 πŸ”— Flashfire has joined #archiveteam-bs
07:36 πŸ”— svchfoo3 sets mode: +o kiska
07:36 πŸ”— svchfoo1 sets mode: +o kiska
08:09 πŸ”— Stiletto has joined #archiveteam-bs
08:10 πŸ”— Stilett0 has quit IRC (Ping timeout: 246 seconds)
09:44 πŸ”— bluefoo has quit IRC (Ping timeout: 496 seconds)
10:26 πŸ”— bluefoo has joined #archiveteam-bs
10:43 πŸ”— BlueMax has quit IRC (Quit: Leaving)
11:39 πŸ”— Flashfire has quit IRC (Remote host closed the connection)
11:39 πŸ”— kiska has quit IRC (Remote host closed the connection)
11:39 πŸ”— kiska has joined #archiveteam-bs
11:40 πŸ”— Flashfire has joined #archiveteam-bs
11:40 πŸ”— svchfoo3 sets mode: +o kiska
11:40 πŸ”— svchfoo1 sets mode: +o kiska
12:10 πŸ”— Sora_Uta has quit IRC (Ping timeout: 276 seconds)
12:23 πŸ”— ctrl_ has joined #archiveteam-bs
12:27 πŸ”— kiskaWee has joined #archiveteam-bs
12:27 πŸ”— svchfoo1 sets mode: +o kiskaWee
12:27 πŸ”— svchfoo3 sets mode: +o kiskaWee
12:27 πŸ”— halt_ has joined #archiveteam-bs
12:32 πŸ”— Wingy has joined #archiveteam-bs
13:14 πŸ”— godane has joined #archiveteam-bs
13:43 πŸ”— godane has quit IRC (Leaving.)
15:19 πŸ”— X-Scale has quit IRC (Quit: HydraIRC -> http://www.hydrairc.com <- Organize your IRC)
16:00 πŸ”— bluefoo has quit IRC (Read error: Operation timed out)
16:02 πŸ”— DogsRNice has joined #archiveteam-bs
16:26 πŸ”— bluefoo has joined #archiveteam-bs
16:31 πŸ”— icedice has joined #archiveteam-bs
16:53 πŸ”— bluefoo has quit IRC (Read error: Operation timed out)
16:54 πŸ”— superkuh_ is now known as superkuh
17:11 πŸ”— Myself has quit IRC (Ping timeout: 276 seconds)
17:13 πŸ”— Myself has joined #archiveteam-bs
18:01 πŸ”— K4k has joined #archiveteam-bs
18:35 πŸ”— Stiletto has quit IRC (Read error: Connection reset by peer)
18:35 πŸ”— Stilett0 has joined #archiveteam-bs
20:06 πŸ”— jamiew has joined #archiveteam-bs
20:17 πŸ”— af10b3e5e has quit IRC (Ping timeout: 255 seconds)
20:19 πŸ”— SketchCow By the way, I talked with the Murfie people
20:20 πŸ”— SketchCow it is all a galacic mess, but we're in there (we, internet archive)
20:20 πŸ”— Sora_Uta has joined #archiveteam-bs
20:32 πŸ”— af10b3e5e has joined #archiveteam-bs
20:35 πŸ”— af10b3e5e has quit IRC (Read error: Connection reset by peer)
20:35 πŸ”— PurpleSym SketchCow: I’m not sure it’s a good idea to upload the current Yahoo! Groups data into archiveteam_yahoogroups. My data files are WARCs, yes, but not compatible with Wayback.
20:37 πŸ”— af10b3e5e has joined #archiveteam-bs
20:38 πŸ”— Kaz why aren't they compatible?
20:38 πŸ”— Kaz (leading to) if they're valid warcs, what needs to be done to make WBM work with them?
20:40 πŸ”— SketchCow Is this because they're new, they've done some very recent work to make those compatibvle.
20:40 πŸ”— PurpleSym Kaz: I’ve been grabbing API responses and splitting them into individual β€œrecords”.
20:41 πŸ”— PurpleSym So I had to invent a proprierary URI scheme that Wayback obviously cannot play back.
20:41 πŸ”— PurpleSym Like WARC-Target-URI: org.archive.yahoogroups:v1/group/academia_abap_jul2013/message/1/info
20:41 πŸ”— SketchCow Well, OK.
20:41 πŸ”— SketchCow 1. That was probably a mistake
20:41 πŸ”— SketchCow 2. We should take the data anyway
20:42 πŸ”— SketchCow So that PurpleSym can do his OWN whack-ass conversion into something, instead of tracking down your bones and hard drive in whatever shipping container you live in
20:42 πŸ”— SketchCow Sorry, I mean PurpleSyn 3000
20:42 πŸ”— SketchCow Your android ancestor
20:43 πŸ”— PurpleSym I was never interested in Wayback integration. These archives are better converted into mbox and viewed with a mail user agent.
20:45 πŸ”— PurpleSym Haha, I guess I have a new nickname now.
20:45 πŸ”— PurpleSym is now known as PSyn3000
20:48 πŸ”— SketchCow Well, upload them.
20:48 πŸ”— SketchCow And please mark this situation.
20:48 πŸ”— SketchCow And do not mark them web. Mark them data.
20:49 πŸ”— atphoenix I don't think getting direct API responses was a mistake. I agree that most of these records (messages) are best viewed in an email client. The saved files and photos *might* be better in web UI. Some kind of conversion will be needed either way, as we also have GMDs. My thought? Convert both API grabs and GMD results of publicly shareable content to something WBM can display. Or just convert them to discrete group data in IA.
20:50 πŸ”— icedice2 has joined #archiveteam-bs
20:51 πŸ”— PSyn3000 is now known as PurpleSym
20:51 πŸ”— jamiew has quit IRC (Textual IRC Client: www.textualapp.com)
20:52 πŸ”— JAA We do not convert data to put it into the WBM. Ever.
20:53 πŸ”— SketchCow You certainly don'.
20:53 πŸ”— SketchCow If archive ever does it, it will go into a thing saying "Shitbin Archive Team Conversion"
20:53 πŸ”— PurpleSym SketchCow: They’re already marked as data, as far as I see.
20:53 πŸ”— SketchCow We have that capability - if you browse a wayback page
20:54 πŸ”— SketchCow Will mention what sources it comes from. That feature's a few years old
20:55 πŸ”— JAA Speaking of AT grab WBM playback, I think it would be nice to add some URL rewrites to the WBM for plays.tv. I know the WBM already ignores values of session ID parameters like "sid"; this would be similar.
20:56 πŸ”— JAA Basically, there are tracking parameters and timestamps in every link on that site, so obviously you can't browse it and always have to remove them manually.
20:56 πŸ”— icedice has quit IRC (Read error: Operation timed out)
20:59 πŸ”— PurpleSym SketchCow: So, could you move either my data or the 2019 WARCs into a different collection? I’ll come up with a proper description for the collection and send it to you.
21:00 πŸ”— PurpleSym (i.e. one that includes these technical details mentioned above)
21:01 πŸ”— SketchCow Eventually.
21:01 πŸ”— SketchCow Send explicit instructions to jason@textfiles.com
21:15 πŸ”— Xibalba has quit IRC (Remote host closed the connection)
21:15 πŸ”— Xibalba has joined #archiveteam-bs
21:26 πŸ”— Mateon1 has quit IRC (Remote host closed the connection)
21:27 πŸ”— Mateon1 has joined #archiveteam-bs
22:29 πŸ”— BlueMax has joined #archiveteam-bs
23:11 πŸ”— Ajay1 has quit IRC (Ping timeout: 276 seconds)
23:11 πŸ”— klg has quit IRC (Ping timeout: 276 seconds)
23:11 πŸ”— klg has joined #archiveteam-bs
23:11 πŸ”— SoraUta has joined #archiveteam-bs
23:12 πŸ”— Ajay1 has joined #archiveteam-bs
23:12 πŸ”— Sora_Uta has quit IRC (Ping timeout: 276 seconds)
23:12 πŸ”— DogsRNice has quit IRC (Ping timeout: 276 seconds)
23:12 πŸ”— Flashfire has quit IRC (Ping timeout: 276 seconds)
23:12 πŸ”— closure has quit IRC (Ping timeout: 276 seconds)
23:12 πŸ”— tuluu has quit IRC (Ping timeout: 276 seconds)
23:12 πŸ”— tuluu has joined #archiveteam-bs
23:12 πŸ”— DogsRNice has joined #archiveteam-bs
23:14 πŸ”— closure has joined #archiveteam-bs
23:26 πŸ”— Stiletto has joined #archiveteam-bs
23:29 πŸ”— Stilett0 has quit IRC (Read error: Operation timed out)

irclogger-viewer