[00:05] *** Stilett0 has joined #archiveteam-bs
[00:08] *** Stiletto has quit IRC (Read error: Operation timed out)
[00:29] *** HP_Archiv has joined #archiveteam-bs
[00:30] I have another request for an Ops - http://narniansky.onlinewebshop.net/girder/
[00:30] this site is a resource for the vintage 'Girders and Panels' educational construction toy sets
[00:30] Can someone submit this into AB please?
[00:40] HP_Archiv: is this incomplete? http://web.archive.org/web/20191112053112/http://narniansky.onlinewebshop.net/girder/
[00:43] main thing I see missing is embedded youtube videos
[00:43] I'm not sure, honestly. I was reading the Wikipedia entry, https://en.wikipedia.org/wiki/Girder_and_Panel_building_sets, and some of the narniansky links to examples of the sets on this site had URLs that 404'ed
[00:44] If it's already on WBM I think whatever is there is all there is
[00:44] *** BlueMax has quit IRC (Quit: Leaving)
[00:45] From what I'm reading, a lot of older/vintage toy company information is pretty ephemeral in that the information, unless already indexed somewhere online, is pretty much at-risk
[00:45] Thought I would ask for archiving of this site though if it wasn't already :)
[00:47] if you can get the webmaster to fix the 404s it would be worth doing again
[00:49] Sure, I'll shoot him an email
[00:53] Oops, yeah a few links are missing in WBM but are online. Safer to AB it now
[00:53] *** m007a83_ has quit IRC (Fuck you Comcast)
[00:54] I just emailed the site owner with a brief summary asking if he can take a look at the links that 404, np
[00:55] I grew up with some of these sets that were re-released in the early 90's and it's very much a throwback, heh
[00:56] Did you put this into AB just now?
[01:07] No, I don't have voice. Ryz seems active. AB? http://narniansky.onlinewebshop.net/girder/
[01:08] This link in particular? Or just http://narniansky.onlinewebshop.net/ ?
[01:09] the girder subtree is the interesting part, but his tree store is harmless as well.
[01:10] Thank you markedL and Ryz, appreciate both your help
[01:10] I'd guess http://narniansky.onlinewebshop.net/girder/ recursive, no off site.
[01:14] Any specific reason for using --no-offsite-links?
[01:15] *** killsushi has joined #archiveteam-bs
[01:15] I didn't see any offsite links besides youtube, but feel free to do whichever you think is best.
[01:16] I'll let you know if I hear back from the webmaster FYI
[01:17] Do you want me to stand by on the archiving until the 404s are fixed?
[01:20] I don't know if he'll be able to fix them, maybe they were intentionally taken down, etc
[01:20] I would go ahead with it anyway, unless it would cause problems to submit into AB now, find out he fixes them in the future, and then try to put into AB again?
[01:20] yeah it's small enough to do twice
[01:23] Okay sounds good, yeah please submit for archiving now. Thank you
[01:23] Running~
[01:23] ^^ Awesome
[01:34] It's done, even with concurrency 1
[01:36] Awesome - What does concurrency 1 mean?
[01:43] So when running an individual link, a section of a website, or a whole website, it starts out with concurrency 3 by default, so basically 3 things are being grabbed at a time, such as a URL (which would be scanned for even more links to add to the queue) or a file like an image or ZIP file; the time in between finished grabs is dictated by the delay
[01:44] Concurrency 1 is like the minimum of grabbing
[01:46] Oh okay makes sense. So it grabbed everything in the minimum time it takes to grab something, e.g. a super easy site, right?
[01:48] Yeah, whenever I run it with concurrency 1, it's more or less a gut-feeling reaction, like how slow the website is even when browsing in general, or how fragile the website could be judging by how old it looks
[01:49] I figured, based on the age/how the website looks, heh
[01:49] Alright, Ryz thanks again, you too markedL
[02:03] *** Nick-PC has joined #archiveteam-bs
[02:18] *** HP_Archiv has quit IRC (se.hub efnet.deic.eu)
[02:18] *** X-Scale has quit IRC (se.hub efnet.deic.eu)
[02:18] *** kiskaWee has quit IRC (se.hub efnet.deic.eu)
[02:18] *** halt_ has quit IRC (se.hub efnet.deic.eu)
[02:18] *** ctrl_ has quit IRC (se.hub efnet.deic.eu)
[02:29] *** DigiDigi has quit IRC (Remote host closed the connection)
[02:37] *** af10b3e5e has joined #archiveteam-bs
[02:37] *** d5f4a3622 has quit IRC (Read error: Connection reset by peer)
[02:58] *** X-Scale has joined #archiveteam-bs
[03:09] *** britmob has joined #archiveteam-bs
[03:11] *** britmob has quit IRC (Read error: Connection reset by peer)
[03:15] *** britmob has joined #archiveteam-bs
[03:17] *** britmob_ has joined #archiveteam-bs
[04:07] *** DigiDigi has joined #archiveteam-bs
[04:08] *** britmob_ has quit IRC (Read error: Operation timed out)
[04:08] *** britmob has quit IRC (Read error: Operation timed out)
[04:12] *** qw3rty2 has joined #archiveteam-bs
[04:20] *** qw3rty has quit IRC (Ping timeout: 745 seconds)
[04:20] *** britmob_ has joined #archiveteam-bs
[04:20] *** britmob has joined #archiveteam-bs
[04:29] *** BlueMax has joined #archiveteam-bs
[05:13] *** DogsRNice has quit IRC (Read error: Connection reset by peer)
[05:42] *** ripdog has joined #archiveteam-bs
[06:18] *** Flashfire has quit IRC (Remote host closed the connection)
[06:18] *** kiska has quit IRC (Remote host closed the connection)
[06:19] *** kiska has joined #archiveteam-bs
[06:19] *** Flashfire has joined #archiveteam-bs
[06:19] *** svchfoo1 sets mode: +o kiska
[06:19] *** svchfoo3 sets mode: +o kiska
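The concurrency and delay settings explained at [01:43] can be pictured with a small sketch. This is not ArchiveBot's code, only an illustration under stated assumptions: `crawl`, `fake_fetch`, and the fixed URL list are hypothetical, and unlike a real crawl no newly discovered links are added to the queue.

```python
import queue
import threading
import time


def crawl(urls, fetch, concurrency=3, delay=0.0):
    """Grab every URL using `concurrency` workers.

    Each worker pauses `delay` seconds after finishing a grab.
    Returns (results, peak), where peak is the highest number of
    grabs that were in flight at the same moment.
    """
    pending = queue.Queue()
    for url in urls:
        pending.put(url)

    results = {}
    lock = threading.Lock()
    active = 0
    peak = 0

    def worker():
        nonlocal active, peak
        while True:
            try:
                url = pending.get_nowait()
            except queue.Empty:
                return  # nothing left to grab
            with lock:
                active += 1
                peak = max(peak, active)
            body = fetch(url)
            with lock:
                active -= 1
                results[url] = body
            time.sleep(delay)  # the pause between finished grabs

    threads = [threading.Thread(target=worker) for _ in range(concurrency)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results, peak


def fake_fetch(url):
    # Hypothetical stand-in for an HTTP request; sleeps so grabs can overlap.
    time.sleep(0.01)
    return f"contents of {url}"
```

Calling `crawl(urls, fake_fetch, concurrency=1)` never has more than one grab in flight, which is the gentle setting chosen here for an old, possibly fragile site; the default of 3 lets up to three grabs overlap.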
[07:08] *** Muad-Dib has quit IRC (Ping timeout: 745 seconds)
[07:11] *** Muad-Dib has joined #archiveteam-bs
[07:22] *** godane has quit IRC (Ping timeout: 360 seconds)
[07:36] *** Flashfire has quit IRC (Remote host closed the connection)
[07:36] *** kiska has quit IRC (Remote host closed the connection)
[07:36] *** kiska has joined #archiveteam-bs
[07:36] *** Flashfire has joined #archiveteam-bs
[07:36] *** svchfoo3 sets mode: +o kiska
[07:36] *** svchfoo1 sets mode: +o kiska
[08:09] *** Stiletto has joined #archiveteam-bs
[08:10] *** Stilett0 has quit IRC (Ping timeout: 246 seconds)
[09:44] *** bluefoo has quit IRC (Ping timeout: 496 seconds)
[10:26] *** bluefoo has joined #archiveteam-bs
[10:43] *** BlueMax has quit IRC (Quit: Leaving)
[11:39] *** Flashfire has quit IRC (Remote host closed the connection)
[11:39] *** kiska has quit IRC (Remote host closed the connection)
[11:39] *** kiska has joined #archiveteam-bs
[11:40] *** Flashfire has joined #archiveteam-bs
[11:40] *** svchfoo3 sets mode: +o kiska
[11:40] *** svchfoo1 sets mode: +o kiska
[12:10] *** Sora_Uta has quit IRC (Ping timeout: 276 seconds)
[12:23] *** ctrl_ has joined #archiveteam-bs
[12:27] *** kiskaWee has joined #archiveteam-bs
[12:27] *** svchfoo1 sets mode: +o kiskaWee
[12:27] *** svchfoo3 sets mode: +o kiskaWee
[12:27] *** halt_ has joined #archiveteam-bs
[12:32] *** Wingy has joined #archiveteam-bs
[13:14] *** godane has joined #archiveteam-bs
[13:43] *** godane has quit IRC (Leaving.)
[15:19] *** X-Scale has quit IRC (Quit: HydraIRC -> http://www.hydrairc.com <- Organize your IRC)
[16:00] *** bluefoo has quit IRC (Read error: Operation timed out)
[16:02] *** DogsRNice has joined #archiveteam-bs
[16:26] *** bluefoo has joined #archiveteam-bs
[16:31] *** icedice has joined #archiveteam-bs
[16:53] *** bluefoo has quit IRC (Read error: Operation timed out)
[16:54] *** superkuh_ is now known as superkuh
[17:11] *** Myself has quit IRC (Ping timeout: 276 seconds)
[17:13] *** Myself has joined #archiveteam-bs
[18:01] *** K4k has joined #archiveteam-bs
[18:35] *** Stiletto has quit IRC (Read error: Connection reset by peer)
[18:35] *** Stilett0 has joined #archiveteam-bs
[20:06] *** jamiew has joined #archiveteam-bs
[20:17] *** af10b3e5e has quit IRC (Ping timeout: 255 seconds)
[20:19] By the way, I talked with the Murfie people
[20:20] it is all a galactic mess, but we're in there (we, internet archive)
[20:20] *** Sora_Uta has joined #archiveteam-bs
[20:32] *** af10b3e5e has joined #archiveteam-bs
[20:35] *** af10b3e5e has quit IRC (Read error: Connection reset by peer)
[20:35] SketchCow: I’m not sure it’s a good idea to upload the current Yahoo! Groups data into archiveteam_yahoogroups. My data files are WARCs, yes, but not compatible with Wayback.
[20:37] *** af10b3e5e has joined #archiveteam-bs
[20:38] why aren't they compatible?
[20:38] (leading to) if they're valid warcs, what needs to be done to make WBM work with them?
[20:40] Is this because they're new? They've done some very recent work to make those compatible.
[20:40] Kaz: I’ve been grabbing API responses and splitting them into individual “records”.
[20:41] So I had to invent a proprietary URI scheme that Wayback obviously cannot play back.
[20:41] Like WARC-Target-URI: org.archive.yahoogroups:v1/group/academia_abap_jul2013/message/1/info
[20:41] Well, OK.
[20:41] 1. That was probably a mistake
[20:41] 2. We should take the data anyway
[20:42] So that PurpleSym can do his OWN whack-ass conversion into something, instead of tracking down your bones and hard drive in whatever shipping container you live in
[20:42] Sorry, I mean PurpleSyn 3000
[20:42] Your android ancestor
[20:43] I was never interested in Wayback integration. These archives are better converted into mbox and viewed with a mail user agent.
[20:45] Haha, I guess I have a new nickname now.
[20:45] *** PurpleSym is now known as PSyn3000
[20:48] Well, upload them.
[20:48] And please mark this situation.
[20:48] And do not mark them web. Mark them data.
[20:49] I don't think getting direct API responses was a mistake. I agree that most of these records (messages) are best viewed in an email client. The saved files and photos *might* be better in web UI. Some kind of conversion will be needed either way, as we also have GMDs. My thought? Convert both API grabs and GMD results of publicly shareable content to something WBM can display. Or just convert them to discrete group data in IA.
[20:50] *** icedice2 has joined #archiveteam-bs
[20:51] *** PSyn3000 is now known as PurpleSym
[20:51] *** jamiew has quit IRC (Textual IRC Client: www.textualapp.com)
[20:52] We do not convert data to put it into the WBM. Ever.
[20:53] You certainly don't.
[20:53] If archive ever does it, it will go into a thing saying "Shitbin Archive Team Conversion"
[20:53] SketchCow: They’re already marked as data, as far as I see.
[20:53] We have that capability - if you browse a wayback page
[20:54] Will mention what sources it comes from. That feature's a few years old
[20:55] Speaking of AT grab WBM playback, I think it would be nice to add some URL rewrites to the WBM for plays.tv. I know the WBM already ignores values of session ID parameters like "sid"; this would be similar.
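For anyone converting these records later, the custom WARC-Target-URI quoted at [20:41] can be split into its parts along these lines. This is only a sketch: the log shows a single example URI, so the assumed layout `<scheme>:<version>/group/<group>/<rest...>` and the names `GroupRecord` and `parse_target_uri` are hypothetical, not part of the actual grab tooling.

```python
from typing import NamedTuple, Tuple


class GroupRecord(NamedTuple):
    scheme: str             # e.g. "org.archive.yahoogroups"
    version: str            # e.g. "v1"
    group: str              # group name
    path: Tuple[str, ...]   # remaining segments, e.g. ("message", "1", "info")


def parse_target_uri(uri: str) -> GroupRecord:
    """Split a custom target URI, assuming the layout seen in the one logged example."""
    scheme, sep, rest = uri.partition(":")
    if not sep:
        raise ValueError(f"no scheme separator in {uri!r}")
    parts = rest.split("/")
    # Assumed shape: <version>/group/<group name>/<record path...>
    if len(parts) < 3 or parts[1] != "group":
        raise ValueError(f"unexpected layout in {uri!r}")
    return GroupRecord(scheme, parts[0], parts[2], tuple(parts[3:]))
```

Applied to the logged example, this yields the group `academia_abap_jul2013` and the record path `("message", "1", "info")`, which is the kind of key an mbox conversion would need to group messages by.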
[20:56] Basically, there are tracking parameters and timestamps in every link on that site, so obviously you can't browse it and always have to remove them manually.
[20:56] *** icedice has quit IRC (Read error: Operation timed out)
[20:59] SketchCow: So, could you move either my data or the 2019 WARCs into a different collection? I’ll come up with a proper description for the collection and send it to you.
[21:00] (i.e. one that includes these technical details mentioned above)
[21:01] Eventually.
[21:01] Send explicit instructions to jason@textfiles.com
[21:15] *** Xibalba has quit IRC (Remote host closed the connection)
[21:15] *** Xibalba has joined #archiveteam-bs
[21:26] *** Mateon1 has quit IRC (Remote host closed the connection)
[21:27] *** Mateon1 has joined #archiveteam-bs
[22:29] *** BlueMax has joined #archiveteam-bs
[23:11] *** Ajay1 has quit IRC (Ping timeout: 276 seconds)
[23:11] *** klg has quit IRC (Ping timeout: 276 seconds)
[23:11] *** klg has joined #archiveteam-bs
[23:11] *** SoraUta has joined #archiveteam-bs
[23:12] *** Ajay1 has joined #archiveteam-bs
[23:12] *** Sora_Uta has quit IRC (Ping timeout: 276 seconds)
[23:12] *** DogsRNice has quit IRC (Ping timeout: 276 seconds)
[23:12] *** Flashfire has quit IRC (Ping timeout: 276 seconds)
[23:12] *** closure has quit IRC (Ping timeout: 276 seconds)
[23:12] *** tuluu has quit IRC (Ping timeout: 276 seconds)
[23:12] *** tuluu has joined #archiveteam-bs
[23:12] *** DogsRNice has joined #archiveteam-bs
[23:14] *** closure has joined #archiveteam-bs
[23:26] *** Stiletto has joined #archiveteam-bs
[23:29] *** Stilett0 has quit IRC (Read error: Operation timed out)
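The plays.tv rewrite suggested at [20:55] amounts to dropping tracking parameters from a URL before looking it up. A minimal sketch of that idea follows; it is not how the Wayback Machine implements its rewrites. Only `sid` comes from the discussion above; the other parameter names, the `canonicalize` helper, and the example path are hypothetical.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Only "sid" is named in the log; "ts" and "from" are made-up placeholders
# for the tracking and timestamp parameters mentioned there.
IGNORED_PARAMS = frozenset({"sid", "ts", "from"})


def canonicalize(url, ignored=IGNORED_PARAMS):
    """Return `url` with every ignored query parameter removed."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key not in ignored
    ]
    return urlunsplit(parts._replace(query=urlencode(kept)))
```

With this, `canonicalize("https://plays.tv/video/example?sid=123&page=2")` keeps `page=2` and drops `sid`, so two links that differ only in tracking values resolve to the same capture key.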