[00:01] i'm only 12k items away from 1.5 million items [01:24] this is interesting: https://www.ebay.com/itm/22-VHS-tapes-of-Japanese-tv-show-program-movie-as-blank-tapes-T120-may-has-ads/202389451320 [02:30] *** achip has quit IRC (west.us.hub irc.Prison.NET) [02:36] *** achip has joined #archiveteam-bs [03:31] *** BlueMax has joined #archiveteam-bs [03:46] *** wp494 has quit IRC (Ping timeout: 252 seconds) [03:47] *** wp494 has joined #archiveteam-bs [04:05] *** Mateon1 has quit IRC (Ping timeout: 268 seconds) [04:05] *** Mateon1 has joined #archiveteam-bs [04:20] *** achip has quit IRC (west.us.hub irc.Prison.NET) [04:27] *** achip has joined #archiveteam-bs [04:37] *** Odd0002 has quit IRC (Quit: ZNC - http://znc.in) [04:42] *** Odd0002 has joined #archiveteam-bs [05:44] *** Stilett0 has joined #archiveteam-bs [07:48] *** godane has quit IRC (Ping timeout: 268 seconds) [07:54] https://torrentfreak.com/movie-tv-show-companies-want-subtitle-sites-blocked-down-under-180802/ [07:54] I understand grabbing all of these sites is not viable as many host pirated content but i would appreciate any help in grabbing subtitles sites in the coming weeks [07:55] *** godane has joined #archiveteam-bs [08:31] *** Smiley has joined #archiveteam-bs [08:33] *** SmileyG has quit IRC (Read error: Operation timed out) [09:29] *** DragonMon has quit IRC (Ping timeout: 252 seconds) [10:15] *** Silvan has quit IRC (No Ping reply in 180 seconds.) [10:15] *** SilSte has joined #archiveteam-bs [12:20] *** Silvan has joined #archiveteam-bs [12:23] *** SilSte has quit IRC (Ping timeout: 480 seconds) [12:25] *** Silvan has quit IRC (Quit: No Ping reply in 180 seconds.) 
[12:31] *** SilSte has joined #archiveteam-bs [13:26] *** BlueMax has quit IRC (Leaving) [14:08] *** schbirid has joined #archiveteam-bs [14:21] jrwr, fair [14:22] So [14:22] sorry for the noise in the main channel [14:22] ah it's all good [14:22] So [14:22] I do know the IA will take anything shipped to it and archive it [14:23] document/tape wise [14:24] that's good -- but arranging the shipping and collecting the fee it would require to move it across the pacific, and dealing with customs ... eh that'll take time [14:25] Ya [14:25] There might be local efforts, I would look into other places that handle computer history [14:26] True, it sounds like there are a number of efforts [14:26] Making a simple book scanner is pretty easy (A glass triangle works well from what I understand) [14:26] Those will keep longer than the tapes [14:27] The tapes will be hit or miss, I would research people who have dumped data from these formats in the past, the drives are a PITA and can do more harm than good sometimes [14:34] indeed ... right now I'm mostly concerned with their physical preservation [14:34] can't dump them if they've been bulldozed :/ [14:34] Right [14:34] and I'd rather they end up at the archive than some private collection, tbh [14:34] ya [14:34] go and email http://www.computerhistory.org/ [14:35] That's a good idea [14:35] and ask what would be best to look out for [14:35] they actually publish a want list [14:35] cool! [14:35] http://www.computerhistory.org/artifactdonation/ [14:35] keep a close eye [14:35] on anything they want [14:35] get everything you can [14:35] yeah, already made plans to go first thing tomorrow and search for things... know any other computer history museums with want lists? [14:36] (I miss usenet for things like this) [14:36] Ya [14:36] I'm not too much on the phys side of things [14:37] Scott would be the best help I think, he is currently traveling I think [14:37] indeed. 
fortunately we have more time than we had for the tech manual rescue job in 2015 [14:37] but not much more [14:38] and it's in Australia... [14:38] Ya [14:38] Focus on the wanted list and spider from there [14:40] will do [14:58] 888 Woodville Rd, Villawood NSW 2163 is the location of the warehouse [15:03] that was surprisingly annoying to find [15:05] *** pikhq has quit IRC (Read error: Operation timed out) [15:10] *** pikhq has joined #archiveteam-bs [15:32] *** Sk2d has joined #archiveteam-bs [15:33] *** Sk1d has quit IRC (Read error: Operation timed out) [15:33] *** Sk2d is now known as Sk1d [16:00] *** vegmitemo has joined #archiveteam-bs [16:01] Holy crap, I've just figured out what's causing the 503s/failed attempts to set metadata. [16:02] This link: http://www.loverslab.com/topic/21438-illustrated-tips-guidelines-making-beautiful-female-characters/?p=520411 (best not to click on it) will cause it to fail. [16:02] But if you change loverslab to some other domain, it works! [16:03] I wouldn't have thought that archive.org blacklisted links from descriptions like that but what other explanation is there? [16:03] vegmitemo: weird, do you know why? What about a www.loverslab.com URL with a different path component? [16:04] znak, nope: http://www.loverslab.com/somepathorsomething fails too. [16:05] http://www.loverslob.com/somepathorsomething works though. [16:05] and I forgot some context--this is when the URL appears in some metadata field of an item you're trying to upload, like {"description": "Source is http://www.loverslab.com/..."}? [16:05] znak, yeah, I had to strip the description from https://archive.org/details/youtube-pKJ4atlC2fA [16:05] That's interesting, seems like a spam filter or something. [16:07] Bit of a weird choice, seems like it hinders archiving efforts rather than helping. [16:08] loverslab.com is in the Wayback Machine too, so maybe a keyword problem like 'lover'? [16:08] But you said that loverslob.com isn't a problem... 
[16:08] Yeah, just realised. [16:09] https://archive.org/details/whoahitsatest here's the test item for reference. [16:11] Not sure how some lewd mods site got the ire of IA, haha. [16:12] It could also plausibly be a malware filter--like maybe at some point loverslab.com got hacked and had JavaScript malware, and ended up on some domain blacklist--and archive.org is filtering it as part of its anti-virus check or whatever. [16:13] znak, I wonder if the big G's safebrowsing filter might have a problem with it. [16:16] Even more interesting, adding https://web.archive.org/ at the beginning doesn't work, so no workaround. [16:17] Maybe not safebrowsing specifically, "No unsafe content found" https://transparencyreport.google.com/safe-browsing/search?url=http:%2F%2Fwww.loverslab.com%2F [16:18] znak, wonder who IA use, or there own solution? [16:18] their* [16:18] People using Tubeup must have been pretty lucky so far not to hit a blacklisted domain (or maybe they haven't realised yet). [16:22] Does IA give older accounts any sort of privileges maybe with uploads? [16:34] *** Stiletto has joined #archiveteam-bs [16:36] *** Stilett0 has quit IRC (Ping timeout: 264 seconds) [16:38] nope [16:38] there's no automatic grants of privileges [16:38] i have an elevated account but that's because i do a bunch of stuff with it [16:44] astrid, any idea how a domain might be rechecked or something? As infamous as that place is I don't remember any sort of malware scandal. [16:54] domain rechecked? what do you mean? [16:56] astrid, if the domain is blocked because of found malware, if it were rechecked the malicious file(s) might have been purged since. It's not impossible for people to get hacked so a perma-block would make archiving things 'lossless'ly more difficult. [16:56] i still need more context [16:56] is this on ia? [16:56] what do you mean by "blocked" [16:56] can you provide an example? 
[16:58] just so you know, i'm not super up on internal IA processes [16:58] i don't work there ;) [16:58] An upload to archive.org or metadata change will fail if a field contains a blacklisted domain (in this case loverslab.com). I was wondering if there's a way domains can be rechecked for malware (or whatever IA didn't like about the site) so they aren't blacklisted. Realise you aren't from IA, just curious if you had some insight. [16:59] ahh [16:59] well that probly means they got a spam run mentioning that domain [16:59] email info@archive.org and they will probably help, or fix it, or whatever needs to be done [16:59] IA has been suffering under a lot of spam items lately [17:00] *** SilSte has quit IRC (Read error: Operation timed out) [17:00] Yeah, I've seen some of them. [17:00] and one thing they might do is blacklist mentioning particular domains in metadata [17:00] *** SilSte has joined #archiveteam-bs [17:00] I even encountered an item that redirected to another domain when clicked (this was months ago though). [17:00] eep [17:00] that's a bug and should be reported [17:01] do you know what item that was? [17:01] I'll see if it still exists, one moment. [17:03] https://archive.org/details/fav-zolotaia_orda_kriak this was it I think, looks like that was fixed though. [17:04] ah yeah they did [17:04] [17:04] Ouch. [17:04] I was going to say, sounds like an XSS. [17:05] (at the bottom of the description field) [17:05] Always a shame when people abuse such a useful service. [17:06] it's gonna happen [17:06] always does [17:06] One of those laws of the universe or something. [17:08] *** JH88 has joined #archiveteam-bs [17:15] Another one for future reference, a link per field limit of 16. 
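For reference, the two observations above (a metadata field mentioning loverslab.com makes the upload or metadata change fail, and a field seems limited to 16 links) could be turned into a local check that runs before an upload. This is only a rough sketch under assumptions: the blocklist is a hypothetical hand-maintained set containing the one domain reported here, the limit of 16 is taken from the message above and is not confirmed by archive.org, the way links are counted is a guess, and `check_metadata` is a made-up helper name. It does not call any archive.org API.

```python
import re

# Hypothetical, hand-maintained list: the only domain reported in the log above.
BLOCKED_DOMAINS = {"loverslab.com"}
# Per-field link limit as reported in the log above (unconfirmed assumption).
MAX_LINKS_PER_FIELD = 16

# Any dotted hostname-like token, so a domain nested inside another URL
# (for example behind a https://web.archive.org/ prefix) is still seen.
HOST_RE = re.compile(r"[a-z0-9-]+(?:\.[a-z0-9-]+)+", re.IGNORECASE)
# A link is counted as one occurrence of an http:// or https:// scheme.
LINK_RE = re.compile(r"https?://", re.IGNORECASE)


def is_blocked(host):
    """True if host is a listed domain or a subdomain of one."""
    host = host.lower()
    return any(host == d or host.endswith("." + d) for d in BLOCKED_DOMAINS)


def check_metadata(metadata):
    """Return a list of problems found in the string fields of metadata."""
    problems = []
    for field, value in metadata.items():
        if not isinstance(value, str):
            continue
        link_count = len(LINK_RE.findall(value))
        if link_count > MAX_LINKS_PER_FIELD:
            problems.append(
                f"{field}: {link_count} links (limit {MAX_LINKS_PER_FIELD})"
            )
        # Deduplicate hosts so each blocked host is reported once per field.
        for host in sorted({h.lower() for h in HOST_RE.findall(value)}):
            if is_blocked(host):
                problems.append(f"{field}: mentions blocked domain {host}")
    return problems


if __name__ == "__main__":
    item = {"description": "Source is http://www.loverslab.com/somepathorsomething"}
    for problem in check_metadata(item):
        print(problem)
```

An uploader script could call `check_metadata` on the item's metadata and strip or rewrite the offending field before sending anything, instead of finding out through a 503.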
[17:44] *** SilSte has quit IRC (Read error: Operation timed out) [17:47] *** SilSte has joined #archiveteam-bs [17:52] *** Silvan has joined #archiveteam-bs [17:55] *** SilSte has quit IRC (Read error: Operation timed out) [18:10] *** ebel_ has quit IRC (Ping timeout: 480 seconds) [18:16] *** vegmitemo has quit IRC (Quit: Leaving) [18:18] *** ebel_ has joined #archiveteam-bs [18:31] *** godane has quit IRC (Leaving.) [19:05] this hero is uploading old MS Knowledge Base entries to Github : https://jeffpar.github.io/kbarchive/ [19:41] *** godane has joined #archiveteam-bs [20:22] *** ta9le has joined #archiveteam-bs [21:33] *** JH88 has quit IRC (Quit: JH88) [21:35] *** jspiros has quit IRC (Read error: Operation timed out) [21:35] *** Petri152 has quit IRC (Read error: Operation timed out) [21:36] *** SketchCo1 has joined #archiveteam-bs [21:36] *** swebb sets mode: +o SketchCo1 [21:36] *** wabu has quit IRC (Read error: Operation timed out) [21:36] *** SketchCow has quit IRC (Read error: Operation timed out) [21:36] *** Stiletto has quit IRC (Ping timeout: 246 seconds) [21:36] *** nightpool has quit IRC (Ping timeout: 246 seconds) [21:36] *** mundus201 has quit IRC (Ping timeout: 246 seconds) [21:38] *** zyphlar has quit IRC (Ping timeout: 246 seconds) [21:38] *** TC01 has quit IRC (Ping timeout: 246 seconds) [21:38] *** midas2 has quit IRC (Ping timeout: 246 seconds) [21:38] *** closure_ has quit IRC (Ping timeout: 246 seconds) [21:38] *** chfoo has quit IRC (Ping timeout: 246 seconds) [21:38] *** ivan` has joined #archiveteam-bs [21:38] *** robogoat has quit IRC (Ping timeout: 246 seconds) [21:38] *** second has quit IRC (Ping timeout: 246 seconds) [21:38] *** ivan has quit IRC (Ping timeout: 246 seconds) [21:38] *** underscor has quit IRC (Ping timeout: 246 seconds) [21:38] *** JAA has quit IRC (Ping timeout: 246 seconds) [21:38] *** balrog has quit IRC (Read error: Operation timed out) [21:39] *** Stilett0 has joined #archiveteam-bs [21:39] *** mundus201 has 
joined #archiveteam-bs [21:40] *** Mayonaise has quit IRC (Read error: Operation timed out) [21:40] *** balrog has joined #archiveteam-bs [21:40] *** swebb sets mode: +o balrog [21:40] *** Silvan has quit IRC (Read error: Operation timed out) [21:41] *** TC01 has joined #archiveteam-bs [21:42] *** closure has joined #archiveteam-bs [21:42] *** second has joined #archiveteam-bs [21:42] *** underscor has joined #archiveteam-bs [21:42] *** swebb sets mode: +o underscor [21:43] *** chfoo has joined #archiveteam-bs [21:45] *** SilSte has joined #archiveteam-bs [21:45] *** nightpool has joined #archiveteam-bs [21:47] *** robogoat has joined #archiveteam-bs [21:51] *** midas2 has joined #archiveteam-bs [21:52] *** Mayonaise has joined #archiveteam-bs [22:17] so this item is down for some reason : https://archive.org/details/DTIC_ADA530010 [22:19] same for this item : https://archive.org/details/DTIC_ADA530317 [22:19] what was it? [22:19] DTIC ADA530010: Quality Improvement of Cheese Spread [22:19] yup [22:19] strange, i can't see much in the info about why it got darked or whatever [22:19] its not darked based on the history logs [22:20] er right [22:20] because it's "defense" cheese spread [22:21] this one is waiting to derive: https://archive.org/details/DTIC_AD0530744 [22:36] *** Petri152 has joined #archiveteam-bs [22:36] *** zyphlar has joined #archiveteam-bs [22:36] *** wabu has joined #archiveteam-bs [22:37] *** JAA has joined #archiveteam-bs [22:37] *** swebb sets mode: +o JAA [22:37] *** bakJAA sets mode: +o JAA [22:39] *** jspiros has joined #archiveteam-bs [22:49] *** BlueMax has joined #archiveteam-bs [23:20] *** balrog has quit IRC (Quit: Bye) [23:36] *** ta9le has quit IRC (Quit: Connection closed for inactivity)