[00:14] *** Asparagir has quit IRC (Asparagir) [00:14] *** godane has joined #archiveteam-bs [00:20] *** pizzaiolo has joined #archiveteam-bs [00:24] looks like 2 of my tapes i bought have white mold in them [00:29] *** godane has quit IRC (Ping timeout: 260 seconds) [00:30] *** pizzaiolo has quit IRC (Quit: pizzaiolo) [00:44] *** pizzaiolo has joined #archiveteam-bs [00:45] *** pizzaiolo has quit IRC (pizzaiolo) [00:46] *** BlueMaxim has joined #archiveteam-bs [00:55] *** godane has joined #archiveteam-bs [00:55] *** pizzaiolo has joined #archiveteam-bs [01:16] euw [01:21] hey astrid [01:21] hey hi [01:22] was the euw of the white mold i found? [01:22] yeah [01:22] ok [01:22] how're you doing these days, godane? [01:22] i'm doing ok [01:23] so think i got 20 tapes instead of 19 [01:24] well that's a bit of an extra win to make up for the mold, i hope :) [01:25] only one of the 2 types i wanted to digitize really [01:25] it was really bad with mold [01:25] the charlie brown was not has bad has that one [01:26] i also got 2 Time Life's The Best of the Muppet Show [01:26] still sealed in package [01:41] *** schbirid2 has joined #archiveteam-bs [01:45] *** schbirid has quit IRC (Read error: Operation timed out) [01:45] *** pizzaiolo has quit IRC (Quit: pizzaiolo) [01:46] *** pizzaiolo has joined #archiveteam-bs [01:50] *** pizzaiolo has quit IRC (Client Quit) [01:51] *** godane has quit IRC (Quit: Leaving.) [01:53] *** godane has joined #archiveteam-bs [01:54] my wifi keeps on disconnecting [01:55] *** kvieta has quit IRC (Quit: greedo shot first) [01:55] *** kvieta- is now known as kvieta [01:58] *** mkram has quit IRC (Ping timeout: 194 seconds) [02:35] *** godane has quit IRC (Ping timeout: 633 seconds) [03:20] *** godane has joined #archiveteam-bs [03:27] I don't know if I'm ready for pre-calc and I have to decide in 1.5 hours if I want a refund for this class [03:48] hook54321, the thing is, Lunduke is really working for the Aliens..His mission: Hand over control of W3C to the base on Planet Nibiru.. [03:49] huh? [03:49] *** ola_norsk has quit IRC (Mark my woooooords..Fleeee) [04:16] *** Stilett0- has joined #archiveteam-bs [04:32] *** Sk1d has quit IRC (Ping timeout: 250 seconds) [04:39] *** Sk1d has joined #archiveteam-bs [06:20] *** Mateon1 has quit IRC (Read error: Operation timed out) [06:25] *** Mateon1 has joined #archiveteam-bs [08:00] *** atomicthu has quit IRC (No Ping reply in 180 seconds.) [08:06] *** atomicthu has joined #archiveteam-bs [10:32] *** icedice has joined #archiveteam-bs [10:32] *** icedice has quit IRC (Remote host closed the connection) [10:36] *** icedice has joined #archiveteam-bs [11:13] i suck at regex... wired has some nested errors: [11:13] https://www.wired.com/video/2016/11/worried-about-your-privacy-now-here-s-how-to-protect-it/h_274/wp-content/uploads/2017/03/wired_ryan-reynolds-jake-gyllenhaal-answer-the-web-s-most-searched-questions-6-600x338.jpg [11:14] i want to disallow any url with /wp-content/ in it UNLESS it is preceeded by wired.com: www.wired.com/wp-content/ is fine [11:14] halp :D [11:14] target is wpull, so python [11:16] schbirid2: (? That's a negative lookbehind, in case you want to look it up. [11:18] yay, thanks! [11:18] Alternatively, if you want to allow only https://www.wired.com/wp-content/... but not https://www.wired.com/something/blub/www.wired.com/wp-content/...: ^https?://www\.wired\.com/[^/]+.*/wp-content/ [11:19] we will see =) [11:19] https://regex101.com/ is amazing btw [11:19] Or pythex.org if you're working with Python. [11:20] regex101 seems to require cookies and possibly other stuff, nothanks. [11:21] (pythex.org works fine with just scripting.) [12:05] *** pizzaiolo has joined #archiveteam-bs [12:26] JAA: avoiding things that need cookies makes no sense; they're an important part of HTTP for persisting sessions and such [12:26] if you're concerned about trackers, just use something like privacy badger :) [12:27] (also, regex101 is far, *far* more extensive than pythex) [12:28] *** icedice has quit IRC (Read error: Operation timed out) [12:29] joepie91: I'm fine with cookies where they make sense. regex101 doesn't use any server-side sessions etc. unless you log in, so it should work without cookies. [12:30] that seems like a fairly irrelevant thing to nitpick on... [12:31] like, you're painting cookies here as some sort of inherently evil thing that must be justified on a case-by-case basis [12:31] which is wildly off from what cookies actually are [12:33] *** BlueMaxim has quit IRC (Read error: Connection reset by peer) [12:33] This isn't just about cookies. I'm against misusing browser features in general. Another example are sites which are unusable without JavaScript although they don't need JS to work (at least for the most part), like newspaper websites covering the article contents with a white box which is removed with JS. [12:35] If regex101 stored the regex and test strings in localStorage, it'd be fine that it requires "cookies" (since that setting also controls access to localStorage at least in Firefox), but it doesn't. [12:37] *** icedice has joined #archiveteam-bs [13:34] *** dd0a13f37 has joined #archiveteam-bs [14:35] mundus: What is MedicalDocs.tar ? [14:36] second, this https://the-eye.eu/DataHoarder/Medical/ [14:37] Thanks [14:41] hey odemg, you're archiving private trackers right? [14:41] Are you archiving the torrent files too? (not the contents) [15:11] https://pastebin.com/Nz672PWc - archives torrent files from bittorrent dht [15:40] Where the comic collection on theye go? [15:47] dd0a13f37, yes. [15:48] good [15:48] Just got another response from one of the torrent site operators [15:53] dd0a13f37, I'm being banned for doing this btw. [15:55] *** Smiley has quit IRC (Ping timeout: 255 seconds) [15:57] Banned for what? Scraping? [15:58] Yup, I was banned from RED and PTP last night over this, well kinda. [15:59] How did they catch you? [15:59] They didn't someone grassed me up [16:00] Now tracker staff lurk where I hang out and ban people that affil with me that they can link to accounts on their sites [16:02] :( [16:03] *** Smiley has joined #archiveteam-bs [16:09] How often do people reuse usernames on private trackers? Not often? [16:16] all the time, people are stupid [16:25] *** Stilett0- is now known as Stiletto [16:36] But as a fraction [16:36] 5%? 10%? 50%? [16:43] I mean, there is always the nuclear option of applying a combolist until you get an account to use for scraping, hard to rat you out when they don't know, but then you'd get permabanned on all "legitimate" accounts and piss them off [16:49] *** icedice has quit IRC (Quit: Leaving) [17:21] *** qwebirc89 has joined #archiveteam-bs [17:23] *** dd0a13f37 has quit IRC (Ping timeout: 268 seconds) [17:24] *** qwebirc89 is now known as dd0a13f37 [17:51] *** Stiletto has quit IRC () [18:02] Yeah, I got banned of RED just because I'm in the-eye [18:05] Disclaimer: I am not affiliated with odemg. [18:16] *** dd0a13f37 has quit IRC (Ping timeout: 268 seconds) [18:28] *** Stilett0- has joined #archiveteam-bs [18:28] *** Stilett0- is now known as Stiletto [19:25] *** schbirid2 has quit IRC (Quit: Leaving) [19:26] oh shit azure launched tape storage last month [19:26] 0.18 cents/gb/month [19:37] espes__: Link? I guess that's the "Azure Archive Blob Storage", but I couldn't find any details about it. [19:38] Ah wait, there it is. A small footnote mentioning that "Archive preview is only available in East US 2 region". [19:41] Also, "The Archive prices shown below are preview prices and will go up at general availability." [19:43] *** VADemon has joined #archiveteam-bs [20:51] JAA: still interested in EPG data things? [20:55] dashcloud: Yep [20:56] I found xmltv.se in the meantime, which has current EPG data at least. [20:56] your best choice in the US is Schedules Direct: http://schedulesdirect.org/ [20:58] Ah yes, I read about that somewhere. Do they have archives as well or only current data? [20:58] if you wanted historical EPG data, I'm not even sure anyone sells that- maybe buy TV Guides [20:58] if you have a lot of money, I'm sure you could buy an archive of historical data from one of the big players, but unlikely otherwise [20:59] :-| [20:59] TV Guide + OCR would be my recommedation if you want historical data [20:59] which won't get you everything, but would give you a great cross-section of data, especially on broadcast and the low cable channels [21:02] if you want an archive going forward though, Schedules Direct is insanely cheap for what it offers [21:05] Mhm. Unfortunately though, their subscription agreement forbids redistribution. So uploading to IA would be a no-no. [21:10] yes, that would be a problem [21:24] *** Jonison has joined #archiveteam-bs [21:26] *** Asparagir has joined #archiveteam-bs [22:12] *** Jonison has quit IRC (Read error: Connection reset by peer) [22:20] *** fie has quit IRC (Ping timeout: 250 seconds) [22:27] Ugh [22:28] Im reduced to sleeping on a air mattress in my new apartment. Need to work on getting an internet connection now [22:35] *** fie has joined #archiveteam-bs [23:01] *** arbin has joined #archiveteam-bs [23:04] joepie91: is the despeckle filter only supposed to work on the perimeter of the scan? [23:05] it seems to remove speckling from all around the outside, destroying text in the process, but it doesn't despeckle the other 99% of the image [23:05] arbin: no, it's supposed to work everywhere; however, I'd imagine that it's not as accurate on something where the contrast isn't as high [23:05] it's been a while since I've used it though [23:06] you may need to twiddle the settings a bunch [23:06] doesn't seem to do any better then 3x blur + 3x downsample [23:06] thank you though :) [23:54] *** VADemon has quit IRC (Quit: left4dead) [23:55] *** BlueMaxim has joined #archiveteam-bs