[00:23] *** Atom-- has joined #archiveteam-ot [00:25] *** Atom__ has quit IRC (Ping timeout: 252 seconds) [03:09] Is there a twitch archiver? [03:28] *** qw3rty has joined #archiveteam-ot [03:38] *** qw3rty2 has quit IRC (Ping timeout: 745 seconds) [03:42] *** bluefoo_ has quit IRC (Ping timeout: 492 seconds) [03:58] seems like youtube-dl has some support [04:04] *** bluefoo has joined #archiveteam-ot [04:34] *** m007a83 has joined #archiveteam-ot [04:53] *** ivan_ has quit IRC (Quit: Leaving) [04:54] *** ivan_ has joined #archiveteam-ot [04:55] *** Fusl____ sets mode: +o ivan_ [04:55] *** Fusl sets mode: +o ivan_ [04:55] *** Fusl_ sets mode: +o ivan_ [04:58] *** ivan- has joined #archiveteam-ot [04:58] *** Fusl____ sets mode: +o ivan- [04:58] *** Fusl sets mode: +o ivan- [04:58] *** Fusl_ sets mode: +o ivan- [05:09] *** ivan_ has quit IRC (Ping timeout: 745 seconds) [05:10] *** ivan- is now known as ivan_ [05:19] *** bluefoo has quit IRC (Read error: Operation timed out) [05:28] *** kode54 has quit IRC (Quit: The Lounge - https://thelounge.chat) [05:32] *** killsushi has joined #archiveteam-ot [05:38] *** kode54 has joined #archiveteam-ot [05:57] Last time I tried it on Twitch it had problems if the audio is muted at the start of the stream [06:06] *** fuzzy8021 has quit IRC (Read error: Connection reset by peer) [06:08] *** fuzzy8021 has joined #archiveteam-ot [07:01] https://news.ycombinator.com/item?id=21052952 [07:01] 600k Images Removed from ImageNet After Art Project Exposes Racist Bias | Hacker News [07:02] https://hyperallergic.com/518822/600000-images-removed-from-ai-database-after-art-project-exposes-racist-bias/ [07:23] *** eythian has quit IRC (Ping timeout: 246 seconds) [07:33] *** bluefoo has joined #archiveteam-ot [07:47] *** eythian has joined #archiveteam-ot [08:10] *** deevious has quit IRC (Quit: deevious) [09:46] *** deevious has joined #archiveteam-ot [10:11] *** killsushi has quit IRC (Quit: Leaving) [10:13] lol @ Thomas Cook's last blog post (which they appear to have published AFTER going into liquidation): [10:13] How to Make the Most of Your Family Holiday [10:14] I think the first piece of advice should be "book with someone else, we aren't operating any more" [10:32] *** kiska18 has quit IRC (Remote host closed the connection) [10:32] *** kiska18 has joined #archiveteam-ot [10:32] *** Fusl____ sets mode: +o kiska18 [10:32] *** Fusl sets mode: +o kiska18 [10:32] *** Fusl_ sets mode: +o kiska18 [11:01] *** icedice has joined #archiveteam-ot [11:05] My SSD almost grinds down to a halt sometimes when I try to enter a folder. Everything looks good in CrystalDiskInfo except for ECC Error Rate which is at 200. CrystalDiskInfo still says that the health status is "Good" and "100%" though. The SSD is 1TB and I have 18.8GB free currently (it was down to 16.5GB for a short while before though). Is my SSD going to fail? [11:07] icedice: you don't have enough free space, get it to 10% and defragment [11:08] I thought I wasn't supposed to defragment SSDs [11:08] That defragmentation is only for HDDs [11:08] is this NTFS? [11:08] Yeah [11:09] Did I permanently damage it? [11:09] yeah you might have severe MFT fragmentation [11:09] ECC Error Rate doesn't necessarily mean much [11:09] Will it fail? [11:09] SSDs can fail suddenly and completely at any time with no warning [11:10] Was the MFT fragmentation message an answer to my question about permanent damage? [11:10] https://docs.microsoft.com/en-us/windows-server/administration/windows-commands/defrag [11:10] defrag /A C: or whichever drive it is will give you a report [11:10] icedice: no [11:11] Thank good [11:11] * God [11:11] And thanks [11:22] Does defrag /A C start the defrag or just give a report? [11:23] just a report [11:25] Ok [11:26] "The given volume path is invalid. (0x89000001)" [11:26] It is the C drive though [11:28] Gotta go [11:28] *** icedice has quit IRC (Quit: Leaving) [11:38] someone tell icedice C: is not C [12:13] *** ShellyRol has quit IRC (Read error: Operation timed out) [12:19] *** ShellyRol has joined #archiveteam-ot [13:50] *** second has quit IRC (Remote host closed the connection) [14:03] *** icedice has joined #archiveteam-ot [14:03] Back [14:13] <@ivan_> someone tell icedice C: is not C [14:14] Never hurts to buy a second SSD. I've been having very good success with the Samsung EVO line. [14:14] Ah [14:15] I have a Samsung EVO [14:15] Their great [14:15] I'm just bad at everything I attempt and do [14:15] * They're [14:15] What version of windows are you using [14:16] Moving files to my external HDD now [14:16] Windows 10 [14:16] It hasn't warned me that I was low on space though [14:16] I found Windows 7 automatically did all the things it needed from the get go. So I assume Windows 10 would too [14:16] But I think that's usually when it gets below 10 GB [14:16] And at that point it would probably be too late [14:17] Windows 10 has TRIM [14:17] I'll see if I can activate that manually without having to wait for the automatic monthly TRIM [14:17] truth be told, you'll get the most life out of it by keeping it 10 to 50% empty for greater wearleveling distribution. [14:17] I'll go with 20% [14:17] if you're going to let media sit, just stick it on a platter drive [14:18] It was all good until I got into Avatar: The Last Airbender and The Legend of Korra [14:18] *** icedice has quit IRC (Read error: Connection reset by peer) [14:26] *** icedice has joined #archiveteam-ot [14:27] You can't have enough of these. https://www.amazon.com/dp/B0713WPGLL/ [14:27] I've probably shortened my SSD's life span by quite a bit, huh? [14:27] i doubt it. [14:27] Everything I touch I break [14:27] Oh [14:27] Ok [14:28] That's nice [14:28] just do a check disk under properties > tools [14:28] *** yano_ is now known as yano [14:30] *** DogsRNice has joined #archiveteam-ot [14:30] i personally don't like storing much on SSD and mainly use it for OS and downloads in progress (for speed) and editing (for speed). then files find their final resting place on external platter drives [14:31] but once platter drives go away and the price is right, everything will be ssd [14:31] 250 GB free now [14:32] 27.5% free now [14:32] nice [14:33] Yeah [14:33] And that's just from moving Avatar: The Last Airbender and The Legend of Korra over [14:35] Try these file management programs. Everything (by voidtools), SpaceSniffer, and WizTree. Also FastCopy and dupeGuru and HashCheck Shell Extension (github release) [14:35] I have HashCheck already [14:35] It's pretty nice [14:36] I keep a .sha512 of every file over 500 megs [14:36] in addition to media collections [14:37] so at least you don't have to worry whether you corrupted everything with bitflips everywhere [14:38] I have FreeFileSync [14:38] So it will notice if there are any changes to the files when it compares between my external backup and my internal desktop HDD [14:39] I have this btw: [14:39] https://www.amazon.com/Glyph-BlackBox-BBPR6000-External-Drive/dp/B00Z14R5VM [14:39] https://www.glyphtech.com/product/blackbox-pro [14:39] Hmm [14:40] dupeGuru is a bit like Duplicate Cleaner, I think [14:40] "Windows successfully scanned the drive. No errors were found." [14:44] Any suggestions on a good 10+ TB HDD? [14:45] I met a guy online who has a 10TB cartoon collection in my native language [14:45] Which is pretty rare [14:45] A lot of stuff that was thought to be extinct but people found on recorded VHS tapes [14:48] *** kiska18 has quit IRC (Read error: Operation timed out) [14:49] *** kiska18 has joined #archiveteam-ot [14:49] *** Fusl____ sets mode: +o kiska18 [14:49] *** Fusl sets mode: +o kiska18 [14:50] *** Fusl_ sets mode: +o kiska18 [15:02] *** BlueMax has quit IRC (Quit: Leaving) [15:06] anyone got access to https://www.stan.com.au/watch/spaced ? if so, i would be super grateful for a 1:1 resolution screenshot of any episode [15:16] schibirid2: https://old.reddit.com/r/australia/ and #redditaustralia on Freenode are probably better places to ask [15:24] I would help you, but my stan subscription expired months ago and there isn't anything there to watch for $17/mo [15:33] *** godane has joined #archiveteam-ot [15:45] are you trying to watch it or archive it? [15:47] it plays / is included in US Amazon Prime Video [15:48] https://www.amazon.com/Art/dp/B07JJ98ZV5 [15:52] *** Stilett0 is now known as Stiletto [15:53] i want to evaluate the quality compared to my dvd before i "archive" it [16:07] do you think the stan version is different than other stream versions? [16:08] https://www.imdb.com/title/tt0187664/episodes?season=1 [16:14] no idea [16:22] the imdb one says it's free with account creation [16:29] i dont see that [16:33] https://www.awesomescreenshot.com/image/4250007/3a4b4d8cecce9c11c1b79e86fc95cd69 [16:33] what country is your IP in ? [16:36] huh, neat [16:36] de [16:37] but i doubt imdb would have the best quality available or would it? [16:40] hm, well that's what comparisons are for. Wikipedia said that show was original released on Channel 4 at 576i[50] [16:42] and that there were DVD releases in the UK and US [16:48] https://www.channel4.com/programmes/spaced free for UK IP's, and original channel [16:49] Volume Information: [16:49] Volume size = 930,97 GB [16:49] Free space = 393,25 GB [16:49] Total fragmented space = 22% [16:49] Largest free space size = 5,40 GB [16:49] Note: File fragments larger than 64MB are not included in the fragmentation statistics. [16:49] It is recommended that you defragment this volume. [16:50] So, if I use Properties > Tools > Optimize, will that brick my SSD? [17:11] *** icedice2 has joined #archiveteam-ot [17:13] *** icedice has quit IRC (Ping timeout: 252 seconds) [17:27] TRIM did nothing [17:27] defrag /A C: still says the exact same thing [17:35] TRIM just discards unused space. It does not defragment your drive. [17:47] Ok [17:47] I'm a bit afraid that I'll brick the SSD if I defragment it [17:50] *** MaximeleG has joined #archiveteam-ot [17:55] *** icedice2 has quit IRC (Quit: Leaving) [17:56] *** MaximeleG has quit IRC (Quit: MaximeleG) [17:56] *** icedice has joined #archiveteam-ot [17:58] isn't TRIM more important than defrag? [18:07] *** Leslie has left [18:10] https://www.hanselman.com/blog/TheRealAndCompleteStoryDoesWindowsDefragmentYourSSD.aspx [18:20] *** bluefoo has quit IRC (Ping timeout: 246 seconds) [18:20] its pretty much impossible to brick a ssd by IO unless you stress it in extremely artifical conditions [18:21] any storage can die in an instant, that's just life [18:52] icedice, defragmentation may help a little bit by putting files in contiguous space. Flash drives have very large cluster sizes that have to be deleted and re-written as a group. [18:52] It would speed up writes, not reeds. [18:53] *reads [18:55] Regular defragmentation is not recommened becuase it si a textbok case of write amplification (and flash memory cells have limited write cycles). [18:55] "Storage Optimizer will defrag an SSD once a month if volume snapshots are enabled." [18:55] ^ Yeah, I just found that via StackExchange [18:56] Looks like I might not be shit out of luck after all [18:56] It would be nice if I could see whem the next time Storage Optimizer will run is though [18:59] No option to force it to run immediately? [19:11] I don't know [19:11] i mean, what's one write amplification out of the mean time to failure rating of 1 million writes [19:11] Sounds like it's scheduled only, but I'm not sure [19:11] defrag to your heart's content. you won't be breathing long enough for the drive to die [19:12] Yeah, but does regular defrags even work on SSDs? [19:12] it's just not very as useful to do so, but i wouldn't call it deadly [19:12] Did you say something? [19:12] no [19:12] My NVMe drive would like a word: Data Units Written: 475,361,288 [243 TB] [19:13] *** bluefoo has joined #archiveteam-ot [19:14] wouldn't a typical defrag program effectively force data to reshuffle to the least worn (wearleveling) physical sectors [19:15] I just didn't tell you which server its running on. Its an EX42-NVMe from hetzner [20:11] *** icedice has quit IRC (Ping timeout: 252 seconds) [20:46] *** schbirid2 has quit IRC (Remote host closed the connection) [20:59] 4.8 TB written each here on the SSDs of my hirola AB pipeline which has only been in operation for a month. [20:59] AB is quite a good way to shred disks. [21:00] Since every response is written to disk three times etc. [21:00] Or being an rsync target is also a good way as well [21:01] Especially during large projects like #googleminus [21:01] Yeah [21:01] Or even the smaller ones like #sketchedout [21:01] The AB pipeline obviously doesn't get anywhere near saturating a Gbit link. [21:02] Or another project *wink* *wink* [21:03] while :; do dd if=/dev/zero of=file bs=16M; rm file; done [21:11] why would you write each response to the disk 3 times? [21:12] Because that's how wpull does it and nobody bothered to fix it so far. [21:12] It might actually be something between three and four times, I'm not entirely sure. [21:12] isn't that wat caching is for? allow the dumb program to do dumb things, and then flush the final outcome to etch [21:15] So wpull fetches something. The response body is written to disk once for later processing. The full data sent exactly by the server, i.e. with headers and transfer encoding intact, is also written to a temporary file so that it can later be written to the WARC file (after compression, so that's a partial write in terms of data size). And I think it might write the entire body to disk again to a file [21:15] which is deleted again immediately. [21:16] I'm sure about the first three writes (or 2.5 or something depending on the compression ratio). [21:17] I think it should simply write it to disk once with all encoding intact etc., then read it again from that file and processing the transfer encoding etc. on the fly. [21:17] what sizes are we talking about [21:18] Uh, whatever the HTTP server sends back? [21:22] just imagining whether it'd make more sense for it to strictly use RAM for all that [21:22] the way a normal web browser does [21:23] That could work for small responses, but there's no limit on what a server can send back. [21:23] And if you download a file with a browser, it's not kept in memory either. [21:23] sure, but then you have a separate job for memory management and such [21:23] you get what i mean i hope [21:24] Another problem is that you might not know the response size in the beginning if the server uses chunked TE. So then you'd have to flush everything to disk if a limit is exceeded etc. [21:24] but i guess that's why ramdisk exists [21:25] Keeping everything in memory is fine if you know what you're fetching. I do that in qwarc. [21:25] But if you have a tool for the general case of retrieving any URL, it needs to be able to handle large stuff. [21:25] We've had AB pipelines run out of disk space due to big files many, many times. [21:26] Imagine how much worse that would've been if everything was kept in RAM... [22:13] the kernel might not be writing it to disk though it depends on the fs and sync patterns [22:14] Maybe not for small files, yes, but large ones for sure. [22:15] There’s so need to ‘well, actually’ your way out of accepting that things could be improved [22:19] for files that are written and then soon deleted you might be able to reduce write load with [22:19] "vm.dirty_expire_centisecs" = 7 * 60 * 100; [22:19] "vm.dirty_writeback_centisecs" = 60 * 100; [22:21] *** katocala has quit IRC () [22:25] *** katocala has joined #archiveteam-ot [22:47] *** killsushi has joined #archiveteam-ot [23:20] *** BlueMax has joined #archiveteam-ot [23:51] it bugs me a bit, but lots of businesses were recycled [23:51] Instagram, Facebook, slack [23:53] recycled? [23:55] before facebook there was myspace, friendster [23:56] right. but it sounded like you said instagram, facebook and slack were recycled [23:56] I think what you mean is the concepts have gone stale [23:57] I mean they were tried before and failed [23:57] before instagram there was moblogging [23:57] businesses models fail, technologies not necessarily. it's difficult to monitize something that seems like it should be free [23:57] they're all built on the premise that email has been free since at least 1997 [23:58] or rather, built against that cultural truth [23:59] the only thing keeping most of these businesses afloat on their free platforms, are other businesses who are still accustomed to practices like paying for advertisment