[00:06] *** betamax_o has quit IRC (leaving)
[00:23] *** Dragnog2 has quit IRC (Quit: Connection closed for inactivity)
[00:52] germany's media markt is selling 8tb mybooks for 125,10 eur. (139 without ebay coupon)
[01:06] seems better than US Amazon prices by 10 eur
[01:53] *** tuluu has quit IRC (Ping timeout: 258 seconds)
[01:55] *** mls has quit IRC (Ping timeout: 258 seconds)
[01:57] *** anarcat has joined #archiveteam-ot
[01:57] *** anarcat has quit IRC (Handshake flooding)
[01:58] *** anarchat has quit IRC (Ping timeout: 258 seconds)
[01:58] *** tuluu has joined #archiveteam-ot
[02:02] *** anarcat has joined #archiveteam-ot
[02:02] *** anarcat has quit IRC (Handshake flooding)
[02:07] *** anarcat has joined #archiveteam-ot
[02:10] *** Laverne_ has quit IRC (se.hub efnet.portlane.se)
[02:10] *** Shen has quit IRC (se.hub efnet.portlane.se)
[02:17] *** manjaro-u has quit IRC (Read error: Operation timed out)
[02:48] *** mls has joined #archiveteam-ot
[02:48] *** Laverne_ has joined #archiveteam-ot
[02:48] *** Shen has joined #archiveteam-ot
[04:37] *** X-Scale` has joined #archiveteam-ot
[04:39] *** X-Scale has quit IRC (Ping timeout: 252 seconds)
[04:39] *** X-Scale` is now known as X-Scale
[04:46] *** qw3rty2 has joined #archiveteam-ot
[04:49] *** odemg has quit IRC (Ping timeout: 745 seconds)
[04:52] *** IAmbience has quit IRC (Quit: Connection closed for inactivity)
[04:53] *** X-Scale` has joined #archiveteam-ot
[04:54] *** odemg has joined #archiveteam-ot
[04:54] *** qw3rty has quit IRC (Ping timeout: 745 seconds)
[04:54] *** X-Scale has quit IRC (Ping timeout: 252 seconds)
[04:54] *** X-Scale` is now known as X-Scale
[05:10] *** ugh has joined #archiveteam-ot
[05:13] *** legoktm has quit IRC (Read error: Operation timed out)
[05:44] *** asdf0101 has quit IRC (The Lounge - https://thelounge.chat)
[05:44] *** markedL has quit IRC (Quit: The Lounge - https://thelounge.chat)
[05:45] *** markedL has joined #archiveteam-ot
[05:45] *** asdf0101 has joined #archiveteam-ot
[06:30] Does anyone want 4096 samples of recaptcha's audio captcha from 2012? it sounds like drunk people saying random words, with background chatter.
[06:31] it's about 260 mb
[06:36] or 37 hours worth of pretty obnoxious random audio
[07:27] https://torrentfreak.com/backing-up-doomed-radio-station-online-archives-was-mass-piracy-191028/
[07:40] *** BlueMax has quit IRC (Quit: Leaving)
[07:46] Raccoon: Upload to IA!
[08:33] h3ndr1k, yeah, but then you would be stuck with a macbook. https://petapixel.com/2019/06/11/this-is-why-macs-are-slower-than-pcs/ (TL;DW: only apple's iphone doesn't thermal throttle)
[08:34] * phillipsj needs sleep, possibly.
[08:57] *** kiska18 has quit IRC (Remote host closed the connection)
[08:57] *** Ryz has quit IRC (Remote host closed the connection)
[08:57] *** kiska18 has joined #archiveteam-ot
[08:57] *** Fusl sets mode: +o kiska18
[08:57] *** Fusl__ sets mode: +o kiska18
[08:57] *** Fusl_ sets mode: +o kiska18
[08:57] *** Ryz has joined #archiveteam-ot
[08:58] *** schbirid has joined #archiveteam-ot
[09:36] phillipsj: I meant Western Digital MyBooks which are external harddrives :P
[09:37] A Macbook for 125€ would be nice though...
[10:30] Was wondering why the price was only triple digit.
[10:31] Yeah, I definitely misread your comment.
[10:50] *** IAmbience has joined #archiveteam-ot
[13:13] *** tuluu has quit IRC (Remote host closed the connection)
[13:14] *** tuluu has joined #archiveteam-ot
[13:25] *** vitzli has joined #archiveteam-ot
[15:01] *** vitzli has quit IRC (Quit: Leaving)
[15:27] *** bluefoo has quit IRC (Remote host closed the connection)
[15:30] *** bluefoo has joined #archiveteam-ot
[15:52] *** killsushi has quit IRC (Quit: Leaving)
[16:40] *** lunik19 is now known as lunik1
[17:07] *** Hani has joined #archiveteam-ot
[17:42] *** DogsRNice has joined #archiveteam-ot
[17:48] *** icedice has joined #archiveteam-ot
[18:26] *** kiska18 has quit IRC (Read error: Operation timed out)
[18:27] *** Ryz has quit IRC (Quit: Ping timeout (120 seconds))
[18:28] *** Ryz has joined #archiveteam-ot
[18:33] *** Ryz has quit IRC (Quit: Ping timeout (120 seconds))
[18:33] *** Ryz has joined #archiveteam-ot
[18:56] *** Ivy has quit IRC (Quit: Connection closed for inactivity)
[21:39] so i got my new 5TB seagate usb drive
[21:39] my WD arrives tomorrow
[21:40] move some of the anime i have watched to it
[21:40] already formatted it to EXT4
[21:41] If Windows 10 would support EXT4, I might switch my main desktop OS. Dumb microsoft.
[21:42] i have a fear of one of my hard drives going like one of my usb sticks that i mounted on windows a few years ago
[21:43] the label went weird and it was stuck as read-only
[21:43] but you couldn't copy the files off if
[21:43] *it
[21:43] do you keep redundant copies? if not for big media, then definitely keep SHA checksums
[21:44] i do keep some copies of stuff
[21:44] on multiple drives
[21:44] same, but i can't afford to keep the hefty stuff. i just share with friends and hope I never have to use them as a backup
[21:45] i have like 400gb of anime i can copy to this new drive to free my main 8tb drive
[21:45] would be cool to create some touch-n-go sync with a traveling harddrive to update all my offsite offline friend/family copies, without doing a full disk image
[21:46] *** manjaro-u has joined #archiveteam-ot
[21:46] gets tricky when renaming and moving folders. you end up copying an entire folder again when you could have just renamed it on the target.
[21:47] agree
[21:48] a super advanced journaling filesystem could be invented. EXT5
[21:48] git/svn backend
[21:49] Sounds like awful performance.
[21:49] *** icedice has quit IRC (Quit: Leaving)
[21:50] JAA: keeping a log of "renamed {jXgDwNKUrrhsKATtCFrqJ} from ''The Family (1913)'' to ''Family, The (1913)''" doesn't seem awful
[21:51] No, but the performance of a git-based backend would be.
[21:51] *** schbirid has quit IRC (Quit: Leaving)
[21:51] i don't mean with internet queries
[21:51] just versioning log
[21:52] i think Raccoon wants a git filelist log
[21:52] not a git full file diff log
[21:52] merges only take place when you rub two harddisks together
[21:53] i will say this new hard drive is a lot faster i think
[21:53] i'm getting 100mb+ per second transfer
[21:53] godane: even for that, selective full file diff could exist for anticipated filetypes and explicitly flagged files
[21:54] otherwise, a simple SHA is recorded
[21:54] ok
[21:55] a running SHA is computed block by block during file write, and then finalized when the file is closed
[21:55] no more bit flips
[21:55] *no more undetected bitflip ignorance
[22:27] *** BlueMax has joined #archiveteam-ot
[22:35] there are file systems that can detect bit flips, at least on Linux
[22:35] Raccoon: maybe you want git-annex
[22:36] ivan: yes, JAA mentioned it last night. looking it over
[22:37] also the rsync rename issue is annoying but I'm not sure it's bad enough to eliminate rsync as a solution
[22:37] pick good directory structures and filenames in advance and treat them as mostly-immutable
[22:38] rename big changes on the target in advance on the rare occasion that you need to
[22:39] ivan: yeah. it's tough. I need to invent or adopt a metadata tagging system. I tend/always leave filenames alone and immutable, but use their folder names for tagging and pretty human-readable access
[22:40] need to strike that balance between maintenance and lay use
[22:40] https://github.com/search?q=rsync%20rename+fork:true&ref=simplesearch&type=Repositories has some odd stuff
[22:42] https://git-annex.branchable.com/tips/metadata_driven_views/
[22:56] I'm looking at writing an extension for Voidtools Everything for Windows, which keeps a file index and monitors for file changes
[22:57] I can start writing out stand-alone metadata/journal entries for the contents of different folders and collection sets
[22:58] dump ffprobe info to the same metadata file
[22:59] SHA checksums, torrent magnet link, etc
[23:41] You might also be interested in ffmpeg's framemd5 "codec".
[23:43] That dumps an MD5 hash of each frame's raw data stream (or something like that), so it can be used to tell whether a corruption is in the metadata or in the data, and in the latter case which frame it is in.
[23:43] could be useful for locating dupes
[23:43] I recently came across that while trying to verify that a combination of options actually produced a lossless encode.
[23:43] does it work for a variety of video and audio encoding schemes?
[23:44] Yes, it's codec-independent. But obviously it'll only produce matching hashes for lossless encodes.
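Editor's sketch of the framemd5 comparison described above: a minimal Python example, assuming each file's hashes were dumped beforehand with something like `ffmpeg -i input.mkv -f framemd5 out.framemd5`, and assuming the output uses `#` comment lines followed by comma-separated rows whose first field is the stream index and whose last field is the frame hash. The function names `parse_framemd5` and `first_mismatch` are hypothetical, made up for this illustration.

```python
def parse_framemd5(text):
    """Return [(stream_index, frame_hash), ...] from framemd5 output text.

    Assumption: '#' lines are headers; data rows are comma-separated with
    the stream index first and the MD5 hash last.
    """
    frames = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        fields = [field.strip() for field in line.split(",")]
        frames.append((int(fields[0]), fields[-1]))
    return frames


def first_mismatch(text_a, text_b):
    """Return the index of the first differing data row, or None if identical.

    The index counts rows across all streams, in file order. If one dump is
    a prefix of the other (e.g. a truncated file), the length of the shorter
    dump is returned.
    """
    rows_a = parse_framemd5(text_a)
    rows_b = parse_framemd5(text_b)
    for index, (row_a, row_b) in enumerate(zip(rows_a, rows_b)):
        if row_a != row_b:
            return index
    if len(rows_a) != len(rows_b):
        return min(len(rows_a), len(rows_b))
    return None
```

A result of `None` suggests the decoded frames match, so any difference between the two files would be in container metadata rather than in the frame data.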
[23:44] well, lossless encodes, or lossy encodes that have been corrupted, truncated, or lossless-cropped
[23:45] ie, half of the mp3s you'd find on gnutella/edonkey/kazaa etc
[23:46] Well yeah, I meant regarding finding dupes among different encodes, only works if those are lossless.
[23:46] true
[23:48] I've gone over pretty much all mp3s with mp3val, mp3checker and audiotester, eliminating anything that identified as truncated or otherwise corrupt, that's not truly unique.
[23:49] i need to try ffmpeg as a validator, but it's not so straightforward. rather some read/encode-to-null nonsense to identify corrupt frames, that doesn't seem very efficient at its job
[23:50] hoping to bribe somebody into adding a -validate flag function
[23:52] fun trick. before deleting files that look corrupt, validate them through 7zip to see if it finds an appended archive
[23:54] *** phirephly has quit IRC (Read error: Operation timed out)
[23:54] not only did i find .mp3s that turned out to be .mp3.zip collections/albums, but also found dumb things like zipped keygens, nfos, porn
[23:56] so these are not mp3s at all, just zips renamed to .mp3?
[23:56] yeah. many players like winamp and vlc, probably mpc, will play zipped up mp3s, even if it's named .mp3. sometimes they're named .zab (for zipped audiobook).
[23:58] and if you append a zip or rar to the end of almost any media file format -- images, audio, video -- the player will play them fine and ignore the zip at the end. and pretty much every archive software -- winzip, winrar, 7zip -- will scan an entire file for an embedded zip/rar container that starts anywhere.
[23:59] if you're clever, you can create a list of which image hosting websites don't strip out embedded zips; preserving / hosting them for you.
[23:59] or mp3 hosting websites
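Editor's sketch of the "check for an appended archive before deleting" trick: a minimal Python example using only the standard `zipfile` module in place of 7zip. It covers only ZIP data sitting at the end of a file (not RAR, and not an archive buried in the middle), and the function name `appended_zip_members` is hypothetical.

```python
import zipfile


def appended_zip_members(path):
    """Return the member names if `path` ends with a ZIP archive, else [].

    zipfile looks for the end-of-central-directory record near the end of
    the file, so leading non-ZIP data (such as MP3 frames) is tolerated.
    A file that is simply a renamed .zip is reported the same way.
    """
    if not zipfile.is_zipfile(path):
        return []
    with zipfile.ZipFile(path) as archive:
        return archive.namelist()
```

An empty list means no trailing ZIP was found; it does not prove the file is a healthy media file, so a separate validator is still needed for that.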