#archiveteam-ot 2019-08-20,Tue

↑back Search

Time Nickname Message
00:40 🔗 justas1 has joined #archiveteam-ot
00:46 🔗 justas has quit IRC (Ping timeout: 745 seconds)
00:56 🔗 JAA Lovely: a subtle behaviour of Tornado's IOLoop seems to have hidden a crash in ArchiveBot for several years. (For those interested: the pipeline reporting doesn't start immediately because the IOLoop doesn't run until the pipeline has attempted to get jobs from the control node. Although the callback for this is launched immediately, it doesn't run until the synchronous code lets the IOLoop run, or
00:56 🔗 JAA something like that (didn't analyse it in detail, just assume it's similar to async/await). This is what caused new pipelines to show up as "(anonymous)" for a while. Among other things, the monitoring code checks the free disk space in the data directory. But that directory doesn't exist yet until the first item gets processed in seesaw. So the only reason this monitoring stuff didn't crash and burn
00:56 🔗 JAA is because it indirectly delayed its execution until after the seesaw pipeline started running. And I only noticed this because I wanted to make it report to the control node immediately.)
01:03 🔗 Raccoon Impatience begets patients
02:02 🔗 Verified_ has quit IRC (Ping timeout: 252 seconds)
02:05 🔗 Verified_ has joined #archiveteam-ot
03:17 🔗 DogsRNice has quit IRC (Read error: Connection reset by peer)
04:00 🔗 lunik1 has quit IRC (:x)
04:02 🔗 lunik1 has joined #archiveteam-ot
04:34 🔗 nataraj has joined #archiveteam-ot
05:28 🔗 drcd_ has joined #archiveteam-ot
05:29 🔗 drcd has quit IRC (Ping timeout: 186 seconds)
05:29 🔗 drcd_ is now known as drcd
06:18 🔗 systwi has quit IRC (Read error: Connection reset by peer)
06:19 🔗 ShellyRol has quit IRC (ircd.choopa.net irc.mzima.net)
06:19 🔗 Fusl__ has quit IRC (ircd.choopa.net irc.mzima.net)
06:19 🔗 nyany_ has quit IRC (ircd.choopa.net irc.mzima.net)
06:19 🔗 nyany has quit IRC (ircd.choopa.net irc.mzima.net)
06:19 🔗 h3ndr1k has quit IRC (ircd.choopa.net irc.mzima.net)
06:19 🔗 eientei95 has quit IRC (ircd.choopa.net irc.mzima.net)
06:19 🔗 svchfoo3 has quit IRC (ircd.choopa.net irc.mzima.net)
06:20 🔗 systwi has joined #archiveteam-ot
06:22 🔗 Verified_ has quit IRC (Ping timeout: 252 seconds)
06:25 🔗 ShellyRol has joined #archiveteam-ot
06:25 🔗 Fusl__ has joined #archiveteam-ot
06:25 🔗 nyany_ has joined #archiveteam-ot
06:25 🔗 nyany has joined #archiveteam-ot
06:25 🔗 h3ndr1k has joined #archiveteam-ot
06:25 🔗 eientei95 has joined #archiveteam-ot
06:25 🔗 svchfoo3 has joined #archiveteam-ot
06:25 🔗 irc.mzima.net sets mode: +oo Fusl__ svchfoo3
06:25 🔗 Fusl sets mode: +o Fusl__
06:26 🔗 Fusl sets mode: +o svchfoo3
06:26 🔗 Fusl_ sets mode: +o Fusl__
06:26 🔗 Fusl_ sets mode: +o svchfoo3
06:27 🔗 Fusl__ sets mode: +o kiska
06:27 🔗 Fusl__ sets mode: +o ivan_
06:27 🔗 Fusl__ sets mode: +o kiska1
06:27 🔗 Fusl__ sets mode: +o HCross
06:27 🔗 Fusl__ sets mode: +o hook54321
06:27 🔗 Fusl__ sets mode: +o Kaz
06:27 🔗 Fusl__ sets mode: +o Fusl_
06:27 🔗 Fusl__ sets mode: +o SketchCow
06:27 🔗 Fusl__ sets mode: +o AlsoJAA
06:27 🔗 Fusl__ sets mode: +o Fusl
06:27 🔗 Fusl__ sets mode: +o Kenshin
06:27 🔗 Fusl__ sets mode: +o dxrt
06:28 🔗 Fusl__ sets mode: +o kiskabak
06:28 🔗 Fusl__ sets mode: +o JAA
06:28 🔗 Fusl__ sets mode: +o jrwr
06:28 🔗 Fusl__ sets mode: +o astrid
06:28 🔗 Fusl__ sets mode: +o chfoo
06:28 🔗 Fusl__ sets mode: +o svchfoo1
06:35 🔗 benjinsmi has joined #archiveteam-ot
06:37 🔗 benjins has quit IRC (Ping timeout: 252 seconds)
06:38 🔗 Verified_ has joined #archiveteam-ot
07:10 🔗 bluefoo has quit IRC (Read error: Operation timed out)
08:26 🔗 N4Y has quit IRC (Ping timeout: 745 seconds)
08:27 🔗 N4Y has joined #archiveteam-ot
08:34 🔗 bluefoo has joined #archiveteam-ot
09:07 🔗 Maylay has quit IRC (Ping timeout: 252 seconds)
09:08 🔗 Verified_ has quit IRC (Quit: Quit)
09:10 🔗 Maylay has joined #archiveteam-ot
09:29 🔗 icedice has joined #archiveteam-ot
09:47 🔗 icedice I'm thinking about setting up borgbackup on a cheap Kimsufi (aka OVH) dedicated server
09:47 🔗 icedice Is Kimsufi good enough for that?
10:32 🔗 ivan_ yes but check out hetzner auction if you want more disk or gigabit
10:37 🔗 BlueMax has quit IRC (Quit: Leaving)
10:39 🔗 benjinss has joined #archiveteam-ot
10:41 🔗 benjinsmi has quit IRC (Ping timeout: 252 seconds)
10:49 🔗 bluefoo has quit IRC (Quit: bluefoo)
11:04 🔗 bluefoo has joined #archiveteam-ot
12:07 🔗 kiskabak has quit IRC (Remote host closed the connection)
12:07 🔗 kiskabak has joined #archiveteam-ot
12:08 🔗 Fusl sets mode: +o kiskabak
12:08 🔗 Fusl__ sets mode: +o kiskabak
12:08 🔗 Fusl_ sets mode: +o kiskabak
12:16 🔗 icedice Not too sure about Hetzner
12:17 🔗 icedice Germany seems like a terrible place to store copyrighted content
12:17 🔗 icedice Though I guess full disk encryption could help with that
12:18 🔗 icedice Hmm, they have servers in Finland nowadays as well
12:18 🔗 icedice Tg
12:18 🔗 icedice * though
12:19 🔗 icedice I guess they'd still be somewhat under German jurisdiction
13:20 🔗 asie has joined #archiveteam-ot
13:55 🔗 killsushi has quit IRC (Quit: Leaving)
14:08 🔗 Dallas What content are you storing and where are you getting it from icedice ? Unless you get takedowns hetzner leave you alone, they won't go scanning your disk
14:11 🔗 mike__ has joined #archiveteam-ot
14:11 🔗 Dallas What do you need to do to sign up for an api key mike__ ? Could you sign up for a few ?
14:12 🔗 mike__ Hi all, I posted about this over in the main #archiveteam channel, but I'm trying to spread the word about a project to crowdsource legal data from Harvard Law Library. If you're interested in helping, we have a macOS app you can install or a docker image you can run.
14:12 🔗 mike__ The macOS app is here: https://apps.apple.com/us/app/legal-api-downloader/id1476586208?mt=12. The docker image instructions are here: https://free.law/legal-api-downloader/docker/
14:13 🔗 mike__ The TOS of the API should be your guide, @Dallas: https://case.law/
14:15 🔗 icedice Dallas: Rare manga, cartoons from the 1990's and 2000's, some YouTube stuff, personal documents, nature photos that I've taken myself
14:16 🔗 icedice Not sure if I'll bother with the more real TV series and movies
14:16 🔗 icedice Those are higher risk and not as rare
14:18 🔗 icedice The cartoons are mostly torrented, as with the TV series and movies
14:18 🔗 icedice The manga is mostly DDL or collected by myself
14:19 🔗 icedice The server wouldn't be publically accessible though
14:20 🔗 Dallas I can't obviously advocate piracy, but I can say that aside from torrents all those things will likely be fine for you to do with Hetzner icedice :)
14:20 🔗 Dallas I'll take a look mike__
14:21 🔗 icedice By not having the content easily available via legitimate means the copyright holders are encouraging piracy
14:23 🔗 icedice Most people aren't as financially irresponsible as me that they import boxes filled with 15-20 year old manga magazines from Japan
14:24 🔗 icedice Or as lucky that they found a anime dub on YouTube - that had only been broadcasted on TV and never available on DVD/streaming - before it was taken down
14:25 🔗 asie At least in the anime world, the copyright holders often don't know many older works of theirs exist
14:25 🔗 Dallas I don't disagree haha, I'm just saying for the record piracy is illegal etc. etc. but you won't have a prob on hetzner :D
14:25 🔗 icedice Yeah
14:25 🔗 asie as in, they probably have it in the paperwork, they're just not very aware of them
14:26 🔗 icedice I'd probably go with Kimsufi or Online.net just to be sure
14:26 🔗 icedice Abuse reports sent to them about their baguette boxes go to dev/null
14:27 🔗 asie or "you've HEARD of this? nobody in Japan's heard of this!" kind of stuff
14:28 🔗 icedice A manga artist started leaking her own old works on Twitter after she got tired of the publisher almost never publishing it in volumes and just letting it get lost in magazine land
14:28 🔗 asie Ryuutama's story is fascinating, it's a semi-notable Japanese traditional pen & paper RPG game
14:28 🔗 Dallas I think OVH are pretty ignorant of abuse reports but I'm sure I got one from online at some point, better plan, run any ... 'questionable', traffic through a vpn
14:28 🔗 asie the author had a print run with their publisher, but it ran out, and he got lots of requests to restock it
14:28 🔗 asie the publisher, however, refused to do another print run; the publisher also told him he's forbidden from putting it on competing "print-on-demand" services like Amazon's
14:29 🔗 asie the author then re-read his licensing agreement with the publisher, and ran it through a lawyer; apparently it was constructed in such a way that he only gave up the rights to *sell* the book
14:29 🔗 asie but could legally put it up for free... so he did
14:29 🔗 icedice I and another person have imported all of her magazine exclusive works
14:29 🔗 icedice hmm, or almost
14:29 🔗 icedice Still one series that's not completely collected
14:30 🔗 icedice And a bunch of magazine exclusive color pages that the artist herself has lost
14:30 🔗 icedice I have a bunch of those as well
14:30 🔗 asie Another fascinating licensing story I always think of is when Tokyopop (an English manga licensor) C&D'd Kodansha (the Japanese manga publisher)... and won
14:30 🔗 asie (not in court, just, like, had a valid legal point)
14:30 🔗 icedice Another artist wants to put her manga series in ebooks, but the publisher won't let her
14:33 🔗 icedice Some manga I've managed to get have no record of existance on the Internet
14:33 🔗 icedice So it'll be fun to scan those and surprise everyone
14:33 🔗 asie I have a Polish manga/anime fanzine whose sole record of existence on the Internet is a review on an equally obscure Polish website
14:38 🔗 DogsRNice has joined #archiveteam-ot
14:39 🔗 icedice The manga I have was sold in connection with a tournament two decades ago
14:44 🔗 icedice I and a few friends started a project where we imported English 41 manga volumes from a Southeast Asian publisher that went out of business years ago
14:45 🔗 icedice We have a scanner with an automatic document feeder
14:45 🔗 icedice https://www.youtube.com/watch?v=aaBLcgpXHZ4
14:46 🔗 icedice ^ This is an ADF scanner in action
14:46 🔗 icedice Pretty excited about that
14:46 🔗 icedice Had to import them from a secondhand marketplace via a forwarder
15:02 🔗 JAA icedice: Noone in Germany (or anywhere else really) will care about your stored data as long as you don't distribute it. At least I've never heard of anything like that. Also, Borg has encryption anyway.
15:13 🔗 icedice I guess upload filters won't apply to hosting providers and only to platforms
15:29 🔗 joepie91 has joined #archiveteam-ot
15:30 🔗 nataraj has quit IRC (Read error: Operation timed out)
15:46 🔗 bluefoo has quit IRC (Read error: Operation timed out)
16:32 🔗 icedice has quit IRC (Leaving)
17:13 🔗 mike__ has quit IRC (Quit: Page closed)
17:14 🔗 Arkiver2 has joined #archiveteam-ot
17:14 🔗 Fusl sets mode: +o Arkiver2
17:14 🔗 Fusl__ sets mode: +o Arkiver2
17:14 🔗 Fusl_ sets mode: +o Arkiver2
17:18 🔗 Arkiver2 has quit IRC (Connection closed)
17:18 🔗 arkiver has joined #archiveteam-ot
17:18 🔗 Fusl__ sets mode: +o arkiver
17:18 🔗 Fusl sets mode: +o arkiver
17:18 🔗 Fusl_ sets mode: +o arkiver
17:45 🔗 Dragnog94 has quit IRC (The Lounge - https://thelounge.chat)
17:48 🔗 Dragnog94 has joined #archiveteam-ot
17:49 🔗 asdf0101 has quit IRC (Read error: Operation timed out)
17:49 🔗 markedL has quit IRC (Read error: Operation timed out)
18:52 🔗 ola_norsk has joined #archiveteam-ot
18:53 🔗 ola_norsk has quit IRC (leaving)
19:34 🔗 nataraj has joined #archiveteam-ot
19:46 🔗 JAA I just saw for the first time how seesaw executes a pipeline, and I'm disgusted.
19:46 🔗 JAA It reads the file contents and then exec()s the string.
19:47 🔗 Igloo Aer you really?
19:47 🔗 Igloo *are
19:47 🔗 Igloo I mean, There are worse things it could do.
19:47 🔗 Igloo And does :P
19:49 🔗 JAA I can be disgusted at multiple things. :-P
20:13 🔗 antomatic has joined #archiveteam-ot
20:19 🔗 ShellyRol has quit IRC (Read error: Operation timed out)
20:20 🔗 ShellyRol has joined #archiveteam-ot
20:54 🔗 nataraj has quit IRC (Read error: Operation timed out)
21:16 🔗 schbirid my "merge millions (hundreds of thousands rather i think) of files into a deeply nested directory tree" finished in a day using a list from find as input for mv
21:16 🔗 schbirid instead of ETA of 1 week with rsync or mc
21:16 🔗 schbirid \o/
22:34 🔗 BlueMax has joined #archiveteam-ot
23:38 🔗 markedL has joined #archiveteam-ot
23:38 🔗 asdf0101 has joined #archiveteam-ot
23:45 🔗 kiska1 has quit IRC (Remote host closed the connection)
23:45 🔗 kiska1 has joined #archiveteam-ot
23:45 🔗 Fusl__ sets mode: +o kiska1
23:46 🔗 Fusl sets mode: +o kiska1
23:46 🔗 Fusl_ sets mode: +o kiska1

irclogger-viewer