#archiveteam-ot 2018-12-16,Sun

↑back Search

Time Nickname Message
00:05 🔗 Jens "<redacted> Jason Scott looks like a cross between George R. R. Martin and Hugh Hefner."
00:07 🔗 ivan` is now known as ivan_
01:05 🔗 Kaz anyone know of a tool that I can point to a folder and get a list of every video in it, with associated resolution, bitrate etc? Windows pref, but open to most things
01:15 🔗 Kaz mediainfo appears to be the tool I was looking for
01:23 🔗 trvz has quit IRC ()
01:48 🔗 terorie has quit IRC (Remote host closed the connection)
01:48 🔗 terorie has joined #archiveteam-ot
01:51 🔗 terorie has quit IRC (Remote host closed the connection)
01:52 🔗 terorie has joined #archiveteam-ot
01:57 🔗 terorie has quit IRC (Ping timeout: 268 seconds)
02:05 🔗 VerifiedJ has quit IRC (Quit: Leaving)
02:05 🔗 terorie has joined #archiveteam-ot
02:17 🔗 terorie_ has joined #archiveteam-ot
02:21 🔗 terorie has quit IRC (Ping timeout: 268 seconds)
02:30 🔗 terorie_ has quit IRC (Remote host closed the connection)
02:31 🔗 terorie has joined #archiveteam-ot
02:32 🔗 terorie has quit IRC (Read error: Operation timed out)
03:43 🔗 m007a83_ has joined #archiveteam-ot
03:44 🔗 m007a83 has quit IRC (Ping timeout: 252 seconds)
03:46 🔗 m007a83_ is now known as m007a83
03:48 🔗 boutique has quit IRC (Quit: zzzzz)
03:55 🔗 uberushax has quit IRC (Remote host closed the connection)
04:13 🔗 boutique has joined #archiveteam-ot
04:15 🔗 odemg has quit IRC (Ping timeout: 265 seconds)
04:18 🔗 ubahn_ has joined #archiveteam-ot
04:21 🔗 ubahn has quit IRC (Read error: Operation timed out)
04:25 🔗 wp494 has quit IRC (Ping timeout: 268 seconds)
04:26 🔗 wp494 has joined #archiveteam-ot
04:26 🔗 svchfoo3 sets mode: +o wp494
04:27 🔗 odemg has joined #archiveteam-ot
04:36 🔗 wp494 these DHCP disconnects are getting pretty damn annoying
04:37 🔗 wp494 sets mode: +ooo arkiver godane swebb
04:54 🔗 terorie has joined #archiveteam-ot
04:58 🔗 terorie has quit IRC (Read error: Operation timed out)
05:02 🔗 terorie has joined #archiveteam-ot
05:07 🔗 terorie has quit IRC (Ping timeout: 268 seconds)
05:21 🔗 boutique_ has joined #archiveteam-ot
05:24 🔗 boutique has quit IRC (Ping timeout: 252 seconds)
05:26 🔗 boutique has joined #archiveteam-ot
05:28 🔗 boutique has quit IRC (Read error: Connection reset by peer)
05:28 🔗 boutique has joined #archiveteam-ot
05:29 🔗 boutique_ has quit IRC (Ping timeout: 252 seconds)
05:33 🔗 Stiletto has quit IRC (Ping timeout: 265 seconds)
05:41 🔗 boutique_ has joined #archiveteam-ot
05:45 🔗 boutique has quit IRC (Ping timeout: 252 seconds)
05:45 🔗 voltagex_ where is the line between archiving and data hoarding?
05:47 🔗 ivan_ a data hoarder is more of a person who is trying to fill up their too-many-hard drives with whatever they want
05:47 🔗 ivan_ archiving pays some attention to the general value of the content and has some plan for future accessibility
05:47 🔗 boutique has joined #archiveteam-ot
05:48 🔗 ivan_ I guess the line is blurry in many cases
05:49 🔗 ivan_ Brewster is just the best data hoarder :-)
05:49 🔗 boutique_ has quit IRC (Ping timeout: 252 seconds)
06:02 🔗 eientei95 ivan_: Data hoarding is just making the stuff for digital archaeologists to look through :P
06:03 🔗 voltagex_ Well, my current issue is I need to reduce the stuff I have, and I've got ~100GB of a Tomorrowland livestream that probably shouldn't be lost.
06:03 🔗 ivan_ you can put many petabytes into google drive
06:04 🔗 voltagex_ I was hoping FOS could take it :P
06:06 🔗 ivan_ you can also upload things directly to IA
06:07 🔗 ivan_ https://archive.org/help/abouts3.txt
06:07 🔗 voltagex_ legal grey area I guess
06:07 🔗 voltagex_ not quite as bad as Nintendo but ID&T are a weird company.
06:07 🔗 JAA Email Jason then, I guess.
06:08 🔗 voltagex_ I've got to work out whether this video file is valid :/
06:08 🔗 voltagex_ plays in VLC != accessible in the future
06:08 🔗 voltagex_ MPEG4-TS is an abomination.
06:08 🔗 JAA Eww, yeah.
06:12 🔗 voltagex_ hm, Xbox One plays it, and it's a strangely compliant player.
06:13 🔗 boutique_ has joined #archiveteam-ot
06:13 🔗 JAA There must be some tool which strictly checks whether a video file complies with the specifications, right?
06:16 🔗 boutique has quit IRC (Ping timeout: 252 seconds)
06:16 🔗 voltagex_ possibly.
06:17 🔗 voltagex_ JAA: sigh. https://forum.doom9.org/showthread.php?s=028d37878e073193b81c74c58b06e01d&p=1067204#post1067204
06:18 🔗 JAA I'm not surprised.
06:18 🔗 JAA Also, that thread is from 2007.
06:20 🔗 boutique has joined #archiveteam-ot
06:20 🔗 boutique_ has quit IRC (Ping timeout: 252 seconds)
06:21 🔗 JAA Found a commercial tool: http://www.jongbel.com/automated-validation/media-validator/
06:23 🔗 voltagex_ 149 EUR per month lol
06:27 🔗 voltagex_ props to them for writing their own decoders instead of just using ffmpeg though
06:30 🔗 JAA has quit IRC (leaving)
06:34 🔗 JAA has joined #archiveteam-ot
06:34 🔗 svchfoo3 sets mode: +o JAA
06:35 🔗 bakJAA sets mode: +o JAA
06:40 🔗 JAA voltagex_: So Stack Overflow recommends transcoding it to nothing with ffmpeg. I guess that works and ffmpeg should produce warnings and errors, but I'm not sure how strict it is.
06:41 🔗 voltagex_ JAA: sorry, I didn't mean to take up your time on one of my rabbit holes
06:41 🔗 voltagex_ we're all going to be underwater / on fire or both in the future, so it may not matter.
06:47 🔗 DarkWorld has joined #archiveteam-ot
07:16 🔗 terorie has joined #archiveteam-ot
07:22 🔗 terorie has quit IRC (Ping timeout: 268 seconds)
07:27 🔗 terorie has joined #archiveteam-ot
08:29 🔗 m007a83_ has joined #archiveteam-ot
08:30 🔗 m007a83 has quit IRC (Ping timeout: 252 seconds)
08:34 🔗 m007a83_ is now known as m007a83
10:17 🔗 hook54321 has quit IRC (Quit: Connection closed for inactivity)
10:37 🔗 terorie has quit IRC (Remote host closed the connection)
10:37 🔗 terorie has joined #archiveteam-ot
10:38 🔗 terorie has quit IRC (Client Quit)
10:59 🔗 Stiletto has joined #archiveteam-ot
11:08 🔗 DarkWorld has quit IRC (Leaving)
11:20 🔗 BlueMax has quit IRC (Quit: Leaving)
11:20 🔗 caff_ has quit IRC (Read error: Connection reset by peer)
12:01 🔗 boutique has quit IRC (Quit: Leaving)
12:07 🔗 vitzli has joined #archiveteam-ot
12:15 🔗 VoynichCr JAA: https://github.com/emijrp/internet-archive/blob/master/archivebot.py
12:16 🔗 VoynichCr that is the bot which updates tables in wiki
12:16 🔗 VoynichCr it requires pywikibot (and configured)
12:18 🔗 VoynichCr i can write detailed instructions if needed
12:20 🔗 VoynichCr the scripts for the deaths and disestablishements pages are in the same repo
12:43 🔗 ivan_ do people use pywb for looking inside WARCs or something else?
12:43 🔗 * ivan_ spots https://github.com/webrecorder/webrecorder-player
12:49 🔗 hook54321 has joined #archiveteam-ot
12:49 🔗 svchfoo3 sets mode: +o hook54321
12:52 🔗 HCross ivan_: warcio
12:52 🔗 HCross Because it doesn't need to load the entire warc into disk
12:52 🔗 HCross Which makes working with megawarcs so much nicer
12:53 🔗 ivan_ ah but this person wanted a thing to play them back / browse them
12:53 🔗 ivan_ looks like pywb uses it
12:56 🔗 Mateon1 has quit IRC (Read error: Operation timed out)
12:56 🔗 Mateon1 has joined #archiveteam-ot
13:00 🔗 vitzli has quit IRC (Quit: Leaving)
13:14 🔗 JAA VoynichCr: Sweet, thanks, I'll have a look. I did look at pywikibot, but mwclient just seemed much more straightforward and Pythonic. My code is here if you're interested: https://github.com/JustAnotherArchivist/atwikibot/blob/master/currentwarriorproject.py
13:18 🔗 JAA ivan_: I use pywb for WARC playback when I need it. Apart from the fact that it copies around the WARCs and doesn't easily let you avoid that (but anarcat is working on that at https://github.com/webrecorder/pywb/pull/409 ), it's pretty good. Often enough, I just look at the raw file with zless though.
13:19 🔗 ivan_ thanks
13:26 🔗 wp494 has quit IRC (Ping timeout: 268 seconds)
13:26 🔗 wp494 has joined #archiveteam-ot
13:26 🔗 svchfoo3 sets mode: +o wp494
13:31 🔗 Soni has joined #archiveteam-ot
13:33 🔗 Soni hi
13:36 🔗 jesso has joined #archiveteam-ot
14:01 🔗 eientei95 [02:30:22] <Soni> we have phones now
14:01 🔗 eientei95 [02:30:26] <Soni> they get thrown out every 3 months
14:01 🔗 eientei95 https://www.youtube.com/watch?v=lW17rr20tGY
14:04 🔗 anarcat JAA: i'm working on that? for the record i've been waiting for them to figure out if it's okay or not at this step, did i miss something?
14:05 🔗 anarcat python-internetarchive just entered debian stable https://tracker.debian.org/pkg/python-internetarchive
14:30 🔗 JAA anarcat: Yeah, "working on it" in a broader sense.
14:31 🔗 JAA And great news regarding python-internetarchive! Thanks for that!
14:31 🔗 JAA s/stable/unstable/ though :-)
14:39 🔗 VerifiedJ has joined #archiveteam-ot
14:43 🔗 JAA "Alex jones infowars - Do you have this?"
14:43 🔗 JAA This is what you get via PM when you post in a popular thread on /r/DataHoarder. :-|
15:06 🔗 eientei95 I prefer David Dees for my conspiracy nutjobs thanks
15:24 🔗 t2t2 has quit IRC (Quit: t2t2)
15:30 🔗 voltagex_ Hi anarcat - I recognise that handle
15:34 🔗 t2t2 has joined #archiveteam-ot
16:33 🔗 vitzli has joined #archiveteam-ot
16:38 🔗 vitzli has quit IRC (Quit: Leaving)
17:13 🔗 Kolam has joined #archiveteam-ot
17:29 🔗 Verified_ has joined #archiveteam-ot
17:31 🔗 bithippo has joined #archiveteam-ot
17:32 🔗 VerifiedJ has quit IRC (Ping timeout: 252 seconds)
17:39 🔗 chferfa has joined #archiveteam-ot
17:56 🔗 Kolam has quit IRC (Quit: http://www.mibbit.com ajax IRC Client)
18:24 🔗 schbirid what was that esp like board that can be powered by ambient wifi again?
18:25 🔗 Soni are you gonna try to run an warrior on an ultra-low-power device that's powered by ambient wifi?!
18:25 🔗 adinbied has quit IRC (Read error: Operation timed out)
18:26 🔗 schbirid lol no that would not work
18:26 🔗 Soni would be cool if it did
18:26 🔗 Soni I mean, just program 100s of them and put them on all sorts of places with free wifi
18:28 🔗 adinbied has joined #archiveteam-ot
18:29 🔗 schbirid that would be a great way to get people against the warrior and internet archival projects
18:30 🔗 schbirid so please dont ever abuse services like that
18:30 🔗 schbirid !
18:30 🔗 schbirid (yes i get the idea and i like it but the consequences would be bad)
18:33 🔗 kiska Free wifi = bad
18:34 🔗 kiska Captive pages = bad
18:37 🔗 Soni okay
18:37 🔗 Soni most of the world runs on HTTPS these days, so it should be fine
18:38 🔗 kiska You do know what a captive page is right?
18:43 🔗 Soni yeah
18:43 🔗 Soni it hijacks HTTP connections
18:43 🔗 Soni which are not HTTPS connections
18:44 🔗 kiska Captive portals don't care if you have a https connection or not, captive pages force their way to your screen
18:44 🔗 schbirid if you try to access a https site, a captive portal can only make the connection fail
18:44 🔗 schbirid afaik
18:46 🔗 kiska So instead of helping us, it will only be polluting the eventual warcs
18:46 🔗 bithippo This ^^^
18:46 🔗 JAA Note that we often have certificate validation turned off because target sites may have expired certs etc.
18:47 🔗 JAA In that case, the captive portal would happily hijack any HTTPS connection.
18:47 🔗 bithippo soni: ArchiveTeam operations rely on clean connectivity. The cost of traditional compute and network is cheap compared to possible ingesting garbage because of non-quality connectivity.
18:49 🔗 bithippo In an ideal world, we'd archive from within web property infra or at their network edge.
18:49 🔗 Soni okay
19:05 🔗 Soni so uh, have y'all tried BGP hijacking?
19:08 🔗 schbirid uh http://petecogle.co.uk/blog/2018/12/14/free-music-archives-new-home-kitsplit/
19:09 🔗 schbirid sorry, direct link http://freemusicarchive.org/member/cheyenne_h/blog/Free_Music_Archives_new_home_KitSplit
19:10 🔗 Soni (like, when you need lots of IPs, just make them with BGP?)
19:10 🔗 kiska JAA Kaz HCross hook54321: pls kick Soni
19:11 🔗 schbirid script kiddies go to #kindergarten please
19:11 🔗 Kaz sigh
19:11 🔗 Soni ?
19:11 🔗 Soni why?
19:11 🔗 schbirid archiveteam is not doing illegal shit
19:12 🔗 Soni this is illegal?
19:12 🔗 kiska yes
19:12 🔗 Soni really?
19:12 🔗 Kaz Soni: I'm not sure if you're stupid or just a troll, but this ends now
19:14 🔗 Soni :/
19:16 🔗 miked has joined #archiveteam-ot
19:16 🔗 Kaz was kicked by hook54321 (Kaz)
19:16 🔗 hook54321 sets mode: +b *!*@autism.nbextension.download
19:17 🔗 Kaz has joined #archiveteam-ot
19:17 🔗 Kaz i mean.. close
19:17 🔗 hook54321 sets mode: +b soni!*@*
19:17 🔗 hook54321 sets mode: +o kiska
19:17 🔗 hook54321 sets mode: +o Kaz
19:17 🔗 Soni was kicked by Kaz (Soni)
19:17 🔗 kiska thanks
19:17 🔗 schbirid lol
19:17 🔗 Kaz sets mode: +b #archivet!*@*
19:17 🔗 Kaz uh
19:18 🔗 Kaz sets mode: -b #archivet!*@*
19:18 🔗 schbirid our ops are competent <3
19:18 🔗 schbirid :)
19:18 🔗 bithippo THANK YOU
19:19 🔗 kiska I've had my dose of stupid today
19:20 🔗 hook54321 We might want to try to check if he's been running the warrior, if possible
19:23 🔗 MrRadar2 has quit IRC (Quit: Rebooting)
19:25 🔗 MrRadar2 has joined #archiveteam-ot
19:32 🔗 t3 has quit IRC ()
19:36 🔗 teej_ has joined #archiveteam-ot
20:45 🔗 BlueMax has joined #archiveteam-ot
21:31 🔗 mgrytbak^ is now known as mgrytbak
22:20 🔗 BlueMax has quit IRC (Read error: Connection reset by peer)
22:22 🔗 BlueMax has joined #archiveteam-ot
22:25 🔗 wp494 has quit IRC (Ping timeout: 255 seconds)
22:25 🔗 wp494 has joined #archiveteam-ot
22:26 🔗 svchfoo3 sets mode: +o wp494
22:37 🔗 ubahn_ has quit IRC (Quit: ubahn_)
23:41 🔗 Cypher has joined #archiveteam-ot

irclogger-viewer