[00:05] " Jason Scott looks like a cross between George R. R. Martin and Hugh Hefner." [00:07] *** ivan` is now known as ivan_ [01:05] anyone know of a tool that I can point to a folder and get a list of every video in it, with associated resolution, bitrate etc? Windows pref, but open to most things [01:15] mediainfo appears to be the tool I was looking for [01:23] *** trvz has quit IRC () [01:48] *** terorie has quit IRC (Remote host closed the connection) [01:48] *** terorie has joined #archiveteam-ot [01:51] *** terorie has quit IRC (Remote host closed the connection) [01:52] *** terorie has joined #archiveteam-ot [01:57] *** terorie has quit IRC (Ping timeout: 268 seconds) [02:05] *** VerifiedJ has quit IRC (Quit: Leaving) [02:05] *** terorie has joined #archiveteam-ot [02:17] *** terorie_ has joined #archiveteam-ot [02:21] *** terorie has quit IRC (Ping timeout: 268 seconds) [02:30] *** terorie_ has quit IRC (Remote host closed the connection) [02:31] *** terorie has joined #archiveteam-ot [02:32] *** terorie has quit IRC (Read error: Operation timed out) [03:43] *** m007a83_ has joined #archiveteam-ot [03:44] *** m007a83 has quit IRC (Ping timeout: 252 seconds) [03:46] *** m007a83_ is now known as m007a83 [03:48] *** boutique has quit IRC (Quit: zzzzz) [03:55] *** uberushax has quit IRC (Remote host closed the connection) [04:13] *** boutique has joined #archiveteam-ot [04:15] *** odemg has quit IRC (Ping timeout: 265 seconds) [04:18] *** ubahn_ has joined #archiveteam-ot [04:21] *** ubahn has quit IRC (Read error: Operation timed out) [04:25] *** wp494 has quit IRC (Ping timeout: 268 seconds) [04:26] *** wp494 has joined #archiveteam-ot [04:26] *** svchfoo3 sets mode: +o wp494 [04:27] *** odemg has joined #archiveteam-ot [04:36] these DHCP disconnects are getting pretty damn annoying [04:37] *** wp494 sets mode: +ooo arkiver godane swebb [04:54] *** terorie has joined #archiveteam-ot [04:58] *** terorie has quit IRC (Read error: Operation timed out) [05:02] *** terorie has joined #archiveteam-ot [05:07] *** terorie has quit IRC (Ping timeout: 268 seconds) [05:21] *** boutique_ has joined #archiveteam-ot [05:24] *** boutique has quit IRC (Ping timeout: 252 seconds) [05:26] *** boutique has joined #archiveteam-ot [05:28] *** boutique has quit IRC (Read error: Connection reset by peer) [05:28] *** boutique has joined #archiveteam-ot [05:29] *** boutique_ has quit IRC (Ping timeout: 252 seconds) [05:33] *** Stiletto has quit IRC (Ping timeout: 265 seconds) [05:41] *** boutique_ has joined #archiveteam-ot [05:45] *** boutique has quit IRC (Ping timeout: 252 seconds) [05:45] where is the line between archiving and data hoarding? [05:47] a data hoarder is more of a person who is trying to fill up their too-many-hard drives with whatever they want [05:47] archiving pays some attention to the general value of the content and has some plan for future accessibility [05:47] *** boutique has joined #archiveteam-ot [05:48] I guess the line is blurry in many cases [05:49] Brewster is just the best data hoarder :-) [05:49] *** boutique_ has quit IRC (Ping timeout: 252 seconds) [06:02] ivan_: Data hoarding is just making the stuff for digital archaeologists to look through :P [06:03] Well, my current issue is I need to reduce the stuff I have, and I've got ~100GB of a Tomorrowland livestream that probably shouldn't be lost. [06:03] you can put many petabytes into google drive [06:04] I was hoping FOS could take it :P [06:06] you can also upload things directly to IA [06:07] https://archive.org/help/abouts3.txt [06:07] legal grey area I guess [06:07] not quite as bad as Nintendo but ID&T are a weird company. [06:07] Email Jason then, I guess. [06:08] I've got to work out whether this video file is valid :/ [06:08] plays in VLC != accessible in the future [06:08] MPEG4-TS is an abomination. [06:08] Eww, yeah. [06:12] hm, Xbox One plays it, and it's a strangely compliant player. [06:13] *** boutique_ has joined #archiveteam-ot [06:13] There must be some tool which strictly checks whether a video file complies with the specifications, right? [06:16] *** boutique has quit IRC (Ping timeout: 252 seconds) [06:16] possibly. [06:17] JAA: sigh. https://forum.doom9.org/showthread.php?s=028d37878e073193b81c74c58b06e01d&p=1067204#post1067204 [06:18] I'm not surprised. [06:18] Also, that thread is from 2007. [06:20] *** boutique has joined #archiveteam-ot [06:20] *** boutique_ has quit IRC (Ping timeout: 252 seconds) [06:21] Found a commercial tool: http://www.jongbel.com/automated-validation/media-validator/ [06:23] 149 EUR per month lol [06:27] props to them for writing their own decoders instead of just using ffmpeg though [06:30] *** JAA has quit IRC (leaving) [06:34] *** JAA has joined #archiveteam-ot [06:34] *** svchfoo3 sets mode: +o JAA [06:35] *** bakJAA sets mode: +o JAA [06:40] voltagex_: So Stack Overflow recommends transcoding it to nothing with ffmpeg. I guess that works and ffmpeg should produce warnings and errors, but I'm not sure how strict it is. [06:41] JAA: sorry, I didn't mean to take up your time on one of my rabbit holes [06:41] we're all going to be underwater / on fire or both in the future, so it may not matter. [06:47] *** DarkWorld has joined #archiveteam-ot [07:16] *** terorie has joined #archiveteam-ot [07:22] *** terorie has quit IRC (Ping timeout: 268 seconds) [07:27] *** terorie has joined #archiveteam-ot [08:29] *** m007a83_ has joined #archiveteam-ot [08:30] *** m007a83 has quit IRC (Ping timeout: 252 seconds) [08:34] *** m007a83_ is now known as m007a83 [10:17] *** hook54321 has quit IRC (Quit: Connection closed for inactivity) [10:37] *** terorie has quit IRC (Remote host closed the connection) [10:37] *** terorie has joined #archiveteam-ot [10:38] *** terorie has quit IRC (Client Quit) [10:59] *** Stiletto has joined #archiveteam-ot [11:08] *** DarkWorld has quit IRC (Leaving) [11:20] *** BlueMax has quit IRC (Quit: Leaving) [11:20] *** caff_ has quit IRC (Read error: Connection reset by peer) [12:01] *** boutique has quit IRC (Quit: Leaving) [12:07] *** vitzli has joined #archiveteam-ot [12:15] JAA: https://github.com/emijrp/internet-archive/blob/master/archivebot.py [12:16] that is the bot which updates tables in wiki [12:16] it requires pywikibot (and configured) [12:18] i can write detailed instructions if needed [12:20] the scripts for the deaths and disestablishements pages are in the same repo [12:43] do people use pywb for looking inside WARCs or something else? [12:43] * ivan_ spots https://github.com/webrecorder/webrecorder-player [12:49] *** hook54321 has joined #archiveteam-ot [12:49] *** svchfoo3 sets mode: +o hook54321 [12:52] ivan_: warcio [12:52] Because it doesn't need to load the entire warc into disk [12:52] Which makes working with megawarcs so much nicer [12:53] ah but this person wanted a thing to play them back / browse them [12:53] looks like pywb uses it [12:56] *** Mateon1 has quit IRC (Read error: Operation timed out) [12:56] *** Mateon1 has joined #archiveteam-ot [13:00] *** vitzli has quit IRC (Quit: Leaving) [13:14] VoynichCr: Sweet, thanks, I'll have a look. I did look at pywikibot, but mwclient just seemed much more straightforward and Pythonic. My code is here if you're interested: https://github.com/JustAnotherArchivist/atwikibot/blob/master/currentwarriorproject.py [13:18] ivan_: I use pywb for WARC playback when I need it. Apart from the fact that it copies around the WARCs and doesn't easily let you avoid that (but anarcat is working on that at https://github.com/webrecorder/pywb/pull/409 ), it's pretty good. Often enough, I just look at the raw file with zless though. [13:19] thanks [13:26] *** wp494 has quit IRC (Ping timeout: 268 seconds) [13:26] *** wp494 has joined #archiveteam-ot [13:26] *** svchfoo3 sets mode: +o wp494 [13:31] *** Soni has joined #archiveteam-ot [13:33] hi [13:36] *** jesso has joined #archiveteam-ot [14:01] [02:30:22] we have phones now [14:01] [02:30:26] they get thrown out every 3 months [14:01] https://www.youtube.com/watch?v=lW17rr20tGY [14:04] JAA: i'm working on that? for the record i've been waiting for them to figure out if it's okay or not at this step, did i miss something? [14:05] python-internetarchive just entered debian stable https://tracker.debian.org/pkg/python-internetarchive [14:30] anarcat: Yeah, "working on it" in a broader sense. [14:31] And great news regarding python-internetarchive! Thanks for that! [14:31] s/stable/unstable/ though :-) [14:39] *** VerifiedJ has joined #archiveteam-ot [14:43] "Alex jones infowars - Do you have this?" [14:43] This is what you get via PM when you post in a popular thread on /r/DataHoarder. :-| [15:06] I prefer David Dees for my conspiracy nutjobs thanks [15:24] *** t2t2 has quit IRC (Quit: t2t2) [15:30] Hi anarcat - I recognise that handle [15:34] *** t2t2 has joined #archiveteam-ot [16:33] *** vitzli has joined #archiveteam-ot [16:38] *** vitzli has quit IRC (Quit: Leaving) [17:13] *** Kolam has joined #archiveteam-ot [17:29] *** Verified_ has joined #archiveteam-ot [17:31] *** bithippo has joined #archiveteam-ot [17:32] *** VerifiedJ has quit IRC (Ping timeout: 252 seconds) [17:39] *** chferfa has joined #archiveteam-ot [17:56] *** Kolam has quit IRC (Quit: http://www.mibbit.com ajax IRC Client) [18:24] what was that esp like board that can be powered by ambient wifi again? [18:25] are you gonna try to run an warrior on an ultra-low-power device that's powered by ambient wifi?! [18:25] *** adinbied has quit IRC (Read error: Operation timed out) [18:26] lol no that would not work [18:26] would be cool if it did [18:26] I mean, just program 100s of them and put them on all sorts of places with free wifi [18:28] *** adinbied has joined #archiveteam-ot [18:29] that would be a great way to get people against the warrior and internet archival projects [18:30] so please dont ever abuse services like that [18:30] ! [18:30] (yes i get the idea and i like it but the consequences would be bad) [18:33] Free wifi = bad [18:34] Captive pages = bad [18:37] okay [18:37] most of the world runs on HTTPS these days, so it should be fine [18:38] You do know what a captive page is right? [18:43] yeah [18:43] it hijacks HTTP connections [18:43] which are not HTTPS connections [18:44] Captive portals don't care if you have a https connection or not, captive pages force their way to your screen [18:44] if you try to access a https site, a captive portal can only make the connection fail [18:44] afaik [18:46] So instead of helping us, it will only be polluting the eventual warcs [18:46] This ^^^ [18:46] Note that we often have certificate validation turned off because target sites may have expired certs etc. [18:47] In that case, the captive portal would happily hijack any HTTPS connection. [18:47] soni: ArchiveTeam operations rely on clean connectivity. The cost of traditional compute and network is cheap compared to possible ingesting garbage because of non-quality connectivity. [18:49] In an ideal world, we'd archive from within web property infra or at their network edge. [18:49] okay [19:05] so uh, have y'all tried BGP hijacking? [19:08] uh http://petecogle.co.uk/blog/2018/12/14/free-music-archives-new-home-kitsplit/ [19:09] sorry, direct link http://freemusicarchive.org/member/cheyenne_h/blog/Free_Music_Archives_new_home_KitSplit [19:10] (like, when you need lots of IPs, just make them with BGP?) [19:10] JAA Kaz HCross hook54321: pls kick Soni [19:11] script kiddies go to #kindergarten please [19:11] sigh [19:11] ? [19:11] why? [19:11] archiveteam is not doing illegal shit [19:12] this is illegal? [19:12] yes [19:12] really? [19:12] Soni: I'm not sure if you're stupid or just a troll, but this ends now [19:14] :/ [19:16] *** miked has joined #archiveteam-ot [19:16] *** Kaz was kicked by hook54321 (Kaz) [19:16] *** hook54321 sets mode: +b *!*@autism.nbextension.download [19:17] *** Kaz has joined #archiveteam-ot [19:17] i mean.. close [19:17] *** hook54321 sets mode: +b soni!*@* [19:17] *** hook54321 sets mode: +o kiska [19:17] *** hook54321 sets mode: +o Kaz [19:17] *** Soni was kicked by Kaz (Soni) [19:17] thanks [19:17] lol [19:17] *** Kaz sets mode: +b #archivet!*@* [19:17] uh [19:18] *** Kaz sets mode: -b #archivet!*@* [19:18] our ops are competent <3 [19:18] :) [19:18] THANK YOU [19:19] I've had my dose of stupid today [19:20] We might want to try to check if he's been running the warrior, if possible [19:23] *** MrRadar2 has quit IRC (Quit: Rebooting) [19:25] *** MrRadar2 has joined #archiveteam-ot [19:32] *** t3 has quit IRC () [19:36] *** teej_ has joined #archiveteam-ot [20:45] *** BlueMax has joined #archiveteam-ot [21:31] *** mgrytbak^ is now known as mgrytbak [22:20] *** BlueMax has quit IRC (Read error: Connection reset by peer) [22:22] *** BlueMax has joined #archiveteam-ot [22:25] *** wp494 has quit IRC (Ping timeout: 255 seconds) [22:25] *** wp494 has joined #archiveteam-ot [22:26] *** svchfoo3 sets mode: +o wp494 [22:37] *** ubahn_ has quit IRC (Quit: ubahn_) [23:41] *** Cypher has joined #archiveteam-ot