Time |
Nickname |
Message |
00:05
🔗
|
Jens |
"<redacted> Jason Scott looks like a cross between George R. R. Martin and Hugh Hefner." |
00:07
🔗
|
|
ivan` is now known as ivan_ |
01:05
🔗
|
Kaz |
anyone know of a tool that I can point to a folder and get a list of every video in it, with associated resolution, bitrate etc? Windows pref, but open to most things |
01:15
🔗
|
Kaz |
mediainfo appears to be the tool I was looking for |
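A minimal sketch of the folder scan Kaz describes, assuming the `mediainfo` command-line tool is on the PATH and that its `--Output=JSON` report lists tracks under `media.track`, with `Width`, `Height` and `BitRate` on the video track. The extension list and both function names are illustrative, not part of MediaInfo.

```python
import json
import subprocess
from pathlib import Path

# Assumed set of extensions to treat as video; adjust to taste.
VIDEO_EXTENSIONS = {".mp4", ".mkv", ".avi", ".mov", ".ts", ".webm"}


def video_summary(mediainfo_json: str) -> dict:
    """Pull resolution and bitrate of the first video track out of a MediaInfo JSON report."""
    report = json.loads(mediainfo_json)
    # "media" can be null when the file is not recognised as media.
    tracks = (report.get("media") or {}).get("track", [])
    for track in tracks:
        if track.get("@type") == "Video":
            return {
                "width": track.get("Width"),
                "height": track.get("Height"),
                "bitrate": track.get("BitRate"),
            }
    return {}


def scan_folder(folder: str) -> dict:
    """Run mediainfo on every video-looking file under folder (recursively)."""
    results = {}
    for path in sorted(Path(folder).rglob("*")):
        if path.suffix.lower() in VIDEO_EXTENSIONS:
            report = subprocess.run(
                ["mediainfo", "--Output=JSON", str(path)],
                capture_output=True, text=True, check=True,
            ).stdout
            results[str(path)] = video_summary(report)
    return results
```

Calling `scan_folder("D:/videos")` (a made-up path) would return one entry per file; values stay as the strings MediaInfo reports, so convert them before sorting by bitrate.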
01:23
🔗
|
|
trvz has quit IRC () |
01:48
🔗
|
|
terorie has quit IRC (Remote host closed the connection) |
01:48
🔗
|
|
terorie has joined #archiveteam-ot |
01:51
🔗
|
|
terorie has quit IRC (Remote host closed the connection) |
01:52
🔗
|
|
terorie has joined #archiveteam-ot |
01:57
🔗
|
|
terorie has quit IRC (Ping timeout: 268 seconds) |
02:05
🔗
|
|
VerifiedJ has quit IRC (Quit: Leaving) |
02:05
🔗
|
|
terorie has joined #archiveteam-ot |
02:17
🔗
|
|
terorie_ has joined #archiveteam-ot |
02:21
🔗
|
|
terorie has quit IRC (Ping timeout: 268 seconds) |
02:30
🔗
|
|
terorie_ has quit IRC (Remote host closed the connection) |
02:31
🔗
|
|
terorie has joined #archiveteam-ot |
02:32
🔗
|
|
terorie has quit IRC (Read error: Operation timed out) |
03:43
🔗
|
|
m007a83_ has joined #archiveteam-ot |
03:44
🔗
|
|
m007a83 has quit IRC (Ping timeout: 252 seconds) |
03:46
🔗
|
|
m007a83_ is now known as m007a83 |
03:48
🔗
|
|
boutique has quit IRC (Quit: zzzzz) |
03:55
🔗
|
|
uberushax has quit IRC (Remote host closed the connection) |
04:13
🔗
|
|
boutique has joined #archiveteam-ot |
04:15
🔗
|
|
odemg has quit IRC (Ping timeout: 265 seconds) |
04:18
🔗
|
|
ubahn_ has joined #archiveteam-ot |
04:21
🔗
|
|
ubahn has quit IRC (Read error: Operation timed out) |
04:25
🔗
|
|
wp494 has quit IRC (Ping timeout: 268 seconds) |
04:26
🔗
|
|
wp494 has joined #archiveteam-ot |
04:26
🔗
|
|
svchfoo3 sets mode: +o wp494 |
04:27
🔗
|
|
odemg has joined #archiveteam-ot |
04:36
🔗
|
wp494 |
these DHCP disconnects are getting pretty damn annoying |
04:37
🔗
|
|
wp494 sets mode: +ooo arkiver godane swebb |
04:54
🔗
|
|
terorie has joined #archiveteam-ot |
04:58
🔗
|
|
terorie has quit IRC (Read error: Operation timed out) |
05:02
🔗
|
|
terorie has joined #archiveteam-ot |
05:07
🔗
|
|
terorie has quit IRC (Ping timeout: 268 seconds) |
05:21
🔗
|
|
boutique_ has joined #archiveteam-ot |
05:24
🔗
|
|
boutique has quit IRC (Ping timeout: 252 seconds) |
05:26
🔗
|
|
boutique has joined #archiveteam-ot |
05:28
🔗
|
|
boutique has quit IRC (Read error: Connection reset by peer) |
05:28
🔗
|
|
boutique has joined #archiveteam-ot |
05:29
🔗
|
|
boutique_ has quit IRC (Ping timeout: 252 seconds) |
05:33
🔗
|
|
Stiletto has quit IRC (Ping timeout: 265 seconds) |
05:41
🔗
|
|
boutique_ has joined #archiveteam-ot |
05:45
🔗
|
|
boutique has quit IRC (Ping timeout: 252 seconds) |
05:45
🔗
|
voltagex_ |
where is the line between archiving and data hoarding? |
05:47
🔗
|
ivan_ |
a data hoarder is more of a person who is trying to fill up their too-many-hard drives with whatever they want |
05:47
🔗
|
ivan_ |
archiving pays some attention to the general value of the content and has some plan for future accessibility |
05:47
🔗
|
|
boutique has joined #archiveteam-ot |
05:48
🔗
|
ivan_ |
I guess the line is blurry in many cases |
05:49
🔗
|
ivan_ |
Brewster is just the best data hoarder :-) |
05:49
🔗
|
|
boutique_ has quit IRC (Ping timeout: 252 seconds) |
06:02
🔗
|
eientei95 |
ivan_: Data hoarding is just making the stuff for digital archaeologists to look through :P |
06:03
🔗
|
voltagex_ |
Well, my current issue is I need to reduce the stuff I have, and I've got ~100GB of a Tomorrowland livestream that probably shouldn't be lost. |
06:03
🔗
|
ivan_ |
you can put many petabytes into google drive |
06:04
🔗
|
voltagex_ |
I was hoping FOS could take it :P |
06:06
🔗
|
ivan_ |
you can also upload things directly to IA |
06:07
🔗
|
ivan_ |
https://archive.org/help/abouts3.txt |
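A rough sketch of what a direct upload through the S3-like interface in that document might look like. It only builds the URL and headers for an HTTP PUT and sends nothing. The `LOW access:secret` authorization form and the `x-amz-auto-make-bucket` and `x-archive-meta-mediatype` headers are taken from memory of that document and should be checked against it; the function name, item identifier and keys are placeholders.

```python
from urllib.parse import quote

# Endpoint as given in the IA S3 documentation (assumption: HTTPS is accepted).
S3_ENDPOINT = "https://s3.us.archive.org"


def build_upload(identifier, filename, access_key, secret_key, mediatype="movies"):
    """Return (url, headers) for a PUT that uploads filename into the item identifier."""
    url = f"{S3_ENDPOINT}/{quote(identifier)}/{quote(filename)}"
    headers = {
        # IA uses its own "LOW" scheme instead of AWS request signing.
        "authorization": f"LOW {access_key}:{secret_key}",
        # Create the item on the fly if it does not exist yet.
        "x-amz-auto-make-bucket": "1",
        # Item-level metadata is passed as x-archive-meta-* headers.
        "x-archive-meta-mediatype": mediatype,
    }
    return url, headers
```

The returned pair can be handed to any HTTP client that supports streaming a PUT body, which matters for a file in the 100 GB range.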
06:07
🔗
|
voltagex_ |
legal grey area I guess |
06:07
🔗
|
voltagex_ |
not quite as bad as Nintendo but ID&T are a weird company. |
06:07
🔗
|
JAA |
Email Jason then, I guess. |
06:08
🔗
|
voltagex_ |
I've got to work out whether this video file is valid :/ |
06:08
🔗
|
voltagex_ |
plays in VLC != accessible in the future |
06:08
🔗
|
voltagex_ |
MPEG4-TS is an abomination. |
06:08
🔗
|
JAA |
Eww, yeah. |
06:12
🔗
|
voltagex_ |
hm, Xbox One plays it, and it's a strangely compliant player. |
06:13
🔗
|
|
boutique_ has joined #archiveteam-ot |
06:13
🔗
|
JAA |
There must be some tool which strictly checks whether a video file complies with the specifications, right? |
06:16
🔗
|
|
boutique has quit IRC (Ping timeout: 252 seconds) |
06:16
🔗
|
voltagex_ |
possibly. |
06:17
🔗
|
voltagex_ |
JAA: sigh. https://forum.doom9.org/showthread.php?s=028d37878e073193b81c74c58b06e01d&p=1067204#post1067204 |
06:18
🔗
|
JAA |
I'm not surprised. |
06:18
🔗
|
JAA |
Also, that thread is from 2007. |
06:20
🔗
|
|
boutique has joined #archiveteam-ot |
06:20
🔗
|
|
boutique_ has quit IRC (Ping timeout: 252 seconds) |
06:21
🔗
|
JAA |
Found a commercial tool: http://www.jongbel.com/automated-validation/media-validator/ |
06:23
🔗
|
voltagex_ |
149 EUR per month lol |
06:27
🔗
|
voltagex_ |
props to them for writing their own decoders instead of just using ffmpeg though |
06:30
🔗
|
|
JAA has quit IRC (leaving) |
06:34
🔗
|
|
JAA has joined #archiveteam-ot |
06:34
🔗
|
|
svchfoo3 sets mode: +o JAA |
06:35
🔗
|
|
bakJAA sets mode: +o JAA |
06:40
🔗
|
JAA |
voltagex_: So Stack Overflow recommends transcoding it to nothing with ffmpeg. I guess that works and ffmpeg should produce warnings and errors, but I'm not sure how strict it is. |
06:41
🔗
|
voltagex_ |
JAA: sorry, I didn't mean to take up your time on one of my rabbit holes |
06:41
🔗
|
voltagex_ |
we're all going to be underwater / on fire or both in the future, so it may not matter. |
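The "transcode it to nothing" check JAA mentions can be sketched as below, assuming `ffmpeg` is installed. `-f null -` makes ffmpeg decode the whole input and discard the result, and `-v error` limits stderr to error messages. As noted in the chat, this shows what ffmpeg's decoders complain about, not strict compliance with the container or codec specifications. Both function names are illustrative.

```python
import subprocess


def null_decode_command(path: str) -> list:
    """Build an ffmpeg command that decodes path and throws the output away."""
    return ["ffmpeg", "-v", "error", "-i", path, "-f", "null", "-"]


def decode_errors(path: str) -> list:
    """Run the null decode and return any non-empty lines ffmpeg wrote to stderr."""
    result = subprocess.run(null_decode_command(path), capture_output=True, text=True)
    return [line for line in result.stderr.splitlines() if line.strip()]
```

An empty list from `decode_errors("stream.ts")` (hypothetical filename) means ffmpeg decoded the file without reporting errors, which is weaker than saying the file is valid.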
06:47
🔗
|
|
DarkWorld has joined #archiveteam-ot |
07:16
🔗
|
|
terorie has joined #archiveteam-ot |
07:22
🔗
|
|
terorie has quit IRC (Ping timeout: 268 seconds) |
07:27
🔗
|
|
terorie has joined #archiveteam-ot |
08:29
🔗
|
|
m007a83_ has joined #archiveteam-ot |
08:30
🔗
|
|
m007a83 has quit IRC (Ping timeout: 252 seconds) |
08:34
🔗
|
|
m007a83_ is now known as m007a83 |
10:17
🔗
|
|
hook54321 has quit IRC (Quit: Connection closed for inactivity) |
10:37
🔗
|
|
terorie has quit IRC (Remote host closed the connection) |
10:37
🔗
|
|
terorie has joined #archiveteam-ot |
10:38
🔗
|
|
terorie has quit IRC (Client Quit) |
10:59
🔗
|
|
Stiletto has joined #archiveteam-ot |
11:08
🔗
|
|
DarkWorld has quit IRC (Leaving) |
11:20
🔗
|
|
BlueMax has quit IRC (Quit: Leaving) |
11:20
🔗
|
|
caff_ has quit IRC (Read error: Connection reset by peer) |
12:01
🔗
|
|
boutique has quit IRC (Quit: Leaving) |
12:07
🔗
|
|
vitzli has joined #archiveteam-ot |
12:15
🔗
|
VoynichCr |
JAA: https://github.com/emijrp/internet-archive/blob/master/archivebot.py |
12:16
🔗
|
VoynichCr |
that is the bot which updates tables in wiki |
12:16
🔗
|
VoynichCr |
it requires pywikibot (installed and configured) |
12:18
🔗
|
VoynichCr |
i can write detailed instructions if needed |
12:20
🔗
|
VoynichCr |
the scripts for the deaths and disestablishments pages are in the same repo |
12:43
🔗
|
ivan_ |
do people use pywb for looking inside WARCs or something else? |
12:43
🔗
|
* |
ivan_ spots https://github.com/webrecorder/webrecorder-player |
12:49
🔗
|
|
hook54321 has joined #archiveteam-ot |
12:49
🔗
|
|
svchfoo3 sets mode: +o hook54321 |
12:52
🔗
|
HCross |
ivan_: warcio |
12:52
🔗
|
HCross |
Because it doesn't need to load the entire warc into disk |
12:52
🔗
|
HCross |
Which makes working with megawarcs so much nicer |
12:53
🔗
|
ivan_ |
ah but this person wanted a thing to play them back / browse them |
12:53
🔗
|
ivan_ |
looks like pywb uses it |
12:56
🔗
|
|
Mateon1 has quit IRC (Read error: Operation timed out) |
12:56
🔗
|
|
Mateon1 has joined #archiveteam-ot |
13:00
🔗
|
|
vitzli has quit IRC (Quit: Leaving) |
13:14
🔗
|
JAA |
VoynichCr: Sweet, thanks, I'll have a look. I did look at pywikibot, but mwclient just seemed much more straightforward and Pythonic. My code is here if you're interested: https://github.com/JustAnotherArchivist/atwikibot/blob/master/currentwarriorproject.py |
13:18
🔗
|
JAA |
ivan_: I use pywb for WARC playback when I need it. Apart from the fact that it copies around the WARCs and doesn't easily let you avoid that (but anarcat is working on that at https://github.com/webrecorder/pywb/pull/409 ), it's pretty good. Often enough, I just look at the raw file with zless though. |
13:19
🔗
|
ivan_ |
thanks |
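For the "just look at the raw file" approach, here is a crude standard-library sketch that streams a gzip-compressed WARC and lists the `WARC-Target-URI` header values without reading the whole file into memory. It is a stand-in for a quick `zless` skim, not a WARC parser: it does not track record boundaries, so a payload line that happens to start with the same text would also match. warcio, as HCross suggests, is the purpose-built way to iterate records. The function name is illustrative.

```python
import gzip
import io


def target_uris(stream):
    """Yield WARC-Target-URI values from a gzip-compressed WARC, one line at a time."""
    # gzip.open accepts a path or a binary file object and reads
    # concatenated gzip members (one per record) as a single stream.
    with gzip.open(stream, "rb") as warc:
        for raw in warc:
            if raw.lower().startswith(b"warc-target-uri:"):
                yield raw.split(b":", 1)[1].strip().decode("utf-8", "replace")
```

Passing a filename such as `"example.warc.gz"` (made up) works the same way as passing an open binary file object.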
13:26
🔗
|
|
wp494 has quit IRC (Ping timeout: 268 seconds) |
13:26
🔗
|
|
wp494 has joined #archiveteam-ot |
13:26
🔗
|
|
svchfoo3 sets mode: +o wp494 |
13:31
🔗
|
|
Soni has joined #archiveteam-ot |
13:33
🔗
|
Soni |
hi |
13:36
🔗
|
|
jesso has joined #archiveteam-ot |
14:01
🔗
|
eientei95 |
[02:30:22] <Soni> we have phones now |
14:01
🔗
|
eientei95 |
[02:30:26] <Soni> they get thrown out every 3 months |
14:01
🔗
|
eientei95 |
https://www.youtube.com/watch?v=lW17rr20tGY |
14:04
🔗
|
anarcat |
JAA: i'm working on that? for the record i've been waiting for them to figure out if it's okay or not at this step, did i miss something? |
14:05
🔗
|
anarcat |
python-internetarchive just entered debian stable https://tracker.debian.org/pkg/python-internetarchive |
14:30
🔗
|
JAA |
anarcat: Yeah, "working on it" in a broader sense. |
14:31
🔗
|
JAA |
And great news regarding python-internetarchive! Thanks for that! |
14:31
🔗
|
JAA |
s/stable/unstable/ though :-) |
14:39
🔗
|
|
VerifiedJ has joined #archiveteam-ot |
14:43
🔗
|
JAA |
"Alex jones infowars - Do you have this?" |
14:43
🔗
|
JAA |
This is what you get via PM when you post in a popular thread on /r/DataHoarder. :-| |
15:06
🔗
|
eientei95 |
I prefer David Dees for my conspiracy nutjobs thanks |
15:24
🔗
|
|
t2t2 has quit IRC (Quit: t2t2) |
15:30
🔗
|
voltagex_ |
Hi anarcat - I recognise that handle |
15:34
🔗
|
|
t2t2 has joined #archiveteam-ot |
16:33
🔗
|
|
vitzli has joined #archiveteam-ot |
16:38
🔗
|
|
vitzli has quit IRC (Quit: Leaving) |
17:13
🔗
|
|
Kolam has joined #archiveteam-ot |
17:29
🔗
|
|
Verified_ has joined #archiveteam-ot |
17:31
🔗
|
|
bithippo has joined #archiveteam-ot |
17:32
🔗
|
|
VerifiedJ has quit IRC (Ping timeout: 252 seconds) |
17:39
🔗
|
|
chferfa has joined #archiveteam-ot |
17:56
🔗
|
|
Kolam has quit IRC (Quit: http://www.mibbit.com ajax IRC Client) |
18:24
🔗
|
schbirid |
what was that ESP-like board that can be powered by ambient wifi again? |
18:25
🔗
|
Soni |
are you gonna try to run a warrior on an ultra-low-power device that's powered by ambient wifi?! |
18:25
🔗
|
|
adinbied has quit IRC (Read error: Operation timed out) |
18:26
🔗
|
schbirid |
lol no that would not work |
18:26
🔗
|
Soni |
would be cool if it did |
18:26
🔗
|
Soni |
I mean, just program 100s of them and put them on all sorts of places with free wifi |
18:28
🔗
|
|
adinbied has joined #archiveteam-ot |
18:29
🔗
|
schbirid |
that would be a great way to get people against the warrior and internet archival projects |
18:30
🔗
|
schbirid |
so please don't ever abuse services like that |
18:30
🔗
|
schbirid |
! |
18:30
🔗
|
schbirid |
(yes i get the idea and i like it but the consequences would be bad) |
18:33
🔗
|
kiska |
Free wifi = bad |
18:34
🔗
|
kiska |
Captive pages = bad |
18:37
🔗
|
Soni |
okay |
18:37
🔗
|
Soni |
most of the world runs on HTTPS these days, so it should be fine |
18:38
🔗
|
kiska |
You do know what a captive page is right? |
18:43
🔗
|
Soni |
yeah |
18:43
🔗
|
Soni |
it hijacks HTTP connections |
18:43
🔗
|
Soni |
which are not HTTPS connections |
18:44
🔗
|
kiska |
Captive portals don't care if you have a https connection or not, captive pages force their way to your screen |
18:44
🔗
|
schbirid |
if you try to access a https site, a captive portal can only make the connection fail |
18:44
🔗
|
schbirid |
afaik |
18:46
🔗
|
kiska |
So instead of helping us, it will only be polluting the eventual warcs |
18:46
🔗
|
bithippo |
This ^^^ |
18:46
🔗
|
JAA |
Note that we often have certificate validation turned off because target sites may have expired certs etc. |
18:47
🔗
|
JAA |
In that case, the captive portal would happily hijack any HTTPS connection. |
18:47
🔗
|
bithippo |
soni: ArchiveTeam operations rely on clean connectivity. The cost of traditional compute and network is low compared to the risk of ingesting garbage because of poor-quality connectivity. |
18:49
🔗
|
bithippo |
In an ideal world, we'd archive from within web property infra or at their network edge. |
18:49
🔗
|
Soni |
okay |
19:05
🔗
|
Soni |
so uh, have y'all tried BGP hijacking? |
19:08
🔗
|
schbirid |
uh http://petecogle.co.uk/blog/2018/12/14/free-music-archives-new-home-kitsplit/ |
19:09
🔗
|
schbirid |
sorry, direct link http://freemusicarchive.org/member/cheyenne_h/blog/Free_Music_Archives_new_home_KitSplit |
19:10
🔗
|
Soni |
(like, when you need lots of IPs, just make them with BGP?) |
19:10
🔗
|
kiska |
JAA Kaz HCross hook54321: pls kick Soni |
19:11
🔗
|
schbirid |
script kiddies go to #kindergarten please |
19:11
🔗
|
Kaz |
sigh |
19:11
🔗
|
Soni |
? |
19:11
🔗
|
Soni |
why? |
19:11
🔗
|
schbirid |
archiveteam is not doing illegal shit |
19:12
🔗
|
Soni |
this is illegal? |
19:12
🔗
|
kiska |
yes |
19:12
🔗
|
Soni |
really? |
19:12
🔗
|
Kaz |
Soni: I'm not sure if you're stupid or just a troll, but this ends now |
19:14
🔗
|
Soni |
:/ |
19:16
🔗
|
|
miked has joined #archiveteam-ot |
19:16
🔗
|
|
Kaz was kicked by hook54321 (Kaz) |
19:16
🔗
|
|
hook54321 sets mode: +b *!*@autism.nbextension.download |
19:17
🔗
|
|
Kaz has joined #archiveteam-ot |
19:17
🔗
|
Kaz |
i mean.. close |
19:17
🔗
|
|
hook54321 sets mode: +b soni!*@* |
19:17
🔗
|
|
hook54321 sets mode: +o kiska |
19:17
🔗
|
|
hook54321 sets mode: +o Kaz |
19:17
🔗
|
|
Soni was kicked by Kaz (Soni) |
19:17
🔗
|
kiska |
thanks |
19:17
🔗
|
schbirid |
lol |
19:17
🔗
|
|
Kaz sets mode: +b #archivet!*@* |
19:17
🔗
|
Kaz |
uh |
19:18
🔗
|
|
Kaz sets mode: -b #archivet!*@* |
19:18
🔗
|
schbirid |
our ops are competent <3 |
19:18
🔗
|
schbirid |
:) |
19:18
🔗
|
bithippo |
THANK YOU |
19:19
🔗
|
kiska |
I've had my dose of stupid today |
19:20
🔗
|
hook54321 |
We might want to try to check if he's been running the warrior, if possible |
19:23
🔗
|
|
MrRadar2 has quit IRC (Quit: Rebooting) |
19:25
🔗
|
|
MrRadar2 has joined #archiveteam-ot |
19:32
🔗
|
|
t3 has quit IRC () |
19:36
🔗
|
|
teej_ has joined #archiveteam-ot |
20:45
🔗
|
|
BlueMax has joined #archiveteam-ot |
21:31
🔗
|
|
mgrytbak^ is now known as mgrytbak |
22:20
🔗
|
|
BlueMax has quit IRC (Read error: Connection reset by peer) |
22:22
🔗
|
|
BlueMax has joined #archiveteam-ot |
22:25
🔗
|
|
wp494 has quit IRC (Ping timeout: 255 seconds) |
22:25
🔗
|
|
wp494 has joined #archiveteam-ot |
22:26
🔗
|
|
svchfoo3 sets mode: +o wp494 |
22:37
🔗
|
|
ubahn_ has quit IRC (Quit: ubahn_) |
23:41
🔗
|
|
Cypher has joined #archiveteam-ot |