| Time |
Nickname |
Message |
|
00:05
🔗
|
Jens |
"<redacted> Jason Scott looks like a cross between George R. R. Martin and Hugh Hefner." |
|
00:07
🔗
|
|
ivan` is now known as ivan_ |
|
01:05
🔗
|
Kaz |
anyone know of a tool that I can point to a folder and get a list of every video in it, with associated resolution, bitrate etc? Windows pref, but open to most things |
|
01:15
🔗
|
Kaz |
mediainfo appears to be the tool I was looking for |
|
01:23
🔗
|
|
trvz has quit IRC () |
|
01:48
🔗
|
|
terorie has quit IRC (Remote host closed the connection) |
|
01:48
🔗
|
|
terorie has joined #archiveteam-ot |
|
01:51
🔗
|
|
terorie has quit IRC (Remote host closed the connection) |
|
01:52
🔗
|
|
terorie has joined #archiveteam-ot |
|
01:57
🔗
|
|
terorie has quit IRC (Ping timeout: 268 seconds) |
|
02:05
🔗
|
|
VerifiedJ has quit IRC (Quit: Leaving) |
|
02:05
🔗
|
|
terorie has joined #archiveteam-ot |
|
02:17
🔗
|
|
terorie_ has joined #archiveteam-ot |
|
02:21
🔗
|
|
terorie has quit IRC (Ping timeout: 268 seconds) |
|
02:30
🔗
|
|
terorie_ has quit IRC (Remote host closed the connection) |
|
02:31
🔗
|
|
terorie has joined #archiveteam-ot |
|
02:32
🔗
|
|
terorie has quit IRC (Read error: Operation timed out) |
|
03:43
🔗
|
|
m007a83_ has joined #archiveteam-ot |
|
03:44
🔗
|
|
m007a83 has quit IRC (Ping timeout: 252 seconds) |
|
03:46
🔗
|
|
m007a83_ is now known as m007a83 |
|
03:48
🔗
|
|
boutique has quit IRC (Quit: zzzzz) |
|
03:55
🔗
|
|
uberushax has quit IRC (Remote host closed the connection) |
|
04:13
🔗
|
|
boutique has joined #archiveteam-ot |
|
04:15
🔗
|
|
odemg has quit IRC (Ping timeout: 265 seconds) |
|
04:18
🔗
|
|
ubahn_ has joined #archiveteam-ot |
|
04:21
🔗
|
|
ubahn has quit IRC (Read error: Operation timed out) |
|
04:25
🔗
|
|
wp494 has quit IRC (Ping timeout: 268 seconds) |
|
04:26
🔗
|
|
wp494 has joined #archiveteam-ot |
|
04:26
🔗
|
|
svchfoo3 sets mode: +o wp494 |
|
04:27
🔗
|
|
odemg has joined #archiveteam-ot |
|
04:36
🔗
|
wp494 |
these DHCP disconnects are getting pretty damn annoying |
|
04:37
🔗
|
|
wp494 sets mode: +ooo arkiver godane swebb |
|
04:54
🔗
|
|
terorie has joined #archiveteam-ot |
|
04:58
🔗
|
|
terorie has quit IRC (Read error: Operation timed out) |
|
05:02
🔗
|
|
terorie has joined #archiveteam-ot |
|
05:07
🔗
|
|
terorie has quit IRC (Ping timeout: 268 seconds) |
|
05:21
🔗
|
|
boutique_ has joined #archiveteam-ot |
|
05:24
🔗
|
|
boutique has quit IRC (Ping timeout: 252 seconds) |
|
05:26
🔗
|
|
boutique has joined #archiveteam-ot |
|
05:28
🔗
|
|
boutique has quit IRC (Read error: Connection reset by peer) |
|
05:28
🔗
|
|
boutique has joined #archiveteam-ot |
|
05:29
🔗
|
|
boutique_ has quit IRC (Ping timeout: 252 seconds) |
|
05:33
🔗
|
|
Stiletto has quit IRC (Ping timeout: 265 seconds) |
|
05:41
🔗
|
|
boutique_ has joined #archiveteam-ot |
|
05:45
🔗
|
|
boutique has quit IRC (Ping timeout: 252 seconds) |
|
05:45
🔗
|
voltagex_ |
where is the line between archiving and data hoarding? |
|
05:47
🔗
|
ivan_ |
a data hoarder is more of a person who is trying to fill up their too-many-hard drives with whatever they want |
|
05:47
🔗
|
ivan_ |
archiving pays some attention to the general value of the content and has some plan for future accessibility |
|
05:47
🔗
|
|
boutique has joined #archiveteam-ot |
|
05:48
🔗
|
ivan_ |
I guess the line is blurry in many cases |
|
05:49
🔗
|
ivan_ |
Brewster is just the best data hoarder :-) |
|
05:49
🔗
|
|
boutique_ has quit IRC (Ping timeout: 252 seconds) |
|
06:02
🔗
|
eientei95 |
ivan_: Data hoarding is just making the stuff for digital archaeologists to look through :P |
|
06:03
🔗
|
voltagex_ |
Well, my current issue is I need to reduce the stuff I have, and I've got ~100GB of a Tomorrowland livestream that probably shouldn't be lost. |
|
06:03
🔗
|
ivan_ |
you can put many petabytes into google drive |
|
06:04
🔗
|
voltagex_ |
I was hoping FOS could take it :P |
|
06:06
🔗
|
ivan_ |
you can also upload things directly to IA |
|
06:07
🔗
|
ivan_ |
https://archive.org/help/abouts3.txt |
|
06:07
🔗
|
voltagex_ |
legal grey area I guess |
|
06:07
🔗
|
voltagex_ |
not quite as bad as Nintendo but ID&T are a weird company. |
|
06:07
🔗
|
JAA |
Email Jason then, I guess. |
|
06:08
🔗
|
voltagex_ |
I've got to work out whether this video file is valid :/ |
|
06:08
🔗
|
voltagex_ |
plays in VLC != accessible in the future |
|
06:08
🔗
|
voltagex_ |
MPEG4-TS is an abomination. |
|
06:08
🔗
|
JAA |
Eww, yeah. |
|
06:12
🔗
|
voltagex_ |
hm, Xbox One plays it, and it's a strangely compliant player. |
|
06:13
🔗
|
|
boutique_ has joined #archiveteam-ot |
|
06:13
🔗
|
JAA |
There must be some tool which strictly checks whether a video file complies with the specifications, right? |
|
06:16
🔗
|
|
boutique has quit IRC (Ping timeout: 252 seconds) |
|
06:16
🔗
|
voltagex_ |
possibly. |
|
06:17
🔗
|
voltagex_ |
JAA: sigh. https://forum.doom9.org/showthread.php?s=028d37878e073193b81c74c58b06e01d&p=1067204#post1067204 |
|
06:18
🔗
|
JAA |
I'm not surprised. |
|
06:18
🔗
|
JAA |
Also, that thread is from 2007. |
|
06:20
🔗
|
|
boutique has joined #archiveteam-ot |
|
06:20
🔗
|
|
boutique_ has quit IRC (Ping timeout: 252 seconds) |
|
06:21
🔗
|
JAA |
Found a commercial tool: http://www.jongbel.com/automated-validation/media-validator/ |
|
06:23
🔗
|
voltagex_ |
149 EUR per month lol |
|
06:27
🔗
|
voltagex_ |
props to them for writing their own decoders instead of just using ffmpeg though |
|
06:30
🔗
|
|
JAA has quit IRC (leaving) |
|
06:34
🔗
|
|
JAA has joined #archiveteam-ot |
|
06:34
🔗
|
|
svchfoo3 sets mode: +o JAA |
|
06:35
🔗
|
|
bakJAA sets mode: +o JAA |
|
06:40
🔗
|
JAA |
voltagex_: So Stack Overflow recommends transcoding it to nothing with ffmpeg. I guess that works and ffmpeg should produce warnings and errors, but I'm not sure how strict it is. |
|
06:41
🔗
|
voltagex_ |
JAA: sorry, I didn't mean to take up your time on one of my rabbit holes |
|
06:41
🔗
|
voltagex_ |
we're all going to be underwater / on fire or both in the future, so it may not matter. |
|
06:47
🔗
|
|
DarkWorld has joined #archiveteam-ot |
|
07:16
🔗
|
|
terorie has joined #archiveteam-ot |
|
07:22
🔗
|
|
terorie has quit IRC (Ping timeout: 268 seconds) |
|
07:27
🔗
|
|
terorie has joined #archiveteam-ot |
|
08:29
🔗
|
|
m007a83_ has joined #archiveteam-ot |
|
08:30
🔗
|
|
m007a83 has quit IRC (Ping timeout: 252 seconds) |
|
08:34
🔗
|
|
m007a83_ is now known as m007a83 |
|
10:17
🔗
|
|
hook54321 has quit IRC (Quit: Connection closed for inactivity) |
|
10:37
🔗
|
|
terorie has quit IRC (Remote host closed the connection) |
|
10:37
🔗
|
|
terorie has joined #archiveteam-ot |
|
10:38
🔗
|
|
terorie has quit IRC (Client Quit) |
|
10:59
🔗
|
|
Stiletto has joined #archiveteam-ot |
|
11:08
🔗
|
|
DarkWorld has quit IRC (Leaving) |
|
11:20
🔗
|
|
BlueMax has quit IRC (Quit: Leaving) |
|
11:20
🔗
|
|
caff_ has quit IRC (Read error: Connection reset by peer) |
|
12:01
🔗
|
|
boutique has quit IRC (Quit: Leaving) |
|
12:07
🔗
|
|
vitzli has joined #archiveteam-ot |
|
12:15
🔗
|
VoynichCr |
JAA: https://github.com/emijrp/internet-archive/blob/master/archivebot.py |
|
12:16
🔗
|
VoynichCr |
that is the bot which updates tables in wiki |
|
12:16
🔗
|
VoynichCr |
it requires pywikibot (and configured) |
|
12:18
🔗
|
VoynichCr |
i can write detailed instructions if needed |
|
12:20
🔗
|
VoynichCr |
the scripts for the deaths and disestablishements pages are in the same repo |
|
12:43
🔗
|
ivan_ |
do people use pywb for looking inside WARCs or something else? |
|
12:43
🔗
|
* |
ivan_ spots https://github.com/webrecorder/webrecorder-player |
|
12:49
🔗
|
|
hook54321 has joined #archiveteam-ot |
|
12:49
🔗
|
|
svchfoo3 sets mode: +o hook54321 |
|
12:52
🔗
|
HCross |
ivan_: warcio |
|
12:52
🔗
|
HCross |
Because it doesn't need to load the entire warc into disk |
|
12:52
🔗
|
HCross |
Which makes working with megawarcs so much nicer |
|
12:53
🔗
|
ivan_ |
ah but this person wanted a thing to play them back / browse them |
|
12:53
🔗
|
ivan_ |
looks like pywb uses it |
|
12:56
🔗
|
|
Mateon1 has quit IRC (Read error: Operation timed out) |
|
12:56
🔗
|
|
Mateon1 has joined #archiveteam-ot |
|
13:00
🔗
|
|
vitzli has quit IRC (Quit: Leaving) |
|
13:14
🔗
|
JAA |
VoynichCr: Sweet, thanks, I'll have a look. I did look at pywikibot, but mwclient just seemed much more straightforward and Pythonic. My code is here if you're interested: https://github.com/JustAnotherArchivist/atwikibot/blob/master/currentwarriorproject.py |
|
13:18
🔗
|
JAA |
ivan_: I use pywb for WARC playback when I need it. Apart from the fact that it copies around the WARCs and doesn't easily let you avoid that (but anarcat is working on that at https://github.com/webrecorder/pywb/pull/409 ), it's pretty good. Often enough, I just look at the raw file with zless though. |
|
13:19
🔗
|
ivan_ |
thanks |
|
13:26
🔗
|
|
wp494 has quit IRC (Ping timeout: 268 seconds) |
|
13:26
🔗
|
|
wp494 has joined #archiveteam-ot |
|
13:26
🔗
|
|
svchfoo3 sets mode: +o wp494 |
|
13:31
🔗
|
|
Soni has joined #archiveteam-ot |
|
13:33
🔗
|
Soni |
hi |
|
13:36
🔗
|
|
jesso has joined #archiveteam-ot |
|
14:01
🔗
|
eientei95 |
[02:30:22] <Soni> we have phones now |
|
14:01
🔗
|
eientei95 |
[02:30:26] <Soni> they get thrown out every 3 months |
|
14:01
🔗
|
eientei95 |
https://www.youtube.com/watch?v=lW17rr20tGY |
|
14:04
🔗
|
anarcat |
JAA: i'm working on that? for the record i've been waiting for them to figure out if it's okay or not at this step, did i miss something? |
|
14:05
🔗
|
anarcat |
python-internetarchive just entered debian stable https://tracker.debian.org/pkg/python-internetarchive |
|
14:30
🔗
|
JAA |
anarcat: Yeah, "working on it" in a broader sense. |
|
14:31
🔗
|
JAA |
And great news regarding python-internetarchive! Thanks for that! |
|
14:31
🔗
|
JAA |
s/stable/unstable/ though :-) |
|
14:39
🔗
|
|
VerifiedJ has joined #archiveteam-ot |
|
14:43
🔗
|
JAA |
"Alex jones infowars - Do you have this?" |
|
14:43
🔗
|
JAA |
This is what you get via PM when you post in a popular thread on /r/DataHoarder. :-| |
|
15:06
🔗
|
eientei95 |
I prefer David Dees for my conspiracy nutjobs thanks |
|
15:24
🔗
|
|
t2t2 has quit IRC (Quit: t2t2) |
|
15:30
🔗
|
voltagex_ |
Hi anarcat - I recognise that handle |
|
15:34
🔗
|
|
t2t2 has joined #archiveteam-ot |
|
16:33
🔗
|
|
vitzli has joined #archiveteam-ot |
|
16:38
🔗
|
|
vitzli has quit IRC (Quit: Leaving) |
|
17:13
🔗
|
|
Kolam has joined #archiveteam-ot |
|
17:29
🔗
|
|
Verified_ has joined #archiveteam-ot |
|
17:31
🔗
|
|
bithippo has joined #archiveteam-ot |
|
17:32
🔗
|
|
VerifiedJ has quit IRC (Ping timeout: 252 seconds) |
|
17:39
🔗
|
|
chferfa has joined #archiveteam-ot |
|
17:56
🔗
|
|
Kolam has quit IRC (Quit: http://www.mibbit.com ajax IRC Client) |
|
18:24
🔗
|
schbirid |
what was that esp like board that can be powered by ambient wifi again? |
|
18:25
🔗
|
Soni |
are you gonna try to run an warrior on an ultra-low-power device that's powered by ambient wifi?! |
|
18:25
🔗
|
|
adinbied has quit IRC (Read error: Operation timed out) |
|
18:26
🔗
|
schbirid |
lol no that would not work |
|
18:26
🔗
|
Soni |
would be cool if it did |
|
18:26
🔗
|
Soni |
I mean, just program 100s of them and put them on all sorts of places with free wifi |
|
18:28
🔗
|
|
adinbied has joined #archiveteam-ot |
|
18:29
🔗
|
schbirid |
that would be a great way to get people against the warrior and internet archival projects |
|
18:30
🔗
|
schbirid |
so please dont ever abuse services like that |
|
18:30
🔗
|
schbirid |
! |
|
18:30
🔗
|
schbirid |
(yes i get the idea and i like it but the consequences would be bad) |
|
18:33
🔗
|
kiska |
Free wifi = bad |
|
18:34
🔗
|
kiska |
Captive pages = bad |
|
18:37
🔗
|
Soni |
okay |
|
18:37
🔗
|
Soni |
most of the world runs on HTTPS these days, so it should be fine |
|
18:38
🔗
|
kiska |
You do know what a captive page is right? |
|
18:43
🔗
|
Soni |
yeah |
|
18:43
🔗
|
Soni |
it hijacks HTTP connections |
|
18:43
🔗
|
Soni |
which are not HTTPS connections |
|
18:44
🔗
|
kiska |
Captive portals don't care if you have a https connection or not, captive pages force their way to your screen |
|
18:44
🔗
|
schbirid |
if you try to access a https site, a captive portal can only make the connection fail |
|
18:44
🔗
|
schbirid |
afaik |
|
18:46
🔗
|
kiska |
So instead of helping us, it will only be polluting the eventual warcs |
|
18:46
🔗
|
bithippo |
This ^^^ |
|
18:46
🔗
|
JAA |
Note that we often have certificate validation turned off because target sites may have expired certs etc. |
|
18:47
🔗
|
JAA |
In that case, the captive portal would happily hijack any HTTPS connection. |
|
18:47
🔗
|
bithippo |
soni: ArchiveTeam operations rely on clean connectivity. The cost of traditional compute and network is cheap compared to possible ingesting garbage because of non-quality connectivity. |
|
18:49
🔗
|
bithippo |
In an ideal world, we'd archive from within web property infra or at their network edge. |
|
18:49
🔗
|
Soni |
okay |
|
19:05
🔗
|
Soni |
so uh, have y'all tried BGP hijacking? |
|
19:08
🔗
|
schbirid |
uh http://petecogle.co.uk/blog/2018/12/14/free-music-archives-new-home-kitsplit/ |
|
19:09
🔗
|
schbirid |
sorry, direct link http://freemusicarchive.org/member/cheyenne_h/blog/Free_Music_Archives_new_home_KitSplit |
|
19:10
🔗
|
Soni |
(like, when you need lots of IPs, just make them with BGP?) |
|
19:10
🔗
|
kiska |
JAA Kaz HCross hook54321: pls kick Soni |
|
19:11
🔗
|
schbirid |
script kiddies go to #kindergarten please |
|
19:11
🔗
|
Kaz |
sigh |
|
19:11
🔗
|
Soni |
? |
|
19:11
🔗
|
Soni |
why? |
|
19:11
🔗
|
schbirid |
archiveteam is not doing illegal shit |
|
19:12
🔗
|
Soni |
this is illegal? |
|
19:12
🔗
|
kiska |
yes |
|
19:12
🔗
|
Soni |
really? |
|
19:12
🔗
|
Kaz |
Soni: I'm not sure if you're stupid or just a troll, but this ends now |
|
19:14
🔗
|
Soni |
:/ |
|
19:16
🔗
|
|
miked has joined #archiveteam-ot |
|
19:16
🔗
|
|
Kaz was kicked by hook54321 (Kaz) |
|
19:16
🔗
|
|
hook54321 sets mode: +b *!*@autism.nbextension.download |
|
19:17
🔗
|
|
Kaz has joined #archiveteam-ot |
|
19:17
🔗
|
Kaz |
i mean.. close |
|
19:17
🔗
|
|
hook54321 sets mode: +b soni!*@* |
|
19:17
🔗
|
|
hook54321 sets mode: +o kiska |
|
19:17
🔗
|
|
hook54321 sets mode: +o Kaz |
|
19:17
🔗
|
|
Soni was kicked by Kaz (Soni) |
|
19:17
🔗
|
kiska |
thanks |
|
19:17
🔗
|
schbirid |
lol |
|
19:17
🔗
|
|
Kaz sets mode: +b #archivet!*@* |
|
19:17
🔗
|
Kaz |
uh |
|
19:18
🔗
|
|
Kaz sets mode: -b #archivet!*@* |
|
19:18
🔗
|
schbirid |
our ops are competent <3 |
|
19:18
🔗
|
schbirid |
:) |
|
19:18
🔗
|
bithippo |
THANK YOU |
|
19:19
🔗
|
kiska |
I've had my dose of stupid today |
|
19:20
🔗
|
hook54321 |
We might want to try to check if he's been running the warrior, if possible |
|
19:23
🔗
|
|
MrRadar2 has quit IRC (Quit: Rebooting) |
|
19:25
🔗
|
|
MrRadar2 has joined #archiveteam-ot |
|
19:32
🔗
|
|
t3 has quit IRC () |
|
19:36
🔗
|
|
teej_ has joined #archiveteam-ot |
|
20:45
🔗
|
|
BlueMax has joined #archiveteam-ot |
|
21:31
🔗
|
|
mgrytbak^ is now known as mgrytbak |
|
22:20
🔗
|
|
BlueMax has quit IRC (Read error: Connection reset by peer) |
|
22:22
🔗
|
|
BlueMax has joined #archiveteam-ot |
|
22:25
🔗
|
|
wp494 has quit IRC (Ping timeout: 255 seconds) |
|
22:25
🔗
|
|
wp494 has joined #archiveteam-ot |
|
22:26
🔗
|
|
svchfoo3 sets mode: +o wp494 |
|
22:37
🔗
|
|
ubahn_ has quit IRC (Quit: ubahn_) |
|
23:41
🔗
|
|
Cypher has joined #archiveteam-ot |