Time |
Nickname |
Message |
00:22
🔗
|
|
BlueMax has joined #archiveteam-bs |
00:27
🔗
|
|
Silas has joined #archiveteam-bs |
00:40
🔗
|
|
bsmith093 has quit IRC (Read error: Connection reset by peer) |
00:41
🔗
|
|
bsmith093 has joined #archiveteam-bs |
00:55
🔗
|
Silas |
What is the best way to download and upload youtube videos? Tubeup with youtube-dl seems to be the most convenient, but youtube-dl doesn't seem to offer support for WARCs. Wget is my preferred tool for stuff like this, but it would be more of a hassle to download and upload videos neatly. Is there any way to have youtube-dl output WARCs or tubeup to use wget? |
00:58
🔗
|
Flashfire |
No clue |
00:58
🔗
|
Flashfire |
That is all |
01:18
🔗
|
moufu |
you could use youtube-dl with warcprox for warc output, but I don't know if it would be replayable |
01:18
🔗
|
moufu |
probably better to just use tubeup |
01:30
🔗
|
|
ndiddy_ has quit IRC () |
01:45
🔗
|
Silas |
Ok, thank you! |
02:42
🔗
|
|
Silas has quit IRC (Quit: http://www.mibbit.com ajax IRC Client) |
03:38
🔗
|
|
w0rmhole has joined #archiveteam-bs |
03:41
🔗
|
|
odemg has quit IRC (Ping timeout: 260 seconds) |
03:42
🔗
|
|
RichardG has joined #archiveteam-bs |
03:43
🔗
|
|
RichardG_ has quit IRC (Read error: Connection reset by peer) |
03:53
🔗
|
|
odemg has joined #archiveteam-bs |
07:26
🔗
|
|
Sk1d has quit IRC (Read error: Operation timed out) |
07:26
🔗
|
|
Sk2d has joined #archiveteam-bs |
07:26
🔗
|
|
Sk2d is now known as Sk1d |
08:19
🔗
|
|
schbirid has joined #archiveteam-bs |
08:55
🔗
|
|
second has quit IRC (Read error: Operation timed out) |
09:34
🔗
|
|
jut has quit IRC (Read error: Operation timed out) |
09:42
🔗
|
|
jut has joined #archiveteam-bs |
10:00
🔗
|
|
Soni has quit IRC (Ping timeout: 258 seconds) |
11:29
🔗
|
|
Mateon1 has quit IRC (Read error: Operation timed out) |
12:04
🔗
|
|
Mateon1 has joined #archiveteam-bs |
12:55
🔗
|
|
BlueMax has quit IRC (Quit: Leaving) |
13:42
🔗
|
|
bitBaron has joined #archiveteam-bs |
14:30
🔗
|
Vito` |
Does WARC even make sense for HLS/DASH/reassembled YouTube videos? Most of the time youtube-dl picks streams it has to use FFMPEG to reassemble, so even though the WARC might have a record of the download, it's not representative of the file that was generated at the end, and you wouldn't be able to assemble it from the WARC information alone, you'd still need to know whatever ffmpeg was doing. |
14:42
🔗
|
|
zhongfu has quit IRC (Ping timeout: 260 seconds) |
14:44
🔗
|
|
Mateon1 has quit IRC (Ping timeout: 252 seconds) |
14:45
🔗
|
|
zhongfu has joined #archiveteam-bs |
14:45
🔗
|
|
bitBaron has quit IRC (Quit: My computer has gone to sleep. 😴😪ZZZzzz…) |
14:49
🔗
|
Meroje |
Vito` you can reassemble the final file from the manifest and the parts |
14:51
🔗
|
|
Mateon1 has joined #archiveteam-bs |
14:58
🔗
|
Vito` |
Meroje: ffmpeg isn't further analyzing anything, it's just going on the m3u8 or whatever? |
15:06
🔗
|
Vito` |
I see pywb is editing the manifest so it's just the streams you downloaded |
15:09
🔗
|
jmtd |
can I ask someone to throw something into the crawler pls? https://blogs.ncl.ac.uk/compscisupport/ likely to be deleted |
15:17
🔗
|
|
RichardG_ has joined #archiveteam-bs |
15:18
🔗
|
|
RichardG has quit IRC (Read error: Connection reset by peer) |
15:23
🔗
|
|
jmtd is now known as Jon |
15:24
🔗
|
|
TC04 has joined #archiveteam-bs |
15:25
🔗
|
|
TC01 has quit IRC (Ping timeout: 268 seconds) |
15:38
🔗
|
|
bitBaron has joined #archiveteam-bs |
15:59
🔗
|
fenn |
it would be nice to have WARCs for browsing youtube descriptions and comments in the wayback machine for an entire channel or playlist |
15:59
🔗
|
fenn |
(not even downloading the video) |
16:10
🔗
|
|
bitBaron has quit IRC (Quit: Bye.) |
16:39
🔗
|
SketchCow |
Hi, where's ohdemgirls |
17:03
🔗
|
|
wp494 has quit IRC (Ping timeout: 268 seconds) |
17:04
🔗
|
|
wp494 has joined #archiveteam-bs |
17:14
🔗
|
|
ndiddy has joined #archiveteam-bs |
17:25
🔗
|
|
PhrackD has joined #archiveteam-bs |
17:30
🔗
|
PhrackD |
Thanks, JAA. It's not clear from https://www.archiveteam.org/index.php?title=The_WARC_Ecosystem what is working and what isn't. |
17:32
🔗
|
godane |
so this fixed wifi on rpi3 using wlan0 : https://www.raspberrypi.org/forums/viewtopic.php?t=177629#p1132992 |
17:40
🔗
|
JAA |
PhrackD: wpull and pywb work reasonably well for grabbing and displaying the archives, respectively. For wpull, you'll want either version 1.2.3 or a fork (FalconK's or mine); 2.0.1 is very unstable and hardly usable. |
17:41
🔗
|
JAA |
wget works also, but it's not very customisable. With wpull, you can always write a hook script for modifying its behaviour, e.g. adding URLs manually, filtering, etc. |
17:42
🔗
|
|
caff has joined #archiveteam-bs |
17:45
🔗
|
PhrackD |
JAA: I did notice you had a fork that was far ahead of chfoo's. |
18:17
🔗
|
JAA |
PhrackD: Not that far actually, just one commit in addition to FalconK's fixes from early last year. I have a version which has a lot of additional stuff, but it's not on GitHub (or anywhere else except for my machines) yet. |
18:19
🔗
|
JAA |
I should work on that again sometime. |
18:23
🔗
|
|
Pixi has quit IRC (Quit: Pixi) |
18:28
🔗
|
|
Pixi has joined #archiveteam-bs |
18:50
🔗
|
|
sep332 has joined #archiveteam-bs |
19:15
🔗
|
|
Dark_Star has quit IRC (Ping timeout: 492 seconds) |
19:15
🔗
|
|
Dark_Star has joined #archiveteam-bs |
19:19
🔗
|
|
chferfa has joined #archiveteam-bs |
20:46
🔗
|
|
chferfa has quit IRC () |
22:34
🔗
|
|
second has joined #archiveteam-bs |
23:30
🔗
|
|
BlueMax has joined #archiveteam-bs |