#archiveteam-bs 2017-08-05,Sat

↑back Search ←Prev date Next date→ Show only urls(Click on time to select a line by its url)

WhoWhatWhen
***pizzaiolo has joined #archiveteam-bs [00:02]
.... (idle for 16mn)
bsmith093sho: gpodder, it even has a cli shell with gpo
literally what it was built to do
[00:18]
***Aranje has quit IRC (Quit: Three sheets to the wind) [00:19]
bitBaron has quit IRC (Quit: My computer has gone to sleep. ZZZzzz…) [00:30]
zinogodane: Good job [00:31]
***drumstick has quit IRC (Read error: Operation timed out) [00:35]
.... (idle for 17mn)
odemggodane, https://www.reddit.com/r/DataHoarder/comments/6r3dc5/youtube_request_so_the_channel_that_one_video/dl2jow8
godane, started uploading those playlists already derive is taking it's sweet time so they are their but 'unpublished' so far
[00:52]
godane, last uploaded was: https://archive.org/history/youtube-20hZnSkhDgs [01:00]
***schbirid2 has joined #archiveteam-bs
Swizzle has joined #archiveteam-bs
schbirid has quit IRC (Read error: Operation timed out)
[01:10]
...... (idle for 26mn)
Asparagir has joined #archiveteam-bs [01:40]
j08nY has quit IRC (Quit: Leaving) [01:46]
Swizzle has quit IRC (Quit: Leaving)
drumstick has joined #archiveteam-bs
dboard has quit IRC (Remote host closed the connection!)
dboard has joined #archiveteam-bs
dboard has quit IRC (Read error: Connection reset by peer)
[01:57]
dboard has joined #archiveteam-bs
dboard has quit IRC (Connection closed)
[02:12]
godaneodemg: i found tons of Late Night with David Letterman on youtube [02:13]
odemghow much is tons [02:13]
godanehttps://www.youtube.com/channel/UCqkkzIyGnwkEShBIGYRRgqQ/videos [02:14]
odemghe's currently uploading too.. expect more!! [02:15]
godanethat is at least over 100+ videos there
i know
[02:15]
odemg512 videos
https://pastebin.com/raw/stksw9Y7
[02:17]
***dboard2 has joined #archiveteam-bs [02:20]
drumstick has quit IRC (Read error: Operation timed out)
drumstick has joined #archiveteam-bs
[02:32]
pizzaiolo has quit IRC (Quit: pizzaiolo) [02:50]
..... (idle for 21mn)
drumstick has quit IRC (Read error: Operation timed out) [03:11]
..... (idle for 24mn)
kristian_ has joined #archiveteam-bs [03:35]
......... (idle for 42mn)
kristian_ has quit IRC (Quit: Leaving)
wabu has quit IRC (Read error: Operation timed out)
[04:17]
.... (idle for 15mn)
wabu has joined #archiveteam-bs
Sk1d has quit IRC (Ping timeout: 250 seconds)
[04:33]
Sk1d has joined #archiveteam-bs [04:44]
..... (idle for 20mn)
drumstick has joined #archiveteam-bs [05:04]
............... (idle for 1h11mn)
Asparagir has quit IRC (Asparagir)
Mateon1 has quit IRC (Read error: Operation timed out)
Mateon1 has joined #archiveteam-bs
[06:15]
qw3rty14 has joined #archiveteam-bs
qw3rty13 has quit IRC (Read error: Operation timed out)
[06:23]
.... (idle for 16mn)
odemg has quit IRC (Read error: Operation timed out) [06:43]
drumstick has quit IRC (Read error: Operation timed out)
REiN^ has joined #archiveteam-bs
odemg has joined #archiveteam-bs
[06:49]
.......... (idle for 48mn)
BlueMaxim has quit IRC (Read error: Operation timed out)
BlueMaxim has joined #archiveteam-bs
drumstick has joined #archiveteam-bs
[07:44]
........... (idle for 50mn)
drumstick has quit IRC (Ping timeout: 633 seconds) [08:39]
.... (idle for 18mn)
godaneleffi: i have a problem with the youtube comment downloader not taking youtube id with dash (-) in front of them
-- doesn't work
\ doesn't work
" and ' don't work
[08:57]
schbirid2godane: do you run it in a linux shell?
if so, try putting a single - in front of the id and have the id to be the last thing in your line. eg "youtube-dl -this --that - -ABCA" if "-ABCA" was such id
[09:00]
***drumstick has joined #archiveteam-bs [09:02]
godanepython downloader.py --youtube-dl -KaK2SOsiw4 --output -KaK2SOsiw4.json
i'm using this: https://github.com/egbertbouman/youtube-comment-downloader
[09:10]
schbirid2ah crap, two times the id
ah, wont work here
[09:11]
.......... (idle for 47mn)
HCross2Hmm. Anyone here good with Heritrix at all please? [10:00]
....... (idle for 31mn)
***BlueMaxim has quit IRC (Read error: Operation timed out)
BlueMaxim has joined #archiveteam-bs
[10:31]
..... (idle for 23mn)
j08nY has joined #archiveteam-bs [10:55]
username1 has joined #archiveteam-bs [11:09]
schbirid2 has quit IRC (Read error: Operation timed out) [11:14]
drumstick has quit IRC (Ping timeout: 246 seconds) [11:28]
....... (idle for 32mn)
odemg has quit IRC (Read error: Operation timed out) [12:00]
.... (idle for 16mn)
odemg has joined #archiveteam-bs [12:16]
....... (idle for 32mn)
kristian_ has joined #archiveteam-bs [12:48]
Stiletti has quit IRC (Read error: Connection reset by peer)
Stiletti has joined #archiveteam-bs
[12:53]
.... (idle for 19mn)
BlueMaxim has quit IRC (Read error: Operation timed out) [13:12]
j08nY has quit IRC (Quit: Leaving) [13:18]
..... (idle for 20mn)
Stiletti has quit IRC (Read error: Operation timed out)
Stiletti has joined #archiveteam-bs
[13:38]
...... (idle for 25mn)
Stiletti has quit IRC (Read error: Operation timed out)
Stiletti has joined #archiveteam-bs
[14:03]
...... (idle for 27mn)
Stiletti has quit IRC (Read error: Operation timed out)
Stiletti has joined #archiveteam-bs
[14:30]
...... (idle for 27mn)
kristian_ has quit IRC (Quit: Leaving) [14:57]
.............. (idle for 1h6mn)
Stiletti has quit IRC (Read error: Operation timed out)
Stiletti has joined #archiveteam-bs
[16:03]
........ (idle for 39mn)
Stiletti has quit IRC (Read error: Operation timed out)
Stiletti has joined #archiveteam-bs
[16:42]
pizzaiolo has joined #archiveteam-bs
pizzaiolo has left
[16:52]
.... (idle for 18mn)
Mateon1 has quit IRC (Remote host closed the connection)
Mateon1 has joined #archiveteam-bs
pizzaiolo has joined #archiveteam-bs
[17:11]
JensRex has quit IRC (Remote host closed the connection)
JensRex has joined #archiveteam-bs
[17:24]
......... (idle for 44mn)
Stiletti has quit IRC (Read error: Operation timed out)
Stiletti has joined #archiveteam-bs
Mateon1 has quit IRC (Ping timeout: 250 seconds)
[18:08]
..... (idle for 23mn)
Stiletti has quit IRC (Read error: Operation timed out)
Stiletti has joined #archiveteam-bs
[18:34]
............ (idle for 55mn)
Aranje has joined #archiveteam-bs [19:29]
Asparagir has joined #archiveteam-bs [19:42]
mundusAnyone have a suggested tool for crawling urls off sites? [19:50]
AsparagirDo you mean crawling links that a website *links to*? Or only the site itself, plus its outbound links? If the latter, try wpull. [19:59]
mundusthe latter
Okay
[20:01]
.... (idle for 15mn)
***Mateon1 has joined #archiveteam-bs [20:17]
.... (idle for 17mn)
pikhq has quit IRC (Ping timeout: 245 seconds) [20:34]
pikhq has joined #archiveteam-bs [20:40]
username1 is now known as schbirid [20:53]
.... (idle for 16mn)
Stiletti has quit IRC (Read error: Operation timed out)
Stiletti has joined #archiveteam-bs
[21:09]
......... (idle for 44mn)
JAAAsparagir: "don't need much computing power" -- I thought others (FalconK?) said before that pipelines were mainly CPU-bound? [21:53]
AsparagirI've been okay running the 4 GB memory / 60 GB disk space droplets on Digital Ocean. But bigger is better, especially if they happen to be running a lot of phantomjs jobs.
But you can't really control if you happen to get a lot of those phantomjs jobs or not.
Also, wpull has known (but still not patched) memory leaks. So you need a little wiggle room...and probably need to restart the whole shebang once every few months.
[21:54]
JAAHopefully more often for security updates
But yeah
[21:58]
....... (idle for 31mn)
***Administr has joined #archiveteam-bs
HCross has quit IRC (Ping timeout: 268 seconds)
[22:29]
Administr has quit IRC (Ping timeout: 268 seconds)
HarryCros has joined #archiveteam-bs
[22:43]
drumstick has joined #archiveteam-bs [22:55]
HCross has joined #archiveteam-bs
HarryCros has quit IRC (Ping timeout: 268 seconds)
[23:04]
...... (idle for 26mn)
HCross has quit IRC (Read error: Connection reset by peer)
HarryCros has joined #archiveteam-bs
[23:31]

↑back Search ←Prev date Next date→ Show only urls(Click on time to select a line by its url)