#archiveteam-bs 2015-12-31,Thu


Time Nickname Message
00:14 🔗 antomatic (nods)
00:15 🔗 antomatic I've started backing up/mirroring various YouTube channels (and my own favourites list) because you just can't rely on anything staying up for any significant time any more
00:27 🔗 JesseW has joined #archiveteam-bs
00:46 🔗 kyan DFJustin, Microgur1: I'll add this to the wiki, but here's a one-liner for getting youtube (needs warcprox running on localhost:8000): youtube-dl --title --continue --retries 100 --write-info-json --write-description --write-thumbnail --proxy="localhost:8000" --write-annotations --all-subs --no-check-certificate --ignore-errors -k -f bestvideo+bestaudio/best
01:07 🔗 JetBalsa has quit IRC (Quit: - nbs-irc 2.39 - www.nbs-irc.net -)
01:08 🔗 JesseW PurpleSym: much better metadata for the yahoo grab, thank you
01:09 🔗 JesseW number of messages per group would be nice, and earliest and latest date of messages (although that would be more of a hassle to get).
01:40 🔗 godane i
01:40 🔗 godane i
01:40 🔗 godane i'm going to be uploading more mbc newsdesk
01:45 🔗 Sketchcow I got 9000 of Springer.
01:47 🔗 jleclanch has joined #archiveteam-bs
01:47 🔗 jleclanch fine
01:47 🔗 JesseW jleclanch: hi!
01:47 🔗 jleclanch hi :)
01:47 🔗 JesseW Sketchcow: Over 9000? :-)
01:48 🔗 * JesseW is trying to figure out how to install a particular version of a ruby gem, to get the tests for archivebot's bot to run
01:49 🔗 jleclanch JesseW: those download tests take so long >.>
01:50 🔗 JesseW yeah, they do
01:50 🔗 aaaaaaaaa JesseW: I believe -v sem.var is the
01:51 🔗 aaaaaaaaa proper option for that
01:52 🔗 JesseW --version seems to do the trick. thanks, though
01:54 🔗 Sketchcow I am sure someone will make insane torrents of springer. Let me know if they do.
01:56 🔗 godane you guys may get the official playstation magazine
01:56 🔗 Sketchcow But I have to turn back to the other opportunities.
01:56 🔗 godane from russia
01:56 🔗 Sketchcow Maybe one day I can finally clear all the uploads on FOS
01:56 🔗 aaaaaaaaa yeah, the day that machine is retired.
01:57 🔗 JesseW has quit IRC (Leaving.)
01:57 🔗 godane or dies like the printer in office space
01:58 🔗 godane https://www.youtube.com/watch?v=pD2xBXm4y70
02:19 🔗 username1 has joined #archiveteam-bs
02:24 🔗 schbirid2 has quit IRC (Ping timeout: 311 seconds)
02:24 🔗 godane looks like the Korean trailer for Pearl Harbor is on one of the mbc newsdesk episodes
02:25 🔗 godane at the end of 2001-05-21 episode
02:57 🔗 * kyan wants to know how to see the live chat from youtube live streams after the stream has ended. E.g., to see my comments I made on the IA telethon last year (?).
02:59 🔗 godane and it's starting to be uploaded: https://archive.org/details/MBC_Newsdesk_20010411
03:16 🔗 aaaaaaaaa has quit IRC (Leaving)
04:02 🔗 dashcloud joepie91: have you seen this story yet? https://decorrespondent.nl/3789/Operation-Easy-Chair-or-how-a-little-company-in-Holland-helped-the-CIA-bug-the-Russians/116534484-2a3d7f11
05:18 🔗 BlueMaxim has joined #archiveteam-bs
05:24 🔗 vitzli has joined #archiveteam-bs
05:42 🔗 VADemon has quit IRC (left4dead)
05:49 🔗 FAMAS has joined #archiveteam-bs
06:18 🔗 FAMAS has quit IRC (Read error: Operation timed out)
06:27 🔗 godane i'm starting to upload good morning pops: https://archive.org/details/kbsradio-2fm-coolfm-gmp-2009-02-12
06:31 🔗 FAMAS has joined #archiveteam-bs
07:00 🔗 FAMAS2 has joined #archiveteam-bs
07:01 🔗 FAMAS has quit IRC (Read error: Connection reset by peer)
07:02 🔗 yipdw it's 2015, and I still can't upload a large file on my internet connection without everything else slowing down
07:03 🔗 yipdw and I still can't read recorded video via a CF card reader without Linux crapping out other I/O in odd ways
07:03 🔗 pikhq It might help a bit to have fq_codel set up on your router.
07:03 🔗 yipdw what even is that
07:03 🔗 yipdw oh bufferbloat-related
07:03 🔗 pikhq IP queue management algorithm, fixes bufferbloat rather effectively.
07:04 🔗 yipdw I'm starting to think I really gotta set up my own router
07:04 🔗 pikhq Unfortunately, having it set up means openwrt or building your own Linux router.
07:04 🔗 yipdw yeah, I've been told I need to do this a lot, maybe 2016 is the year I finally do so
07:04 🔗 pikhq i.e. it's nice if you can get it set up, but it's a bit of a pain to do so.
07:05 🔗 pikhq But, I run fq_codel and now it doesn't matter what I do with my Internet connection, everything works reasonably.
07:05 🔗 yipdw that does sound really nice
07:05 🔗 pikhq If there's some huge download then my connection technically "slows down", but it ends up in a fair sharing situation.
07:05 🔗 espes___ why the fuck doesn't everything use that by default then
07:05 🔗 yipdw ^
07:05 🔗 pikhq espes___: Because it's new and home router makers suck.
07:06 🔗 espes___ can isps deploy it
07:06 🔗 pikhq Yes.
07:06 🔗 pikhq It's upstream in Linux.
07:07 🔗 yipdw I'll read more on it; it'd be super-nice to be able to see what I'm typing in the chat without a 1-second delay after I hit Enter
07:08 🔗 * yipdw has modest goals
07:09 🔗 pikhq Welp, fq_codel can help that quite well.
07:12 🔗 yipdw ooh, there's some names I recognize on codel
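For readers who want to try what pikhq describes, here is a minimal sketch of enabling fq_codel on a Linux machine. It assumes the iproute2 `tc` tool is installed, that the uplink interface is called `eth0` (a placeholder, not something from the log), and that the real command is run as root. The helper names `fq_codel_command` and `apply_fq_codel` are hypothetical; by default the sketch only prints the command so it can be inspected first.

```python
import subprocess


def fq_codel_command(interface="eth0"):
    """Return the tc argv that installs fq_codel as the root qdisc.

    'eth0' is an assumed interface name; substitute the real uplink device.
    """
    return ["tc", "qdisc", "replace", "dev", interface, "root", "fq_codel"]


def apply_fq_codel(interface="eth0", dry_run=True):
    """Print the command, and run it only when dry_run is False (needs root)."""
    cmd = fq_codel_command(interface)
    print(" ".join(cmd))
    if not dry_run:
        subprocess.run(cmd, check=True)
    return cmd


if __name__ == "__main__":
    apply_fq_codel()  # dry run: only prints the command
```

The qdisc generally helps most on the device where the queue actually builds up, which is usually the router's uplink rather than a desktop behind it. That would fit pikhq's remark about needing openwrt or a self-built Linux router.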
07:26 🔗 FAMAS2 is now known as FAMA
07:26 🔗 FAMA is now known as FAMAS
07:28 🔗 FAMAS is there any person within the archiveteam fora who practices the techniques of video screenshotting?
07:30 🔗 Stiletto has quit IRC (Read error: Operation timed out)
07:44 🔗 FAMAS has quit IRC (Read error: Connection reset by peer)
08:17 🔗 PurpleSym JW_work: #messages in this item/WARC /= #messages in this group. WARCs are split whenever they hit 1GB and not after finishing a group.
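PurpleSym's point can be sketched in code: because a WARC is cut when it reaches roughly 1GB rather than at a group boundary, a per-group message total has to be summed across files. The input shape below (one dict of group name to message count per WARC) and the group names are invented for illustration; the log does not show the grab's actual metadata format.

```python
from collections import Counter


def messages_per_group(per_warc_counts):
    """Sum message counts per group across every WARC in an item."""
    totals = Counter()
    for counts in per_warc_counts:
        # Counter.update adds the counts instead of overwriting them.
        totals.update(counts)
    return dict(totals)


# Hypothetical data: 'group-b' straddles the split between two WARCs.
example = [
    {"group-a": 120, "group-b": 40},
    {"group-b": 15, "group-c": 7},
]
print(messages_per_group(example))
```

Reading either WARC alone would undercount `group-b`, which is why a per-item count is not the same thing as a per-group count.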
08:18 🔗 wp494 has quit IRC (Quit: LOUD UNNECESSARY QUIT MESSAGES)
08:21 🔗 wp494 has joined #archiveteam-bs
09:30 🔗 dashcloud has quit IRC (Read error: Operation timed out)
09:30 🔗 FAMAS has joined #archiveteam-bs
09:37 🔗 dashcloud has joined #archiveteam-bs
09:49 🔗 username1 is now known as schbirid
09:56 🔗 FAMAS2 has joined #archiveteam-bs
09:57 🔗 FAMAS has quit IRC (Quit: KVIrc 4.3.2 Aria http://www.kvirc.net/)
09:57 🔗 FAMAS2 is now known as FAMAS
11:06 🔗 schbirid anyone wanna mirror ebooks i grabbed at 32c3? speak in the next hour (no chance if i dont know your nick ;) )
11:08 🔗 schbirid *german
11:13 🔗 vitzli "mirror" as in "public mirror" or just get a copy?
11:13 🔗 FAMAS has quit IRC (Read error: Connection reset by peer)
11:13 🔗 FAMAS has joined #archiveteam-bs
11:17 🔗 schbirid rsync from my box before i delete it ;)
11:20 🔗 vitzli schbirid, may I pm you?
11:20 🔗 schbirid sure
11:29 🔗 arkhive has quit IRC (Read error: Connection reset by peer)
12:03 🔗 FAMAS has quit IRC (Read error: Connection reset by peer)
12:06 🔗 godane so i found something old
12:07 🔗 godane a magazine called Table Tennis News
12:22 🔗 BlueMaxim has quit IRC (Quit: Leaving)
12:29 🔗 tjg has quit IRC (Ping timeout: 260 seconds)
12:30 🔗 tjg has joined #archiveteam-bs
12:35 🔗 tjg has quit IRC (Ping timeout: 260 seconds)
12:39 🔗 tjg has joined #archiveteam-bs
13:12 🔗 vitzli has quit IRC (Quit: Leaving)
13:34 🔗 tjg has quit IRC (Read error: Connection reset by peer)
13:35 🔗 tjg has joined #archiveteam-bs
13:50 🔗 tjg has quit IRC (Ping timeout: 260 seconds)
13:51 🔗 tjg has joined #archiveteam-bs
14:02 🔗 tjg has quit IRC (Ping timeout: 260 seconds)
14:24 🔗 tjg has joined #archiveteam-bs
14:36 🔗 dashcloud has quit IRC (Read error: Operation timed out)
14:36 🔗 ndiddy godane, that's amazing
14:38 🔗 limebyte much fireworkz
14:38 🔗 limebyte such booom
14:39 🔗 dashcloud has joined #archiveteam-bs
14:52 🔗 ohhdemgir has quit IRC (Read error: Operation timed out)
14:55 🔗 FAMAS has joined #archiveteam-bs
14:59 🔗 tjg has quit IRC (Ping timeout: 260 seconds)
14:59 🔗 tjg has joined #archiveteam-bs
15:04 🔗 tjg has quit IRC (Ping timeout: 260 seconds)
15:07 🔗 tjg has joined #archiveteam-bs
15:20 🔗 tjg has quit IRC (Ping timeout: 260 seconds)
15:27 🔗 tjg has joined #archiveteam-bs
15:35 🔗 tjg has quit IRC (Ping timeout: 260 seconds)
15:36 🔗 tjg has joined #archiveteam-bs
15:41 🔗 tjg has quit IRC (Ping timeout: 260 seconds)
15:46 🔗 tjg has joined #archiveteam-bs
16:06 🔗 tjg has quit IRC (Ping timeout: 260 seconds)
16:06 🔗 tjg has joined #archiveteam-bs
16:26 🔗 FAMAS has quit IRC (Quit: http://chat.efnet.org (EOF))
16:28 🔗 tjg has quit IRC (Ping timeout: 260 seconds)
16:29 🔗 Stiletto has joined #archiveteam-bs
16:30 🔗 tjg has joined #archiveteam-bs
16:52 🔗 tjg has quit IRC (Ping timeout: 260 seconds)
16:53 🔗 tjg has joined #archiveteam-bs
16:58 🔗 tjg has quit IRC (Ping timeout: 260 seconds)
16:58 🔗 tjg has joined #archiveteam-bs
17:13 🔗 tjg has quit IRC (Ping timeout: 260 seconds)
17:13 🔗 tjg has joined #archiveteam-bs
17:20 🔗 tjg has quit IRC (Ping timeout: 260 seconds)
17:24 🔗 tjg has joined #archiveteam-bs
17:25 🔗 JesseW has joined #archiveteam-bs
17:29 🔗 tjg has quit IRC (Ping timeout: 260 seconds)
17:32 🔗 SimpBrain has quit IRC (Leaving)
17:32 🔗 SimpBrain has joined #archiveteam-bs
17:33 🔗 tjg has joined #archiveteam-bs
17:50 🔗 JesseW https://openlibrary.org/works/OL3282859W/Music_the_brain_and_ecstasy <- is mistakenly linked to a treatise on moral philosophy from the 1800s. I've sent a note to the openlibrary folks, but thought it worth mentioning here, if only for the amusement value.
17:50 🔗 tjg has quit IRC (Ping timeout: 260 seconds)
17:53 🔗 tjg has joined #archiveteam-bs
18:00 🔗 wyatt8740 has quit IRC (Read error: Operation timed out)
18:06 🔗 JesseW Is archive.is down for anyone else?
18:08 🔗 tjg has quit IRC (Ping timeout: 260 seconds)
18:08 🔗 tjg has joined #archiveteam-bs
18:08 🔗 wyatt8740 has joined #archiveteam-bs
18:15 🔗 Microgur1 when I try to archive an entire user, I get a 410 error "$ youtube-dl --title --continue --retries 4 --write-info-json --write-description --write-thumbnail --write-annotations --all-subs --ignore-errors -f bestvideo+bestaudio https://www.youtube.com/user/TheAn1meMan/
18:15 🔗 Microgur1 [download] Downloading playlist: TheAn1meMan
18:15 🔗 Microgur1 [youtube:user] TheAn1meMan: Downloading video ids from 1 to 51
18:15 🔗 Microgur1 ERROR: Unable to download webpage: HTTP Error 410: Gone; please report this issue on https://yt-dl.org/bug . Be sure to call youtube-dl with the --verbose flag and include its complete output. Make sure you are using the latest version; type youtube-dl -U to update."
18:17 🔗 Microgur1 is this normal? I'm using version 2014.02.17
18:17 🔗 ivan` Microgur1: haha you can't use an ancient youtube-dl
18:17 🔗 antomatic type youtube-dl -U to update
18:20 🔗 Microgur1 there we go, now it's working
18:21 🔗 Microgur1 I downloaded the latest version from the github page, set the executable bit, then ran.
18:22 🔗 tjg has quit IRC (Ping timeout: 260 seconds)
18:22 🔗 JesseW archive.is now back for me.
18:23 🔗 tjg has joined #archiveteam-bs
18:28 🔗 Microgur1 "As a side note, the administrator is unsupportive of Internet Archive's robots.txt policy - which could hinder future backup cooperation. " does this mean that archive.is isn't backed up fully?
18:28 🔗 Microgur1 source: http://archiveteam.org/index.php?title=Archive.is
18:29 🔗 ivan` I emailed the guy and he was unhappy with IA for not making all the wayback WARCs available
18:32 🔗 ivan` hm, the date field in archive.org cdx'es is for the response, not the request, right?
18:45 🔗 kyan I'm unhappy with some things IA does too, but that doesn't mean I don't absolutely adore that organization.
18:52 🔗 Microgur1 i've noticed that when downloading an entire youtube channel, the resulting videos have some weird properties. such as "90001 frames per second" and an audio type of audio/x-unknown which won't play for me. this error message was produced in the terminal window, and may be related: "WARNING: Your copy of avconv is outdated, update avconv to version 10-0 or newer if you encounter any errors."
18:54 🔗 tjg has quit IRC (Ping timeout: 260 seconds)
18:58 🔗 tjg has joined #archiveteam-bs
18:58 🔗 SimpBrain hey either khanacademy.org is annoying or they are running out of funds. just received multiple emails within the last 2 weeks, telling me to donate
19:23 🔗 ivan` Microgur1: yeah, youtube-dl needs avconv or ffmpeg to mux the audio and video streams together
19:23 🔗 ivan` I use a recent ffmpeg
19:23 🔗 ivan` there's a ppa for ubuntu 14.04; it's in ubuntu 15.10 without a ppa
19:26 🔗 Microgur1 I guess this machine is no good for archiving; it's stuck in 2014. at least it can run the Warrior just fine (8085 urlteam units and counting since last reboot)
19:27 🔗 antomatic I wouldn't worry about the avconv error, Microgurl. It just downloads the file in a different way, so it doesn't affect the archiving.
19:27 🔗 ivan` I use ubuntu 14.04 for archiving just fine
19:27 🔗 Microgur1 I see.
19:27 🔗 ivan` antomatic: he just mentioned that his files are screwed up though
19:27 🔗 ivan` it's probably still using avconv
19:28 🔗 ivan` and if it's not using avconv/ffmpeg for muxing, you can't get the highest-resolution formats
19:28 🔗 Microgur1 yeah, totem can't play the audio. maybe that's just because the audio it's downloading is in a codec I can't play. I've had issues with that unrelated to youtube-dl before with newer, generally patented codecs
19:28 🔗 ivan` well, you can, as separate files, but who wants that
19:28 🔗 ivan` try mpv
19:29 🔗 ivan` YouTube has opus audio and VP9 video
19:29 🔗 ivan` along with H264 main profile and VP8 and vorbis and AAC
19:35 🔗 Microgur1 i'm trying it out without specifying formats this time
19:36 🔗 Microgur1 i'm getting H.264 for the video and MPEG-4 AAC for the audio. are those usually the best youtube gives?
19:36 🔗 ivan` sometimes there are higher-resolution formats available only in VP9
19:36 🔗 ivan` use youtube-dl -F to see a list of formats for a video
19:37 🔗 ivan` you can pass -f FORMAT+FORMAT to get that video+audio
19:37 🔗 ivan` some formats (e.g. 22) include both video and audio
19:37 🔗 Microgur1 passing -f bestvideo+bestaudio while using grandpa's old avconv was causing the problem
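The format discussion above can be sketched as a small helper that chooses a `-f` value depending on whether a muxer is installed. This is only an illustration under stated assumptions: the function names are hypothetical, the command is built but never run, and the check looks only for the presence of `ffmpeg` or `avconv` on PATH, not their version.

```python
import shutil


def find_muxer():
    """Return the name of the first muxer found on PATH, or None."""
    for tool in ("ffmpeg", "avconv"):
        if shutil.which(tool):
            return tool
    return None


def format_selector(muxer):
    """Pick a youtube-dl -f value based on whether a muxer is available."""
    # Separate video and audio streams need ffmpeg/avconv to be merged;
    # without one, fall back to a single file that already has both.
    return "bestvideo+bestaudio/best" if muxer else "best"


def build_command(url, muxer):
    """Build (but do not run) a youtube-dl argv for the given URL."""
    return ["youtube-dl", "-f", format_selector(muxer), "--ignore-errors", url]


if __name__ == "__main__":
    print(build_command("https://www.youtube.com/user/TheAn1meMan/", find_muxer()))
```

Because the sketch does not inspect versions, an outdated avconv like the one Microgur1 had would still pass this check; running `youtube-dl -F` on a video, as ivan` suggests, remains the way to see which formats are actually on offer.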
19:40 🔗 JesseW I value archive.is particularly *because* it is not-entirely-friendly with IA -- it serves as a (semi-) independent collection.
19:42 🔗 Microgur1 speaking of that, are there any IA clones?
19:42 🔗 JesseW I also sympathize with both the objection to IA keeping the wayback WARCs private, and IA's decision to do so. It's a balance between availability (which would lean towards making the WARCs open) and avoiding content authors objecting to inclusion (which they would be more likely to do if the WARCs were open).
19:43 🔗 JesseW There are other web scrape collections using the Wayback software (mainly by national libraries).
19:43 🔗 JesseW I'm not sure where a list is.
19:44 🔗 JesseW Direct IA clones -- not exactly, AFAIK.
20:13 🔗 JesseW has quit IRC (Leaving.)
20:31 🔗 wyatt8740 has quit IRC (Read error: Operation timed out)
20:33 🔗 yipdw I didn't know it was a decision; I thought it was still more of a "that isn't on our high-priority list"
20:44 🔗 aaaaaaaaa has joined #archiveteam-bs
20:44 🔗 swebb sets mode: +o aaaaaaaaa
21:02 🔗 Ravenloft has joined #archiveteam-bs
21:02 🔗 Ravenloft "we held back this title till 1 week after cinema premiere to give the movie a fighting chance to play in the budget, we learned from our mistake"
21:02 🔗 Ravenloft did you guys catch that?
21:09 🔗 Ravenloft it's not every day a pirate suddenly gets a conscience
21:12 🔗 myself That's really interesting.
21:25 🔗 dashcloud has quit IRC (Read error: Operation timed out)
21:29 🔗 dashcloud has joined #archiveteam-bs
21:48 🔗 Sketchcow Very occasionally
21:49 🔗 Stiletto has quit IRC (Read error: Connection reset by peer)
22:13 🔗 SimpBrain mpaa must have every fbi agent on the case
22:13 🔗 SimpBrain sod terrorists, someone is losing hollywood money, after them!
22:50 🔗 BlueMaxim has joined #archiveteam-bs
22:59 🔗 Sketchcow An end of year gift for Archive Team from me.
22:59 🔗 Sketchcow (Don't social media/reddit/hackernews it)
22:59 🔗 Sketchcow https://archive.org/details/magazine_rack?sort=-publicdate
22:59 🔗 Sketchcow Across the next, oh, hour or two, hundreds of magazines will appear there. Stuff from the last few months.
23:00 🔗 Sketchcow Read up, expand some horizons, go into the new year happy.
23:13 🔗 yipdw whoa
23:17 🔗 Sketchcow It won't last
23:17 🔗 Sketchcow But let us enjoy a fleeting public cool thing.
23:26 🔗 Sketchcow Just broke 1000 queued up.
23:27 🔗 HCross our little newsbot has a big night ahead of it to say the least http://newsgrabber.harrycross.me:29000
23:34 🔗 kyan has quit IRC (Ping timeout: 258 seconds)
23:52 🔗 dashcloud those are rather current magazines- good job there Sketchcow
