#archiveteam-bs 2018-01-13,Sat

↑back Search ←Prev date Next date→ Show only urls(Click on time to select a line by its url)

WhoWhatWhen
***BlueMaxim has joined #archiveteam-bs [00:48]
.................. (idle for 1h25mn)
bwn has joined #archiveteam-bs [02:13]
schbirid2 has joined #archiveteam-bs [02:23]
Pixi has quit IRC (Quit: Pixi)
schbirid has quit IRC (Read error: Operation timed out)
Pixi has joined #archiveteam-bs
[02:30]
....... (idle for 32mn)
godane has quit IRC (Remote host closed the connection)
godane has joined #archiveteam-bs
[03:04]
............ (idle for 57mn)
Coderjo has quit IRC (Remote host closed the connection) [04:02]
........ (idle for 39mn)
M-WillBra is now known as WillBradl [04:41]
jacketchaso is batoto just going to die [04:41]
***godane has quit IRC (Read error: Operation timed out)
wbradley has joined #archiveteam-bs
qw3rty16 has joined #archiveteam-bs
wbradley is now known as zeeboots
WillBradl is now known as WillBra4
WillBra4 is now known as zyph
qw3rty15 has quit IRC (Read error: Operation timed out)
zyph is now known as zyphlar
zeeboots has left WeeChat 1.4
godane has joined #archiveteam-bs
[04:42]
godaneso i'm archivebox project maybe in alpha/stable stage
i found out that the build-in wifi rpi3 would disconnect alot if wireless power management
was on
so i added 'wireless-power off' to /etc/network/interfaces
it was working for about 15 minutes when i was loading tons of pages from kiwix
vs like 5 or 10 pages before disconnecting with power management on
[05:04]
***Mateon1 has quit IRC (Read error: Connection reset by peer)
Mateon1 has joined #archiveteam-bs
icedice has joined #archiveteam-bs
[05:13]
....... (idle for 30mn)
icedice has quit IRC (Read error: Connection reset by peer) [05:45]
octothorp has quit IRC (Remote host closed the connection)
jdude104 has quit IRC (Leaving)
jdude104 has joined #archiveteam-bs
jdude104 has quit IRC (Client Quit)
jdude104 has joined #archiveteam-bs
icedice has joined #archiveteam-bs
Kimmer has quit IRC (Leaving)
[05:50]
...... (idle for 28mn)
Ravenloft has quit IRC (Read error: Connection reset by peer) [06:28]
jdude has joined #archiveteam-bs
jdude104 has quit IRC (Read error: Operation timed out)
[06:41]
icedice has quit IRC (Ping timeout: 245 seconds) [06:57]
jdude has quit IRC (Leaving)
jdude104 has joined #archiveteam-bs
jdude104 has quit IRC (Client Quit)
[07:09]
.............. (idle for 1h5mn)
octothorp has joined #archiveteam-bs [08:17]
............ (idle for 58mn)
Kimmer has joined #archiveteam-bs [09:15]
jschwart has joined #archiveteam-bs [09:25]
..... (idle for 20mn)
Coderjo has joined #archiveteam-bs [09:45]
................... (idle for 1h34mn)
BlueMaxim has quit IRC (Leaving) [11:19]
......... (idle for 40mn)
JAAjacketcha: Yes, #botato. [11:59]
***Smiley has joined #archiveteam-bs
SmileyG has quit IRC (Ping timeout: 260 seconds)
[12:02]
...................... (idle for 1h47mn)
REiN^ has quit IRC (Remote host closed the connection) [13:52]
............. (idle for 1h1mn)
odemgSketchCow, claim the $100
https://twitter.com/_cryptome_/status/952168812505387008
https://splinternews.com/rogue-archivists-are-creating-a-copy-of-gawker-com-so-t-1793861301
[14:53]
...... (idle for 25mn)
godane, we're ripping pbs content, see https://i.imgur.com/qGRIO9R.png ... get in here https://discord.gg/RQpHMJP (did you already write something?) still, get in there <3 [15:18]
............. (idle for 1h2mn)
godanecharlie rose uses a custom script just for charlierose.com
*i uses a custom script
[16:20]
***K4k has quit IRC (Read error: Connection reset by peer) [16:22]
JAAgodane: What are you grabbing exactly? I had to ignore the actual videos in the ArchiveBot job towards the end because my machine had a forced reboot due to the Meltdown bug.
I'm planning to resume that though. There are about 5400 videos left IIRC.
[16:22]
godaneright now i'm grabbing the 762 version of the videos
i was downloading a month worth of videos and then upload them
my panic grab of 762 version is just in case shit hits the fan
[16:23]
JAAOk, the URLs I ignored look like this: https://pfm1hycdn01-a.akamaihd.net/788/1HY788_003_xp.f4v [16:24]
godanecause it should be around 2.5 to 3.0tb [16:24]
JAAThe ArchiveBot job grabbed some 6 TB and the remaining videos will be another 2-3 TB. [16:24]
godanethose f4v files most of the time don't exist
i'm also doing something crazy and making a mp3 collection from the charlie rose videos
the mp3 collection will be offer some hoarders with low disk space to have some sort of archive of it
btw other series i have to go after later is called 'The Open Mind'
[16:25]
SketchCowodemg: I'm running something to pull out the gawker stuff.
I'm sure we used archivebot for it, not anything else, right
[16:38]
odemggodane, ohh I know re crose stuff you sent me the script, just wondering about pbs
SketchCow, sound :D
SketchCow, you should likely tweet at them and let them know, get that money son!
[16:38]
........ (idle for 36mn)
JAAgodane: They do exist, but you can only access them if you set the correct referrer, otherwise you get the not found error. [17:16]
........... (idle for 52mn)
***mnjgno has joined #archiveteam-bs [18:08]
mnjgnohello! I did this: http://bookmarklets.htmlbin.net/archiving.html Have any of you know more services? Obviously all of you use more advanced tools (warc, extensions) but for a casual browsing, bookmarklets are excellent, so if any of you know about more services...? :D [18:09]
SketchCowThe page should be a little pretty, and should have a way to preview what's IN the bookmarket. [18:12]
KazIgloo: https://twitter.com/emilybatty/status/952241942963851266 [18:15]
Iglooholy [18:15]
Kazassuming hoax, lots of people reporting it but i feel like there'd be some coverage [18:16]
IglooWow
Pretty wide spread
[18:17]
Kazhttps://twitter.com/NutzFordBucks/status/952243050675281922 [18:22]
mnjgno@SketchCow, I am just gathering online archive services, so if you now more, :) obviously all can be improved. [18:23]
SketchCowThat's fine
But I'm telling you "drag this bookmarklet to your bar" is the new "click on this awesome desktop toy.exe"
Document and make it easy to understand what these do
[18:24]
mnjgnocool! I'll have in mind if I ever publish for more people. Although if doing that I should remove peep us then. thanks anyway :) [18:34]
godaneJAA: whats the referer needed to get f4v file [18:40]
***Uzerus has joined #archiveteam-bs [18:40]
Uzerusjacketcha: missle? where? [18:40]
KazBBC news dropping in with the *slowest* breaking news alert ever http://www.bbc.co.uk/news/world-us-canada-42677604 [18:43]
JAAgodane: Something like https://charlierose.com/video/player/24740?autoplay=false (for the URL above) I think. I'm not sure how strictly they check. [18:46]
.... (idle for 18mn)
mnjgnohttps://www.buzzfeed.com/mbvd/false-alarm-ballistic-missile-threat-hawaii [19:04]
.... (idle for 18mn)
jacketchaUzerus: Hawaii
but, false alarm I guess
[19:22]
JAAgodane: Apparently a referrer of https://charlierose.com/ is sufficient. [19:28]
.... (idle for 15mn)
godanetell me how to get this file: https://pfm1hycdn01-a.akamaihd.net/113/1HY113_007_lp.f4v
i can't get it to download even with charlierose.com as referer
[19:43]
***Mateon1 has quit IRC (Read error: Operation timed out)
Mateon1 has joined #archiveteam-bs
[19:45]
JAAgodane: Hmm, yeah, neither can I. The server returns status 200 but an empty body.
The ArchiveBot job got the same result: 2017-12-02 22:57:21,338 - wpull.processor.web - INFO - Fetched ‘https://pfm1hycdn01-a.akamaihd.net/113/1HY113_007_lp.f4v’: 200 OK. Length: 0 [video/x-flv].
So I guess that file might be broken?
[19:47]
godanethat episode is the only lost one i can't get
plus side is the 2 segments from that episode do exist
[19:48]
jrwrKaz: Igloo https://streamable.com/6fs0n
what was broadcast to TV for the EAS Alert
[20:01]
mnjgnoby the way, any of you uses peeep.us to bypass robots.txt files? [20:12]
KazHuh
No, we just ignore them
[20:14]
mnjgnoah oki [20:16]
.... (idle for 18mn)
Igloojrwr: holy cow that is hard to read [20:34]
........ (idle for 37mn)
***REiN^ has joined #archiveteam-bs
ranavalon has quit IRC (Quit: Leaving)
[21:11]
......... (idle for 41mn)
Jusque has quit IRC (Quit: ZNC - http://znc.in)
Jusque has joined #archiveteam-bs
Jusque has quit IRC (Client Quit)
Jusque has joined #archiveteam-bs
[21:52]
..................... (idle for 1h40mn)
odemg has quit IRC (Ping timeout: 260 seconds)
mnjgno has quit IRC (Quit: Leaving)
[23:38]
odemg has joined #archiveteam-bs [23:52]

↑back Search ←Prev date Next date→ Show only urls(Click on time to select a line by its url)