Time |
Nickname |
Message |
00:48
🔗
|
|
BlueMaxim has joined #archiveteam-bs |
02:13
🔗
|
|
bwn has joined #archiveteam-bs |
02:23
🔗
|
|
schbirid2 has joined #archiveteam-bs |
02:30
🔗
|
|
Pixi has quit IRC (Quit: Pixi) |
02:32
🔗
|
|
schbirid has quit IRC (Read error: Operation timed out) |
02:32
🔗
|
|
Pixi has joined #archiveteam-bs |
03:04
🔗
|
|
godane has quit IRC (Remote host closed the connection) |
03:05
🔗
|
|
godane has joined #archiveteam-bs |
04:02
🔗
|
|
Coderjo has quit IRC (Remote host closed the connection) |
04:41
🔗
|
|
M-WillBra is now known as WillBradl |
04:41
🔗
|
jacketcha |
so is batoto just going to die |
04:42
🔗
|
|
godane has quit IRC (Read error: Operation timed out) |
04:45
🔗
|
|
wbradley has joined #archiveteam-bs |
04:45
🔗
|
|
qw3rty16 has joined #archiveteam-bs |
04:46
🔗
|
|
wbradley is now known as zeeboots |
04:47
🔗
|
|
WillBradl is now known as WillBra4 |
04:47
🔗
|
|
WillBra4 is now known as zyph |
04:48
🔗
|
|
qw3rty15 has quit IRC (Read error: Operation timed out) |
04:48
🔗
|
|
zyph is now known as zyphlar |
04:49
🔗
|
|
zeeboots has left WeeChat 1.4 |
04:51
🔗
|
|
godane has joined #archiveteam-bs |
05:04
🔗
|
godane |
so my archivebox project is maybe in alpha/stable stage |
05:06
🔗
|
godane |
i found out that the built-in wifi on the rpi3 would disconnect a lot if wireless power management |
05:06
🔗
|
godane |
was on |
05:06
🔗
|
godane |
so i added 'wireless-power off' to /etc/network/interfaces |
05:07
🔗
|
godane |
it was working for about 15 minutes when i was loading tons of pages from kiwix |
05:07
🔗
|
godane |
vs like 5 or 10 pages before disconnecting with power management on |
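A minimal sketch for confirming that a setting like the one godane describes took effect. It assumes the output of the `iwconfig` tool has been captured as text and contains a `Power Management:on` or `Power Management:off` field. The function name and the sample text are illustrative and were not part of the conversation.

```python
import re
from typing import Optional


def power_management_state(iwconfig_output: str) -> Optional[str]:
    """Return 'on' or 'off' from iwconfig-style text, or None if the field is absent."""
    match = re.search(r"Power Management:\s*(on|off)", iwconfig_output)
    return match.group(1) if match else None


# Illustrative sample text, not captured from godane's Raspberry Pi.
SAMPLE_OUTPUT = (
    'wlan0     IEEE 802.11  ESSID:"example"\n'
    "          Power Management:off\n"
)

if __name__ == "__main__":
    print(power_management_state(SAMPLE_OUTPUT))
```

On a real machine the text would come from running `iwconfig wlan0`; the interface name `wlan0` is also an assumption.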
05:13
🔗
|
|
Mateon1 has quit IRC (Read error: Connection reset by peer) |
05:13
🔗
|
|
Mateon1 has joined #archiveteam-bs |
05:15
🔗
|
|
icedice has joined #archiveteam-bs |
05:45
🔗
|
|
icedice has quit IRC (Read error: Connection reset by peer) |
05:50
🔗
|
|
octothorp has quit IRC (Remote host closed the connection) |
05:54
🔗
|
|
jdude104 has quit IRC (Leaving) |
05:55
🔗
|
|
jdude104 has joined #archiveteam-bs |
05:56
🔗
|
|
jdude104 has quit IRC (Client Quit) |
05:56
🔗
|
|
jdude104 has joined #archiveteam-bs |
05:57
🔗
|
|
icedice has joined #archiveteam-bs |
06:00
🔗
|
|
Kimmer has quit IRC (Leaving) |
06:28
🔗
|
|
Ravenloft has quit IRC (Read error: Connection reset by peer) |
06:41
🔗
|
|
jdude has joined #archiveteam-bs |
06:45
🔗
|
|
jdude104 has quit IRC (Read error: Operation timed out) |
06:57
🔗
|
|
icedice has quit IRC (Ping timeout: 245 seconds) |
07:09
🔗
|
|
jdude has quit IRC (Leaving) |
07:09
🔗
|
|
jdude104 has joined #archiveteam-bs |
07:12
🔗
|
|
jdude104 has quit IRC (Client Quit) |
08:17
🔗
|
|
octothorp has joined #archiveteam-bs |
09:15
🔗
|
|
Kimmer has joined #archiveteam-bs |
09:25
🔗
|
|
jschwart has joined #archiveteam-bs |
09:45
🔗
|
|
Coderjo has joined #archiveteam-bs |
11:19
🔗
|
|
BlueMaxim has quit IRC (Leaving) |
11:59
🔗
|
JAA |
jacketcha: Yes, #botato. |
12:02
🔗
|
|
Smiley has joined #archiveteam-bs |
12:05
🔗
|
|
SmileyG has quit IRC (Ping timeout: 260 seconds) |
13:52
🔗
|
|
REiN^ has quit IRC (Remote host closed the connection) |
14:53
🔗
|
odemg |
SketchCow, claim the $100 |
14:53
🔗
|
odemg |
https://twitter.com/_cryptome_/status/952168812505387008 |
14:53
🔗
|
odemg |
https://splinternews.com/rogue-archivists-are-creating-a-copy-of-gawker-com-so-t-1793861301 |
15:18
🔗
|
odemg |
godane, we're ripping pbs content, see https://i.imgur.com/qGRIO9R.png ... get in here https://discord.gg/RQpHMJP (did you already write something?) still, get in there <3 |
16:20
🔗
|
godane |
charlie rose uses a custom script just for charlierose.com |
16:21
🔗
|
godane |
*i use a custom script |
16:22
🔗
|
|
K4k has quit IRC (Read error: Connection reset by peer) |
16:22
🔗
|
JAA |
godane: What are you grabbing exactly? I had to ignore the actual videos in the ArchiveBot job towards the end because my machine had a forced reboot due to the Meltdown bug. |
16:22
🔗
|
JAA |
I'm planning to resume that though. There are about 5400 videos left IIRC. |
16:23
🔗
|
godane |
right now i'm grabbing the 762 version of the videos |
16:23
🔗
|
godane |
i was downloading a month's worth of videos and then uploading them |
16:24
🔗
|
godane |
my panic grab of 762 version is just in case shit hits the fan |
16:24
🔗
|
JAA |
Ok, the URLs I ignored look like this: https://pfm1hycdn01-a.akamaihd.net/788/1HY788_003_xp.f4v |
16:24
🔗
|
godane |
cause it should be around 2.5 to 3.0tb |
16:24
🔗
|
JAA |
The ArchiveBot job grabbed some 6 TB and the remaining videos will be another 2-3 TB. |
16:25
🔗
|
godane |
those f4v files most of the time don't exist |
16:27
🔗
|
godane |
i'm also doing something crazy and making a mp3 collection from the charlie rose videos |
16:28
🔗
|
godane |
the mp3 collection will offer hoarders with low disk space a way to have some sort of archive of it |
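One way an audio-only copy could be produced from a downloaded video, sketched under the assumption that `ffmpeg` with the `libmp3lame` encoder is installed. godane does not say which tool is used, so the function name, the quality setting, and the file names are hypothetical.

```python
from pathlib import Path
from typing import List


def mp3_command(video: Path, out_dir: Path, quality: int = 4) -> List[str]:
    """Build an ffmpeg command that drops the video stream and encodes the audio as MP3."""
    target = out_dir / (video.stem + ".mp3")
    return [
        "ffmpeg",
        "-i", str(video),          # input video file
        "-vn",                     # discard the video stream
        "-codec:a", "libmp3lame",  # encode audio with the LAME MP3 encoder
        "-q:a", str(quality),      # variable-bitrate quality (lower is better)
        str(target),
    ]


if __name__ == "__main__":
    # Hypothetical file names; pass the list to subprocess.run(..., check=True) to execute it.
    print(mp3_command(Path("episode.mp4"), Path("audio")))
```

Building the command as a list rather than a shell string avoids quoting problems with unusual file names.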
16:29
🔗
|
godane |
btw another series i have to go after later is called 'The Open Mind' |
16:38
🔗
|
SketchCow |
odemg: I'm running something to pull out the gawker stuff. |
16:38
🔗
|
SketchCow |
I'm sure we used archivebot for it, not anything else, right |
16:38
🔗
|
odemg |
godane, ohh I know re crose stuff you sent me the script, just wondering about pbs |
16:38
🔗
|
odemg |
SketchCow, sound :D |
16:40
🔗
|
odemg |
SketchCow, you should likely tweet at them and let them know, get that money son! |
17:16
🔗
|
JAA |
godane: They do exist, but you can only access them if you set the correct referrer, otherwise you get the not found error. |
18:08
🔗
|
|
mnjgno has joined #archiveteam-bs |
18:09
🔗
|
mnjgno |
hello! I did this: http://bookmarklets.htmlbin.net/archiving.html Do any of you know more services? Obviously all of you use more advanced tools (warc, extensions) but for casual browsing, bookmarklets are excellent, so if any of you know about more services...? :D |
18:12
🔗
|
SketchCow |
The page should be a little prettier, and should have a way to preview what's IN the bookmarklet. |
18:15
🔗
|
Kaz |
Igloo: https://twitter.com/emilybatty/status/952241942963851266 |
18:15
🔗
|
Igloo |
holy |
18:16
🔗
|
Kaz |
assuming hoax, lots of people reporting it but i feel like there'd be some coverage |
18:17
🔗
|
Igloo |
Wow |
18:17
🔗
|
Igloo |
Pretty widespread |
18:22
🔗
|
Kaz |
https://twitter.com/NutzFordBucks/status/952243050675281922 |
18:23
🔗
|
mnjgno |
@SketchCow, I am just gathering online archive services, so if you know more, :) obviously all can be improved. |
18:24
🔗
|
SketchCow |
That's fine |
18:24
🔗
|
SketchCow |
But I'm telling you "drag this bookmarklet to your bar" is the new "click on this awesome desktop toy.exe" |
18:24
🔗
|
SketchCow |
Document and make it easy to understand what these do |
18:34
🔗
|
mnjgno |
cool! I'll keep that in mind if I ever publish for more people. Although if doing that I should remove peeep.us then. thanks anyway :) |
18:40
🔗
|
godane |
JAA: what's the referer needed to get the f4v file |
18:40
🔗
|
|
Uzerus has joined #archiveteam-bs |
18:40
🔗
|
Uzerus |
jacketcha: missile? where? |
18:43
🔗
|
Kaz |
BBC news dropping in with the *slowest* breaking news alert ever http://www.bbc.co.uk/news/world-us-canada-42677604 |
18:46
🔗
|
JAA |
godane: Something like https://charlierose.com/video/player/24740?autoplay=false (for the URL above) I think. I'm not sure how strictly they check. |
19:04
🔗
|
mnjgno |
https://www.buzzfeed.com/mbvd/false-alarm-ballistic-missile-threat-hawaii |
19:22
🔗
|
jacketcha |
Uzerus: Hawaii |
19:22
🔗
|
jacketcha |
but, false alarm I guess |
19:28
🔗
|
JAA |
godane: Apparently a referrer of https://charlierose.com/ is sufficient. |
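A minimal sketch of the request JAA describes, using only the Python standard library. The helper names are hypothetical, and the sketch assumes the CDN checks nothing beyond the `Referer` header, which the conversation does not confirm.

```python
import urllib.request

DEFAULT_REFERER = "https://charlierose.com/"


def build_video_request(url: str, referer: str = DEFAULT_REFERER) -> urllib.request.Request:
    """Create a GET request that carries the Referer header the CDN appears to require."""
    return urllib.request.Request(url, headers={"Referer": referer})


def fetch_video(url: str, referer: str = DEFAULT_REFERER) -> bytes:
    """Download the response body; raises urllib.error.HTTPError on an error status."""
    with urllib.request.urlopen(build_video_request(url, referer), timeout=60) as response:
        return response.read()


if __name__ == "__main__":
    # URL quoted earlier in the log; whether it still resolves is not guaranteed.
    request = build_video_request("https://pfm1hycdn01-a.akamaihd.net/788/1HY788_003_xp.f4v")
    print(request.get_header("Referer"))
```

For multi-gigabyte files, reading the response in chunks and writing to disk would be preferable to `response.read()`.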
19:43
🔗
|
godane |
tell me how to get this file: https://pfm1hycdn01-a.akamaihd.net/113/1HY113_007_lp.f4v |
19:43
🔗
|
godane |
i can't get it to download even with charlierose.com as referer |
19:45
🔗
|
|
Mateon1 has quit IRC (Read error: Operation timed out) |
19:46
🔗
|
|
Mateon1 has joined #archiveteam-bs |
19:47
🔗
|
JAA |
godane: Hmm, yeah, neither can I. The server returns status 200 but an empty body. |
19:47
🔗
|
JAA |
The ArchiveBot job got the same result: 2017-12-02 22:57:21,338 - wpull.processor.web - INFO - Fetched ‘https://pfm1hycdn01-a.akamaihd.net/113/1HY113_007_lp.f4v’: 200 OK. Length: 0 [video/x-flv]. |
19:47
🔗
|
JAA |
So I guess that file might be broken? |
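A sketch for finding other fetches like this one, where the status is 200 but the body is empty. The regular expression is inferred from the single wpull log line quoted above, so other line shapes (for example a non-numeric length) are assumed not to match. All names are illustrative.

```python
import re
from typing import NamedTuple, Optional

# Pattern inferred from the one log line quoted in the conversation.
FETCHED = re.compile(
    r"Fetched ‘(?P<url>[^’]+)’: (?P<status>\d{3}) (?P<reason>.*?)\. "
    r"Length: (?P<length>\d+) \[(?P<mime>[^\]]*)\]"
)


class Fetch(NamedTuple):
    url: str
    status: int
    length: int
    mime: str


def parse_fetch(line: str) -> Optional[Fetch]:
    """Extract URL, status, length and MIME type from a 'Fetched' log line, if present."""
    match = FETCHED.search(line)
    if match is None:
        return None
    return Fetch(match["url"], int(match["status"]), int(match["length"]), match["mime"])


def is_empty_success(line: str) -> bool:
    """True when the line records a 200 response with a zero-length body."""
    fetch = parse_fetch(line)
    return fetch is not None and fetch.status == 200 and fetch.length == 0


SAMPLE_LINE = (
    "2017-12-02 22:57:21,338 - wpull.processor.web - INFO - Fetched "
    "‘https://pfm1hycdn01-a.akamaihd.net/113/1HY113_007_lp.f4v’: 200 OK. "
    "Length: 0 [video/x-flv]."
)

if __name__ == "__main__":
    print(is_empty_success(SAMPLE_LINE))
```

Running `is_empty_success` over every line of a job log would list the URLs worth retrying or reporting as broken.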
19:48
🔗
|
godane |
that episode is the only lost one i can't get |
19:49
🔗
|
godane |
plus side is the 2 segments from that episode do exist |
20:01
🔗
|
jrwr |
Kaz: Igloo https://streamable.com/6fs0n |
20:01
🔗
|
jrwr |
what was broadcast to TV for the EAS Alert |
20:12
🔗
|
mnjgno |
by the way, do any of you use peeep.us to bypass robots.txt files? |
20:14
🔗
|
Kaz |
Huh |
20:14
🔗
|
Kaz |
No, we just ignore them |
20:16
🔗
|
mnjgno |
ah oki |
20:34
🔗
|
Igloo |
jrwr: holy cow that is hard to read |
21:11
🔗
|
|
REiN^ has joined #archiveteam-bs |
21:11
🔗
|
|
ranavalon has quit IRC (Quit: Leaving) |
21:52
🔗
|
|
Jusque has quit IRC (Quit: ZNC - http://znc.in) |
21:53
🔗
|
|
Jusque has joined #archiveteam-bs |
21:57
🔗
|
|
Jusque has quit IRC (Client Quit) |
21:58
🔗
|
|
Jusque has joined #archiveteam-bs |
23:38
🔗
|
|
odemg has quit IRC (Ping timeout: 260 seconds) |
23:42
🔗
|
|
mnjgno has quit IRC (Quit: Leaving) |
23:52
🔗
|
|
odemg has joined #archiveteam-bs |