Time |
Nickname |
Message |
00:04
🔗
|
|
ldac050 has quit IRC (Ping timeout: 260 seconds) |
00:08
🔗
|
|
Video has joined #archiveteam-bs |
00:25
🔗
|
|
Mayonaise has quit IRC (Read error: Operation timed out) |
00:30
🔗
|
|
Mayonaise has joined #archiveteam-bs |
00:43
🔗
|
|
OrIdow6 has joined #archiveteam-bs |
01:03
🔗
|
SootBectr |
Noted, thanks |
01:04
🔗
|
|
oxguy3 has joined #archiveteam-bs |
01:05
🔗
|
oxguy3 |
@sootbectr (continuing from #archiveteam) i'm just wgetting the rockstar manuals using my browser's rendered version of the html page as input |
01:06
🔗
|
SootBectr |
Oh is it that easy, I was going to grep out the .pdf links and make a list |
01:07
🔗
|
JAA |
They're not even all PDFs. GTA V for example has some .zip, .dmg, and other stuff. :-| |
01:07
🔗
|
oxguy3 |
yep, good ole `-F -i page.html` |
01:07
🔗
|
JAA |
Also, https://www.rockstargames.com/manuals/get-manuals.json |
01:07
🔗
|
oxguy3 |
oooooo |
01:07
🔗
|
SootBectr |
Ruinged all my commandline fun ;) |
01:08
🔗
|
oxguy3 |
i'll whip up a jq command then |
01:08
🔗
|
JAA |
Parse the JSON with grep for shell fun. :-) |
01:08
🔗
|
SootBectr |
I'll pass on that parsing |
01:09
🔗
|
JAA |
Probably just grep -Po '"href"\s*:\s*"\K[^"]+' in this case. |
01:12
🔗
|
oxguy3 |
i love jq https://pastebin.com/dQcv7JQw |
01:12
🔗
|
markedL |
ia_archiver doesn't apply to AB I assume? |
01:12
🔗
|
JAA |
Of course not. AB only uses robots.txt to discover additional content. |
01:12
🔗
|
arkiver |
:) |
01:12
🔗
|
oxguy3 |
hehehe |
01:13
🔗
|
arkiver |
can put the list in archivebot |
01:13
🔗
|
arkiver |
we should have some easy command for AB to extract any URLs from some initial page and queue those as a !ao < |
01:14
🔗
|
JAA |
Nope, we should have a mode of "grab this page and all links on it". |
01:14
🔗
|
arkiver |
yeah |
01:14
🔗
|
JAA |
I.e. limited recursion. Wouldn't even be very difficult to implement. |
01:15
🔗
|
JAA |
So maybe it'll happen in the 3-5 year timespan. |
01:15
🔗
|
|
Auctus has joined #archiveteam-bs |
01:15
🔗
|
arkiver |
haha yeah |
01:15
🔗
|
oxguy3 |
if anyone wants it, here was my jq command: `curl https://www.rockstargames.com/manuals/get-manuals.json | jq -r '.manuals[].manuals_platforms[].manuals_links[].href'` |
01:16
🔗
|
oxguy3 |
great little tool if you've never used it, highly recomend if you find yourself working with json with any frequency |
01:17
🔗
|
JAA |
Yeah, jq is pretty useful. Sometimes it takes me longer to write that structure down than to just use grep though. :-P |
01:18
🔗
|
JAA |
Running now. |
01:19
🔗
|
oxguy3 |
awesome, thanks |
01:19
🔗
|
JAA |
I kept only the actual files. |
01:22
🔗
|
|
Raccoon has joined #archiveteam-bs |
01:22
🔗
|
oxguy3 |
cool. ugh, wish i'd done this sooner. i bookmarked a bunch of game manuals pages a few years ago for an archive project i never actually did -- now half the links are dead :/ |
01:23
🔗
|
|
anarcat has joined #archiveteam-bs |
01:28
🔗
|
JAA |
Forgot to mention: I started an archival of Story Wars earlier (shutting down today). ETA for the actual stories is 3 hours, and then there'll be a bunch of less important things. |
01:33
🔗
|
Raccoon |
JAA: is there an updated list of the complete history of Archive Team projects and service shutdowns? |
01:34
🔗
|
* |
Raccoon back after dinner & |
01:35
🔗
|
JAA |
Complete? lol no. |
01:36
🔗
|
JAA |
The major ones are/should be on the wiki, but smaller stuff is just *maybe* announced in some channel. At other times, it just goes into ArchiveBot and that's it. |
01:37
🔗
|
JAA |
So yeah, a complete list does not and probably will never exist. |
01:37
🔗
|
|
britmob has quit IRC (Read error: Connection reset by peer) |
01:37
🔗
|
markedL |
we need an archivist, for us |
01:41
🔗
|
oxguy3 |
okay so. let's say i was aware of a 17TB folder on an enterprise box.com account, and that folder has contents of high public interest, much of which might not exist anywhere else. but, the owner probably didn't intend it to be public as the link was in a pretty obscure place and has been removed since i found it originally. so if i dropped it casually into a public IRC chat, that might spell the end of it being public. what uh do y'all think i |
01:41
🔗
|
oxguy3 |
should do? |
01:43
🔗
|
ivan |
that really depends on what is inside |
01:43
🔗
|
|
britmob has joined #archiveteam-bs |
01:44
🔗
|
oxguy3 |
it's stuff meant for members of the press, i should add. so it's meant to be sorta public at the very least |
01:44
🔗
|
ivan |
I guess you should download all of it and send it to wapo or something |
01:44
🔗
|
oxguy3 |
it includes a lot of raw video files (b-roll and stuff) and press guides and so forth |
01:44
🔗
|
oxguy3 |
washington post? i imagine most press orgs have this link |
01:45
🔗
|
|
lempamo has quit IRC (Read error: Connection reset by peer) |
01:45
🔗
|
oxguy3 |
there's nothing in here that the organization doesn't want the public to know; it's just that they probably didn't intend for the general public to have such easy access to these raw files |
01:46
🔗
|
oxguy3 |
i also don't have 17TB of storage space just sitting around lol. i have trawled it for a lot of the valuable stuff, but i haven't touched the video stuff cuz i dont have space |
01:47
🔗
|
SootBectr |
They can't care about it too much if they're counting on security by obscurity and giving the link out to lots of journalists |
01:48
🔗
|
oxguy3 |
yeah, the link literally used to be on a subsidiary company's media info page |
01:48
🔗
|
JAA |
Hypothetically, you should probably pass that link to someone trusted to mirror it quickly (i.e. before the owners notice and shut it down), and everything should stay private until the mirror is complete. |
01:49
🔗
|
oxguy3 |
yeah... maybe the folks at the-eye? |
01:49
🔗
|
JAA |
But it also depends on whether that folder still gets updated. |
01:49
🔗
|
oxguy3 |
it does |
01:50
🔗
|
JAA |
Then maybe "grab as fast as you can" is not the best solution since it would be better to preserve access. |
01:50
🔗
|
OrIdow6 |
Sounds more like this is "assumed to be of no interest to the general public" than "hidden from the general public" |
01:51
🔗
|
oxguy3 |
yeah i guess that's true, though there is definitely press-only information i've seen in here |
01:54
🔗
|
oxguy3 |
i guess im just looking for someone with the storage capacity to grab a copy of this lol |
01:54
🔗
|
hook54321 |
JAA: I doubt services like box.com notify people when stuff is downloadeded rapidly, so it'd probably be fine. |
01:55
🔗
|
|
mls_ has quit IRC (Ping timeout: 258 seconds) |
01:56
🔗
|
|
mls_ has joined #archiveteam-bs |
02:00
🔗
|
|
kiska has quit IRC (Remote host closed the connection) |
02:00
🔗
|
|
Flashfire has quit IRC (Remote host closed the connection) |
02:00
🔗
|
JAA |
hook54321: There is a traffic limit, so they probably would notice: https://community.box.com/t5/How-to-Guides-for-Account/Understand-How-Box-Measures-Bandwidth-Usage/ta-p/44 |
02:00
🔗
|
hook54321 |
ah |
02:01
🔗
|
|
kiska has joined #archiveteam-bs |
02:01
🔗
|
|
Flashfire has joined #archiveteam-bs |
02:02
🔗
|
JAA |
And in fact, that limit is 2 TB for normal business accounts (though probably higher for the Enterprise plan), so 17 TB would take the better part of a year to download. |
02:02
🔗
|
|
svchfoo3 sets mode: +o kiska |
02:02
🔗
|
|
svchfoo1 sets mode: +o kiska |
02:02
🔗
|
JAA |
Ah wait, 2 TB per month per user. |
02:02
🔗
|
oxguy3 |
it's possible that no individual user has uploaded 2TB -- there are a lot of users in this folder |
02:04
🔗
|
oxguy3 |
actually, it sounds like that limit might be shared across all users (i.e. if you have ten users, then the enterprise-wide limit is 20TB) |
02:04
🔗
|
JAA |
Yeah, probably. |
02:04
🔗
|
oxguy3 |
they have a lot of users, their limit is likely way above 17TB |
02:05
🔗
|
oxguy3 |
i know for certain they have at least 30 or so, and i think it's likely they have plenty more |
02:11
🔗
|
oxguy3 |
still, definitely don't want to risk hitting their limit (not only because we'd lose access but also cuz it sounds like it would screw them over) |
02:27
🔗
|
oxguy3 |
hmm, anyone know any tools that can download from a public box directory without authentication? rclone's box implementation requires a login. their JSON api seems easy enough so i'm already starting a python script, but i figured i'd ask... |
02:48
🔗
|
|
tech234a has quit IRC (Remote host closed the connection) |
02:48
🔗
|
|
LowLevelM has quit IRC (Remote host closed the connection) |
02:48
🔗
|
|
LowLevelM has joined #archiveteam-bs |
03:00
🔗
|
|
bsmith093 has quit IRC (Quit: Leaving.) |
03:00
🔗
|
|
tech234a has joined #archiveteam-bs |
03:07
🔗
|
|
tech234a3 has joined #archiveteam-bs |
03:09
🔗
|
|
BlueMaxim has joined #archiveteam-bs |
03:10
🔗
|
|
tech234a has quit IRC (Ping timeout: 496 seconds) |
03:10
🔗
|
|
tech234a3 is now known as tech234a |
03:10
🔗
|
|
tech234a2 has joined #archiveteam-bs |
03:12
🔗
|
|
tech234a2 has quit IRC (Client Quit) |
03:15
🔗
|
|
tech234a7 has joined #archiveteam-bs |
03:17
🔗
|
|
tech234a7 has quit IRC (Client Quit) |
03:18
🔗
|
|
tech234a9 has joined #archiveteam-bs |
03:18
🔗
|
|
tech234a has quit IRC (Ping timeout: 496 seconds) |
03:18
🔗
|
|
tech234a9 is now known as tech234a |
03:19
🔗
|
|
tech234a0 has joined #archiveteam-bs |
03:21
🔗
|
|
BlueMax has quit IRC (Ping timeout: 745 seconds) |
03:21
🔗
|
|
tech234a0 has quit IRC (Client Quit) |
03:24
🔗
|
|
bsmith093 has joined #archiveteam-bs |
03:26
🔗
|
|
bsmith093 has quit IRC (Client Quit) |
03:27
🔗
|
|
tech234a has quit IRC (Ping timeout: 496 seconds) |
03:28
🔗
|
|
tech234a has joined #archiveteam-bs |
03:30
🔗
|
|
tech234a4 has joined #archiveteam-bs |
03:32
🔗
|
|
tech234a4 has quit IRC (Client Quit) |
03:33
🔗
|
|
tech234a1 has joined #archiveteam-bs |
03:35
🔗
|
|
tech234a1 has quit IRC (Client Quit) |
03:35
🔗
|
|
tech234a7 has joined #archiveteam-bs |
03:37
🔗
|
|
tech234a7 has quit IRC (Client Quit) |
03:38
🔗
|
|
tech234a has quit IRC (Ping timeout: 496 seconds) |
03:38
🔗
|
|
bsmith093 has joined #archiveteam-bs |
03:39
🔗
|
|
bsmith093 has quit IRC (Client Quit) |
03:40
🔗
|
|
tech234a4 has joined #archiveteam-bs |
03:40
🔗
|
|
tech234a4 has quit IRC (Client Quit) |
03:41
🔗
|
|
tech234a3 has joined #archiveteam-bs |
03:42
🔗
|
|
RichardG has quit IRC (Quit: Keyboard not found, press F1 to continue) |
03:42
🔗
|
|
cerca has quit IRC (Remote host closed the connection) |
03:43
🔗
|
|
tech234a3 has quit IRC (Client Quit) |
03:43
🔗
|
|
tech234a4 has joined #archiveteam-bs |
03:43
🔗
|
|
RichardG has joined #archiveteam-bs |
03:45
🔗
|
|
tech234a4 has quit IRC (Client Quit) |
03:46
🔗
|
|
tech234a2 has joined #archiveteam-bs |
03:46
🔗
|
|
tech234a2 is now known as tech234a |
03:47
🔗
|
|
tech234a6 has joined #archiveteam-bs |
03:49
🔗
|
|
tech234a6 has quit IRC (Client Quit) |
03:49
🔗
|
|
tech234a3 has joined #archiveteam-bs |
03:51
🔗
|
|
tech234a3 has quit IRC (Client Quit) |
03:52
🔗
|
|
m007a83_ has joined #archiveteam-bs |
03:52
🔗
|
|
m007a83_ has quit IRC (Connection closed) |
03:52
🔗
|
|
tech234a6 has joined #archiveteam-bs |
03:52
🔗
|
|
m007a83_ has joined #archiveteam-bs |
03:54
🔗
|
|
tech234a6 has quit IRC (Client Quit) |
03:54
🔗
|
|
m007a83 has quit IRC (Read error: Operation timed out) |
03:54
🔗
|
|
tech234a3 has joined #archiveteam-bs |
03:55
🔗
|
|
tech234a has quit IRC (Ping timeout: 496 seconds) |
03:55
🔗
|
|
tech234a3 is now known as tech234a |
03:55
🔗
|
|
tech234a6 has joined #archiveteam-bs |
03:57
🔗
|
|
tech234a6 has quit IRC (Client Quit) |
03:58
🔗
|
|
tech234a4 has joined #archiveteam-bs |
03:59
🔗
|
|
m007a83 has joined #archiveteam-bs |
03:59
🔗
|
|
m007a83_ has quit IRC (Read error: Connection reset by peer) |
04:00
🔗
|
|
tech234a4 has quit IRC (Client Quit) |
04:00
🔗
|
|
tech234a4 has joined #archiveteam-bs |
04:02
🔗
|
|
tech234a4 has quit IRC (Client Quit) |
04:03
🔗
|
|
tech234a8 has joined #archiveteam-bs |
04:03
🔗
|
|
tech234a has quit IRC (Ping timeout: 496 seconds) |
04:03
🔗
|
|
tech234a8 is now known as tech234a |
04:04
🔗
|
|
tech234a4 has joined #archiveteam-bs |
04:06
🔗
|
|
tech234a4 has quit IRC (Client Quit) |
04:06
🔗
|
|
tech234a7 has joined #archiveteam-bs |
04:08
🔗
|
|
tech234a7 has quit IRC (Client Quit) |
04:09
🔗
|
|
tech234a9 has joined #archiveteam-bs |
04:10
🔗
|
|
oxguy3 has quit IRC (Quit: My MacBook has gone to sleep. ZZZzzz…) |
04:11
🔗
|
|
tech234a9 has quit IRC (Client Quit) |
04:12
🔗
|
|
tech234a6 has joined #archiveteam-bs |
04:12
🔗
|
|
tech234a has quit IRC (Ping timeout: 496 seconds) |
04:12
🔗
|
|
tech234a6 is now known as tech234a |
04:12
🔗
|
|
tech234a3 has joined #archiveteam-bs |
04:14
🔗
|
|
tech234a3 has quit IRC (Client Quit) |
04:15
🔗
|
|
tech234a5 has joined #archiveteam-bs |
04:16
🔗
|
Flashfire |
Would I be able to appeal my ban to archivebot temporarily in order to archive some webspaces I have found hosted on an older domain |
04:16
🔗
|
|
DFJustin has quit IRC (Remote host closed the connection) |
04:17
🔗
|
|
tech234a5 has quit IRC (Client Quit) |
04:17
🔗
|
|
tech234a1 has joined #archiveteam-bs |
04:19
🔗
|
|
odemgi_ has joined #archiveteam-bs |
04:19
🔗
|
|
tech234a1 has quit IRC (Client Quit) |
04:20
🔗
|
|
Terbium has joined #archiveteam-bs |
04:20
🔗
|
|
tech234a4 has joined #archiveteam-bs |
04:20
🔗
|
|
tech234a has quit IRC (Ping timeout: 496 seconds) |
04:20
🔗
|
|
tech234a4 is now known as tech234a |
04:21
🔗
|
|
tech234a4 has joined #archiveteam-bs |
04:21
🔗
|
|
DFJustin has joined #archiveteam-bs |
04:23
🔗
|
|
tech234a4 has quit IRC (Client Quit) |
04:23
🔗
|
|
tech234a2 has joined #archiveteam-bs |
04:25
🔗
|
|
odemgi has quit IRC (Read error: Operation timed out) |
04:25
🔗
|
|
tech234a2 has quit IRC (Client Quit) |
04:26
🔗
|
|
tech234a8 has joined #archiveteam-bs |
04:28
🔗
|
|
tech234a8 has quit IRC (Client Quit) |
04:28
🔗
|
Somebody2 |
Flashfire: just post the links here. |
04:28
🔗
|
|
tech234a0 has joined #archiveteam-bs |
04:29
🔗
|
Somebody2 |
I'll glance at them and throw them in if they make sense to me to grab. |
04:29
🔗
|
|
tech234a has quit IRC (Ping timeout: 496 seconds) |
04:29
🔗
|
|
tech234a0 is now known as tech234a |
04:29
🔗
|
|
tech234a6 has joined #archiveteam-bs |
04:31
🔗
|
|
tech234a6 has quit IRC (Client Quit) |
04:32
🔗
|
|
tech234a4 has joined #archiveteam-bs |
04:34
🔗
|
|
tech234a4 has quit IRC (Client Quit) |
04:34
🔗
|
|
tech234a1 has joined #archiveteam-bs |
04:36
🔗
|
|
tech234a1 has quit IRC (Client Quit) |
04:37
🔗
|
|
tech234a8 has joined #archiveteam-bs |
04:37
🔗
|
|
tech234a has quit IRC (Ping timeout: 496 seconds) |
04:37
🔗
|
|
tech234a8 is now known as tech234a |
04:38
🔗
|
|
tech234a3 has joined #archiveteam-bs |
04:40
🔗
|
|
tech234a3 has quit IRC (Client Quit) |
04:40
🔗
|
|
tech234a5 has joined #archiveteam-bs |
04:41
🔗
|
Raccoon |
> <JAA> So yeah, a complete list does not and probably will never exist. |
04:42
🔗
|
Raccoon |
JAA: Reason I ask is because it'd be a really fancy number / site list to have for an article "The Internet Is Shutting Down." |
04:42
🔗
|
|
tech234a5 has quit IRC (Client Quit) |
04:43
🔗
|
|
tech234a2 has joined #archiveteam-bs |
04:45
🔗
|
|
tech234a2 has quit IRC (Client Quit) |
04:45
🔗
|
|
tech234a7 has joined #archiveteam-bs |
04:46
🔗
|
|
tech234a has quit IRC (Ping timeout: 496 seconds) |
04:46
🔗
|
|
tech234a7 is now known as tech234a |
04:46
🔗
|
|
tech234a1 has joined #archiveteam-bs |
04:49
🔗
|
|
tech234a1 has quit IRC (Client Quit) |
04:49
🔗
|
|
tech234a7 has joined #archiveteam-bs |
04:51
🔗
|
|
tech234a7 has quit IRC (Client Quit) |
04:52
🔗
|
|
tech234a4 has joined #archiveteam-bs |
04:54
🔗
|
|
tech234a4 has quit IRC (Client Quit) |
04:54
🔗
|
|
tech234a6 has joined #archiveteam-bs |
04:54
🔗
|
|
tech234a has quit IRC (Ping timeout: 496 seconds) |
04:54
🔗
|
|
tech234a6 is now known as tech234a |
04:55
🔗
|
|
tech234a1 has joined #archiveteam-bs |
04:56
🔗
|
|
oxguy3 has joined #archiveteam-bs |
04:57
🔗
|
|
tech234a1 has quit IRC (Client Quit) |
04:57
🔗
|
|
qw3rty2 has joined #archiveteam-bs |
04:57
🔗
|
|
tech234a5 has joined #archiveteam-bs |
04:59
🔗
|
|
tech234a5 has quit IRC (Client Quit) |
05:00
🔗
|
|
tech234a5 has joined #archiveteam-bs |
05:02
🔗
|
|
tech234a5 has quit IRC (Client Quit) |
05:02
🔗
|
|
tech234a9 has joined #archiveteam-bs |
05:03
🔗
|
|
tech234a has quit IRC (Ping timeout: 496 seconds) |
05:03
🔗
|
|
tech234a9 is now known as tech234a |
05:04
🔗
|
|
tech234a9 has joined #archiveteam-bs |
05:06
🔗
|
|
tech234a9 has quit IRC (Client Quit) |
05:06
🔗
|
|
qw3rty has quit IRC (Ping timeout: 745 seconds) |
05:06
🔗
|
|
tech234a4 has joined #archiveteam-bs |
05:08
🔗
|
|
tech234a4 has quit IRC (Client Quit) |
05:09
🔗
|
|
tech234a0 has joined #archiveteam-bs |
05:11
🔗
|
|
tech234a0 has quit IRC (Client Quit) |
05:11
🔗
|
|
tech234a2 has joined #archiveteam-bs |
05:12
🔗
|
|
tech234a has quit IRC (Ping timeout: 496 seconds) |
05:12
🔗
|
|
tech234a2 is now known as tech234a |
05:12
🔗
|
|
tech234a3 has joined #archiveteam-bs |
05:14
🔗
|
|
tech234a3 has quit IRC (Client Quit) |
05:15
🔗
|
|
tech234a9 has joined #archiveteam-bs |
05:17
🔗
|
|
tech234a9 has quit IRC (Client Quit) |
05:17
🔗
|
|
tech234a1 has joined #archiveteam-bs |
05:17
🔗
|
|
tech234a1 has quit IRC (Client Quit) |
05:19
🔗
|
|
Terbium has quit IRC (Leaving) |
05:20
🔗
|
|
tech234a has quit IRC (Ping timeout: 496 seconds) |
05:23
🔗
|
|
Terbium has joined #archiveteam-bs |
05:27
🔗
|
|
Nick-PC has joined #archiveteam-bs |
05:30
🔗
|
|
tech234a has joined #archiveteam-bs |
05:34
🔗
|
|
HP_Archiv has quit IRC (Read error: Operation timed out) |
06:04
🔗
|
|
legoktm has quit IRC (Read error: Operation timed out) |
06:06
🔗
|
|
britmob_ has joined #archiveteam-bs |
06:08
🔗
|
|
britmob has quit IRC (Ping timeout: 248 seconds) |
06:08
🔗
|
|
legoktm has joined #archiveteam-bs |
06:13
🔗
|
|
britmob_ has quit IRC (Read error: Operation timed out) |
06:15
🔗
|
|
britmob has joined #archiveteam-bs |
06:44
🔗
|
|
bsmith093 has joined #archiveteam-bs |
07:10
🔗
|
|
oxguy3 has quit IRC (Quit: My MacBook has gone to sleep. ZZZzzz…) |
07:13
🔗
|
|
oxguy3 has joined #archiveteam-bs |
07:20
🔗
|
|
anarcat has quit IRC (Read error: Operation timed out) |
07:22
🔗
|
|
anarcat has joined #archiveteam-bs |
07:22
🔗
|
|
anarcat has quit IRC (Handshake flooding) |
07:25
🔗
|
|
VADemon has quit IRC (Quit: left4dead) |
07:27
🔗
|
|
anarcat has joined #archiveteam-bs |
07:29
🔗
|
|
VADemon has joined #archiveteam-bs |
07:30
🔗
|
|
Damme has joined #archiveteam-bs |
07:55
🔗
|
|
anarcat has quit IRC (Read error: Connection reset by peer) |
07:55
🔗
|
|
anarcat has joined #archiveteam-bs |
08:23
🔗
|
|
Medowar has joined #archiveteam-bs |
08:23
🔗
|
|
step has quit IRC (Quit: ZNC 1.7.5 - https://znc.in) |
08:24
🔗
|
|
step has joined #archiveteam-bs |
08:34
🔗
|
|
yano has quit IRC (Read error: Connection reset by peer) |
08:35
🔗
|
|
yano has joined #archiveteam-bs |
08:39
🔗
|
|
step has quit IRC (Quit: ZNC 1.7.5 - https://znc.in) |
08:40
🔗
|
|
step has joined #archiveteam-bs |
09:09
🔗
|
|
nepeat has quit IRC (Quit: ZNC 1.7.5 - https://znc.in) |
09:11
🔗
|
JAA |
My Story Wars grab finished just now except for categories, story lists, and some other random stuff. That's running now. |
09:12
🔗
|
|
step_ has joined #archiveteam-bs |
09:12
🔗
|
|
step has quit IRC (Quit: ZNC 1.7.5 - https://znc.in) |
09:13
🔗
|
|
step_ is now known as step |
09:14
🔗
|
JAA |
Raccoon: Oh, it would certainly be useful, but well, the effort to document it all would be unreasonably large. And not every single site being archived is noteworthy either. The Deathwatch page is probably as close as it gets. |
09:15
🔗
|
Raccoon |
JAA: indeed. maybe some IRC logs grepping but I haven't been logging this channel for very long either. I'll refer to the wiki as you suggested |
09:16
🔗
|
JAA |
Even that won't get you very far because stuff gets mentioned in various channels or sometimes just silently thrown into AB. |
09:16
🔗
|
JAA |
Also, there are public logs of this channel and a few others, linked on the wiki. |
09:16
🔗
|
Raccoon |
ah, duh! |
09:17
🔗
|
Raccoon |
are sandbox user pages allowed on the wiki |
09:19
🔗
|
|
girst_ has joined #archiveteam-bs |
09:19
🔗
|
|
oxguy3 has quit IRC (Quit: My MacBook has gone to sleep. ZZZzzz…) |
09:20
🔗
|
|
oxguy3 has joined #archiveteam-bs |
09:21
🔗
|
|
girst has quit IRC (Read error: Operation timed out) |
09:21
🔗
|
|
girst_ is now known as girst |
09:26
🔗
|
Somebody2 |
JAA: I'd love to get the WARCs from the fandom ArchiveBot job when you get a momement. |
09:29
🔗
|
|
Mastazi has joined #archiveteam-bs |
09:30
🔗
|
JAA |
The Story Wars grab finished a couple minutes ago. Can't guarantee it will work in the WBM, but from what I've seen, it should. |
09:31
🔗
|
|
oxguy3 has quit IRC (Quit: My MacBook has gone to sleep. ZZZzzz…) |
09:37
🔗
|
|
oxguy3 has joined #archiveteam-bs |
09:49
🔗
|
|
Mastazi has quit IRC (Leaving) |
10:01
🔗
|
|
deevious has quit IRC (Ping timeout: 248 seconds) |
10:02
🔗
|
|
deevious has joined #archiveteam-bs |
10:28
🔗
|
|
oxguy3 has quit IRC (My MacBook has gone to sleep. ZZZzzz…) |
10:51
🔗
|
|
BlueMaxim has quit IRC (Read error: Connection reset by peer) |
10:57
🔗
|
|
ranma has quit IRC () |
11:30
🔗
|
|
ShellyRol has quit IRC (Read error: Connection reset by peer) |
11:33
🔗
|
|
ShellyRol has joined #archiveteam-bs |
12:05
🔗
|
|
Sora_Uta has joined #archiveteam-bs |
12:13
🔗
|
|
SoraUta has quit IRC (Ping timeout: 610 seconds) |
12:42
🔗
|
|
cerca has joined #archiveteam-bs |
13:01
🔗
|
|
killsushi has joined #archiveteam-bs |
13:07
🔗
|
|
tech234a has quit IRC (Remote host closed the connection) |
13:07
🔗
|
|
LowLevelM has quit IRC (Remote host closed the connection) |
13:07
🔗
|
|
LowLevelM has joined #archiveteam-bs |
13:50
🔗
|
|
oxguy3 has joined #archiveteam-bs |
14:23
🔗
|
|
oxguy3 has quit IRC (Quit: My MacBook has gone to sleep. ZZZzzz…) |
14:25
🔗
|
|
Sora_Uta has quit IRC (Ping timeout: 610 seconds) |
14:31
🔗
|
|
Video_ has joined #archiveteam-bs |
14:35
🔗
|
|
Video has quit IRC (Ping timeout: 255 seconds) |
14:45
🔗
|
|
deevious has quit IRC (Quit: deevious) |
15:18
🔗
|
|
sep332 has joined #archiveteam-bs |
15:30
🔗
|
|
DigiDigi has quit IRC (Quit: Leaving) |
15:37
🔗
|
|
Frogging has quit IRC (Close the World, Open the nExt) |
15:39
🔗
|
|
Frogging has joined #archiveteam-bs |
15:46
🔗
|
|
deevious has joined #archiveteam-bs |
16:15
🔗
|
|
DogsRNice has joined #archiveteam-bs |
16:17
🔗
|
|
DigiDigi has joined #archiveteam-bs |
16:29
🔗
|
|
tech234a has joined #archiveteam-bs |
16:29
🔗
|
|
tech234a3 has joined #archiveteam-bs |
16:30
🔗
|
|
tech234a5 has joined #archiveteam-bs |
16:32
🔗
|
|
tech234a5 has quit IRC (Client Quit) |
16:32
🔗
|
|
tech234a8 has joined #archiveteam-bs |
16:34
🔗
|
|
tech234a8 has quit IRC (Client Quit) |
16:41
🔗
|
|
tech234a has quit IRC (Ping timeout: 745 seconds) |
16:42
🔗
|
|
tech234a3 has quit IRC (Ping timeout: 745 seconds) |
16:57
🔗
|
|
DigiDigi has quit IRC (Remote host closed the connection) |
17:00
🔗
|
|
DigiDigi has joined #archiveteam-bs |
17:03
🔗
|
betamax |
Hi all! |
17:03
🔗
|
betamax |
The BBC are shutting down their "digital teletext" service, "Red Button Text", early next year. |
17:03
🔗
|
betamax |
I'm trying to archive as much as I can using computers with TV tuner cards, but - being in Scotland - I can't receive the main 'BBC One' channel, only 'BBC One Scotland'. |
17:03
🔗
|
betamax |
Anyone in the UK have a computer with a TV tuner card and want to help out? |
17:03
🔗
|
betamax |
I'm particularly keen to try and get the teletext coverage of tonight's general election coverage. |
17:03
🔗
|
betamax |
(yeah, I know, I've made sure there's loads of time to prepare /s) |
17:03
🔗
|
betamax |
Instructions on the wiki: https://www.archiveteam.org/index.php?title=Red_Button_Text |
17:04
🔗
|
|
X-Scale` has joined #archiveteam-bs |
17:06
🔗
|
jrwr |
betamax: got a IRC channel for it? |
17:06
🔗
|
markedL |
what happens to this signal when the channels go onto cableTV lines? |
17:06
🔗
|
jrwr |
Its a teletype service |
17:06
🔗
|
jrwr |
so its pretty much Modem over NTSC |
17:06
🔗
|
jrwr |
PAL* |
17:09
🔗
|
betamax |
no IRC channel, right now it's just me fiddling around :) |
17:10
🔗
|
jrwr |
I've asked SketchCow to tweet your project to get more exposure |
17:10
🔗
|
jrwr |
:) |
17:10
🔗
|
betamax |
thanks! |
17:11
🔗
|
betamax |
I'm going to get two more TV tuner cards setup this evening, but short of moving to England, there's no way I can get the regular BBC One |
17:12
🔗
|
|
X-Scale has quit IRC (Ping timeout: 610 seconds) |
17:12
🔗
|
|
X-Scale` is now known as X-Scale |
17:12
🔗
|
SootBectr |
Do you know if those set-top recorders that you can pull a .ts file from would include the data? |
17:13
🔗
|
SootBectr |
aka DVR boxes |
17:16
🔗
|
betamax |
no clue I'm afraid |
17:17
🔗
|
SootBectr |
In any case, I'm not sure I know anyone who still has one with a USB port to pull files off easily |
17:19
🔗
|
betamax |
I'm using Raspberry Pi's with the DVB Hat. Frustratingly, I've actually got more Pi's and DVB Hats than I do microSD cards right now |
17:20
🔗
|
betamax |
although I'm also limited by the number of TV aerial sockets in the house |
17:21
🔗
|
SootBectr |
Local poundshop might still have some powered splitters? |
17:21
🔗
|
jrwr |
betamax: https://twitter.com/textfiles/status/1205174593692061698 |
17:22
🔗
|
betamax |
I've got splitters running into splitters already :) |
17:22
🔗
|
markedL |
yeah you need a channel name otherwise people won't know where to go to converse |
17:24
🔗
|
betamax |
hmm, is #redbutton too boring? |
17:24
🔗
|
SootBectr |
#deadbutton perhaps |
17:24
🔗
|
betamax |
I like it! |
17:39
🔗
|
Frogging |
nginx devs have been detained in Russia. https://www.zdnet.com/article/russian-police-raid-nginx-moscow-office/ |
18:11
🔗
|
|
Medowar has quit IRC (Quit: Connection closed for inactivity) |
18:17
🔗
|
|
Nick-PC has quit IRC (Ping timeout: 248 seconds) |
18:21
🔗
|
|
oxguy3 has joined #archiveteam-bs |
18:47
🔗
|
Ryz |
Huh, something new's going on the The Game Awards 2019, 48-hr time limited demos in something called 'The Game Festival': https://medium.com/@geoffkeighley/introducing-the-game-festival-9d319cf7a579 |
18:48
🔗
|
Ryz |
...Hopefully those games will be grabbed for preservation purposes~ |
18:48
🔗
|
Ryz |
Well, video game demos |
19:05
🔗
|
|
schbirid has joined #archiveteam-bs |
19:20
🔗
|
SketchCow |
I'd like to ask someone to run youtube-dl for me |
19:20
🔗
|
jrwr |
Syre |
19:20
🔗
|
jrwr |
Whats up SketchCow |
19:20
🔗
|
SketchCow |
I know, I never ask this, but if someone could do it, so i can get this off the list. |
19:22
🔗
|
Frogging |
I can if the content is accessible from Canada |
19:24
🔗
|
Frogging |
actually never mind the Canada bit, I'll figure something out if there's a problem |
19:26
🔗
|
Frogging |
(geoblocking is on my mind because someone just sent me a YT link that I couldn't view -_-) |
19:27
🔗
|
SootBectr |
You could run Tor browser and then youtube-dl --proxy socks5://127.0.0.1:9150/ |
19:30
🔗
|
jrwr |
I sicked the YT Archive bots at it |
19:30
🔗
|
jrwr |
they are going to town on the channels right now |
19:30
🔗
|
Frogging |
ah, good |
19:32
🔗
|
Ryz |
jrwr, be persistent in running those continously |
19:32
🔗
|
jrwr |
Ya |
19:32
🔗
|
Ryz |
Like repeat it again and again until all of it is actually in |
19:33
🔗
|
Ryz |
Since archiving efforts of those individual channels may be cut early |
19:33
🔗
|
jrwr |
Im also mirroring them to my seedbox as well |
19:35
🔗
|
SootBectr |
What's the channel? |
19:37
🔗
|
|
tech234a has joined #archiveteam-bs |
19:48
🔗
|
|
X-Scale` has joined #archiveteam-bs |
19:52
🔗
|
|
X-Scale has quit IRC (Read error: Operation timed out) |
19:52
🔗
|
|
X-Scale` is now known as X-Scale |
20:57
🔗
|
|
d5f4a3622 has quit IRC (Read error: Connection reset by peer) |
20:59
🔗
|
|
d5f4a3622 has joined #archiveteam-bs |
21:05
🔗
|
|
LowLevelM has quit IRC (Quit: The Lounge - https://thelounge.chat) |
21:05
🔗
|
|
tech234a has quit IRC (Quit: The Lounge - https://thelounge.chat) |
21:07
🔗
|
|
LowLevelM has joined #archiveteam-bs |
21:12
🔗
|
oxguy3 |
ah, don't you just love it when the API lets you do things the frontend doesn't? currently downloading 8.5GB of data from a site that is supposed to have strict controls and logging for downloads, but none of it applies to me because i'm using special download links from the API ^_^ |
21:12
🔗
|
|
d5f4a3622 has quit IRC (Read error: Connection reset by peer) |
21:17
🔗
|
oxguy3 |
according to the activity feed for my account on their website, i have downloaded 14 files. but according to `watch "ls -1 | wc -l"`, i have downloaded 1335 files and counting. i was worried that ripped their entire site would get me banned, but they'll probably never even notice that a site rip occurred |
21:20
🔗
|
|
d5f4a3622 has joined #archiveteam-bs |
21:23
🔗
|
|
schbirid has quit IRC (Quit: Leaving) |
21:38
🔗
|
|
fuzzy8021 has quit IRC (Read error: Connection reset by peer) |
21:38
🔗
|
|
fuzzy8021 has joined #archiveteam-bs |
22:11
🔗
|
|
Wingy has joined #archiveteam-bs |
22:12
🔗
|
Wingy |
JAA: ComicBookDB works rarely. It must only have a few connections. |
22:13
🔗
|
JAA |
Or someone is very aggressively scraping or archiving them. |
22:20
🔗
|
Wingy |
Yeah |
22:40
🔗
|
|
BlueMax has joined #archiveteam-bs |
22:56
🔗
|
|
icedice has joined #archiveteam-bs |
23:03
🔗
|
Ryz |
Reminder! On 2020 January 01: The developerWorks Connections section of the IBM will no longer be available; it looks very very big D: - https://developer.ibm.com/code/dw-connections-sunset-faq/ |
23:08
🔗
|
OrIdow6 |
Lots of JS |
23:28
🔗
|
oxguy3 |
well that was a fun archival project. here is every branding asset the NCAA has produced in the last five years in original vector format: https://archive.org/details/ncaa-assets |
23:31
🔗
|
oxguy3 |
hmm, should i not have uploaded it as a tar.gz file? i notice i can't view the contents of it online (which i know it lets you do for zip files) |
23:43
🔗
|
oxguy3 |
ah, i've realized #archiveteam-ot is a thing. sorry for the off-topic spam |
23:56
🔗
|
|
Sora_Uta has joined #archiveteam-bs |