Time |
Nickname |
Message |
00:14
🔗
|
|
BlueMax has joined #archiveteam-bs |
00:21
🔗
|
|
Asparag-1 has joined #archiveteam-bs |
00:22
🔗
|
|
Asparagir has quit IRC (Read error: Operation timed out) |
00:41
🔗
|
|
Asparag-1 has quit IRC (Asparag-1) |
01:44
🔗
|
|
RichardG_ has joined #archiveteam-bs |
01:44
🔗
|
|
ld1 has quit IRC (Quit: ld1) |
01:44
🔗
|
|
RichardG has quit IRC (Ping timeout: 250 seconds) |
01:50
🔗
|
|
ld1 has joined #archiveteam-bs |
01:55
🔗
|
|
vitzli has joined #archiveteam-bs |
02:24
🔗
|
rbraun |
riking: apparently using gateways may also require team admin approval and unlike apps it's off by default |
02:31
🔗
|
|
vitzli has quit IRC (Leaving) |
02:37
🔗
|
Stilett0- |
you guys have any tools for backing up youtube channels? |
02:37
🔗
|
|
Stilett0- is now known as Stiletto |
02:37
🔗
|
Stiletto |
that's better |
02:42
🔗
|
bithippo |
Backup locally? |
02:50
🔗
|
|
RichardG_ has quit IRC (Read error: Connection reset by peer) |
02:51
🔗
|
|
RichardG has joined #archiveteam-bs |
02:52
🔗
|
Stiletto |
yeah |
02:57
🔗
|
bithippo |
https://github.com/rg3/youtube-dl |
03:10
🔗
|
hook54321 |
bithippo: Would a script that's like tubeup except it waits until a video goes down before uploading it be possible? |
03:11
🔗
|
bithippo |
:thinking: |
03:11
🔗
|
bithippo |
So you'd download the video and all metadata locally, and only upon further download attempts in the future (that failed due to the video being deleted) would it be uploaded to IA? |
03:17
🔗
|
hook54321 |
bithippo: Well, not necessarily try to download it again, but like, check if the video has been taken down from youtube, and then if it has been taken down upload it. |
03:20
🔗
|
|
Ravenloft has quit IRC (Read error: Connection reset by peer) |
03:26
🔗
|
|
godane has quit IRC (Ping timeout: 250 seconds) |
03:29
🔗
|
bithippo |
@hook54321 Sorry, I should've been more specific. |
03:29
🔗
|
bithippo |
I'd have to see what sort of API call you'd back to Youtube to detect video availability. |
03:30
🔗
|
bithippo |
https://stackoverflow.com/a/32503070 |
03:31
🔗
|
bithippo |
Without the Youtube API, looks like the way to do it is attempt to GET the video thumbnail, which will 404 if the video is no longer available. |
03:31
🔗
|
Stiletto |
I didn't know youtube-dl could do a whole channel at once |
03:31
🔗
|
bithippo |
youtube-dl is powerful |
03:32
🔗
|
Stiletto |
that powerful? |
03:33
🔗
|
bithippo |
Can grab entire channels, playlists, and supports a plethora of video sites. |
03:33
🔗
|
bithippo |
Rarely do I run into a piece of content I can't extract with it. |
03:34
🔗
|
bithippo |
Can also dump channel metadata out as JSON. |
03:34
🔗
|
bithippo |
I'd argue it's becoming a first class tool similar to wget and curl. |
03:35
🔗
|
bithippo |
(honorable mention: httpie) |
03:43
🔗
|
|
godane has joined #archiveteam-bs |
04:00
🔗
|
|
Mayonaise has quit IRC (Read error: Connection reset by peer) |
04:01
🔗
|
|
godane has quit IRC (Quit: Leaving.) |
04:09
🔗
|
|
qw3rty116 has joined #archiveteam-bs |
04:11
🔗
|
|
Mayonaise has joined #archiveteam-bs |
04:13
🔗
|
|
qw3rty115 has quit IRC (Read error: Operation timed out) |
04:35
🔗
|
|
Mateon1 has quit IRC (Read error: Operation timed out) |
04:35
🔗
|
|
Mateon1 has joined #archiveteam-bs |
05:07
🔗
|
|
godane has joined #archiveteam-bs |
05:57
🔗
|
|
godane has quit IRC (Ping timeout: 633 seconds) |
06:02
🔗
|
|
odemg has quit IRC (Read error: Operation timed out) |
06:06
🔗
|
|
fie has quit IRC (Ping timeout: 600 seconds) |
06:12
🔗
|
|
odemg has joined #archiveteam-bs |
06:55
🔗
|
|
bithippo has quit IRC (My MacBook Air has gone to sleep. ZZZzzz…) |
07:18
🔗
|
|
Aoede has quit IRC (Ping timeout: 250 seconds) |
07:18
🔗
|
|
Aoede has joined #archiveteam-bs |
07:18
🔗
|
|
Rai-chan has quit IRC (Ping timeout: 250 seconds) |
08:33
🔗
|
|
schbirid has joined #archiveteam-bs |
09:16
🔗
|
|
Stiletto has quit IRC (Ping timeout: 250 seconds) |
09:53
🔗
|
|
BlueMax has quit IRC (Read error: Connection reset by peer) |
10:37
🔗
|
|
Rai-chan has joined #archiveteam-bs |
12:06
🔗
|
|
rsznick has joined #archiveteam-bs |
12:09
🔗
|
|
rsznikk has joined #archiveteam-bs |
12:12
🔗
|
|
rsznik has quit IRC (Read error: Operation timed out) |
12:13
🔗
|
|
rsznick has quit IRC (Read error: Operation timed out) |
13:49
🔗
|
JAA |
PurpleSym: FYI, I finally got around to taking a look at the Instagram script. Doesn't work anymore, unfortunately. |
13:49
🔗
|
PurpleSym |
Meh, too bad :( |
13:50
🔗
|
JAA |
I found a very easy method to do the scraping though. Just request the profile page with an __a=1 parameter, gives you a JSON. To get later pages, you can use the max_id parameter set to the last post that was retrieved already. |
13:51
🔗
|
JAA |
E.g. https://www.instagram.com/elonmusk/?__a=1 -> https://www.instagram.com/elonmusk/?__a=1&max_id=1709068240325503498 |
13:56
🔗
|
JAA |
I'll implement this now, rather than trying to fiddle around with GraphQL. There's an annoying query_hash parameter in those requests, and I didn't see what it's supposed to be. Obfuscated JS hell... |
14:33
🔗
|
Jon |
for mediawiki backups, I'm going to refresh the dumps I did a year ago. should I update the existing archicve.org entries with the new dumps, or create new ones? |
14:35
🔗
|
|
jtn2 has quit IRC (Read error: Operation timed out) |
14:38
🔗
|
|
jtn2 has joined #archiveteam-bs |
14:40
🔗
|
phuzion |
Does anyone know offhand the model of CD drive that Jason uses to mass-rip a stack of CDs? I've tried searching his twitter and ascii.textfiles.com but I can't seem to find it. |
14:43
🔗
|
Laverne |
looks like it's a http://www.acronova.com/product/auto-blu-ray-duplicator-publisher-ripper-nimbie-usb-nb21/9/review.html |
14:48
🔗
|
phuzion |
Thanks a bunch. |
15:11
🔗
|
|
Igloo has quit IRC (Remote host closed the connection) |
15:15
🔗
|
|
odemg has quit IRC (Read error: Operation timed out) |
15:52
🔗
|
|
bithippo has joined #archiveteam-bs |
15:54
🔗
|
|
bithippo has quit IRC (Client Quit) |
15:56
🔗
|
|
bithippo has joined #archiveteam-bs |
16:02
🔗
|
|
chfoo has quit IRC (LoveChatot) |
16:02
🔗
|
|
svchfoo1 has quit IRC (Remote host closed the connection) |
16:11
🔗
|
|
chfoo has joined #archiveteam-bs |
16:11
🔗
|
JAA |
Great... Last December, Facebook "accidentally" broke the Graph API such that you can no longer discover all posts through it. At least that's what I gather from https://github.com/minimaxir/facebook-page-post-scraper (can't read the bug report itself since it requires a login). |
16:53
🔗
|
Frogging |
seems to be a recurring theme in social media APIs |
16:53
🔗
|
Frogging |
oops we changed something and now there's no way to get complete data |
16:54
🔗
|
Frogging |
please call the complaints department at 1800-dev-null |
16:58
🔗
|
|
Jonimus has quit IRC (WeeChat 1.4) |
17:03
🔗
|
JAA |
That, and also "Sorry, you'll have to go through the totally separate and unrelated company X now, which will happily sell you the data you're after." |
17:06
🔗
|
JAA |
Interestingly, there's no such thing for Reddit yet as far as I know. |
17:07
🔗
|
JAA |
Even though they crippled the search UI last year and are in the process of removing timestamp-based searches through the API (which was the only way to get around the 1000 threads limit for a while now)... |
17:21
🔗
|
bithippo |
The walled garden is having existential anxiety. |
17:42
🔗
|
|
RichardG has quit IRC (Read error: Connection reset by peer) |
17:44
🔗
|
|
RichardG has joined #archiveteam-bs |
17:56
🔗
|
|
godane has joined #archiveteam-bs |
18:19
🔗
|
|
BnARobin_ has quit IRC (Read error: Operation timed out) |
18:32
🔗
|
|
BnARobin has joined #archiveteam-bs |
18:34
🔗
|
|
odemg has joined #archiveteam-bs |
18:38
🔗
|
|
BnARobin has quit IRC (Remote host closed the connection) |
18:38
🔗
|
|
BnARobin has joined #archiveteam-bs |
18:44
🔗
|
|
BnARobin has quit IRC (Remote host closed the connection) |
18:44
🔗
|
|
BnARobin has joined #archiveteam-bs |
18:52
🔗
|
|
BnARobin has quit IRC (Remote host closed the connection) |
18:52
🔗
|
|
BnARobin has joined #archiveteam-bs |
18:58
🔗
|
|
kisspunch has quit IRC (Quit: ZNC - http://znc.in) |
19:02
🔗
|
|
BnARobin has quit IRC (Remote host closed the connection) |
19:03
🔗
|
|
BnARobin has joined #archiveteam-bs |
19:05
🔗
|
|
kisspunch has joined #archiveteam-bs |
19:09
🔗
|
|
BnARobin has quit IRC (Remote host closed the connection) |
19:09
🔗
|
|
BnARobin has joined #archiveteam-bs |
19:18
🔗
|
|
BnARobin has quit IRC (Remote host closed the connection) |
19:18
🔗
|
|
BnARobin has joined #archiveteam-bs |
19:24
🔗
|
|
BnARobin has quit IRC (Remote host closed the connection) |
19:24
🔗
|
|
BnARobin has joined #archiveteam-bs |
19:28
🔗
|
|
jschwart has joined #archiveteam-bs |
19:31
🔗
|
godane |
so i lost power last night for maybe 3 hours |
19:32
🔗
|
godane |
i had to go out of the snow storm to help my brother dig out the generator |
19:39
🔗
|
godane |
anyways i'm at 20k items now |
19:40
🔗
|
godane |
i only 8 days into march and i have about 1/3 of feb |
19:40
🔗
|
godane |
i was at 60k items in 2018-02 |
19:41
🔗
|
godane |
i just need another 40k items to make number 2 in my grab collections |
19:42
🔗
|
odemg |
Does archive.org have anything on 'http://files.filefront.com/godlike_132rar/;4965816;;/fileinfo.html' maybe I'm searching wrong. |
19:46
🔗
|
|
BnARobin has quit IRC (Remote host closed the connection) |
19:47
🔗
|
|
BnARobin has joined #archiveteam-bs |
19:52
🔗
|
|
BnARobin has quit IRC (Remote host closed the connection) |
19:53
🔗
|
|
BnARobin has joined #archiveteam-bs |
19:58
🔗
|
|
BnARobin has quit IRC (Remote host closed the connection) |
19:59
🔗
|
|
BnARobin has joined #archiveteam-bs |
20:04
🔗
|
|
BnARobin has quit IRC (Remote host closed the connection) |
20:04
🔗
|
|
BnARobin has joined #archiveteam-bs |
20:11
🔗
|
|
BnARobin has quit IRC (Remote host closed the connection) |
20:11
🔗
|
|
BnARobin has joined #archiveteam-bs |
20:28
🔗
|
|
BnARobin has quit IRC (Remote host closed the connection) |
20:29
🔗
|
|
BnARobin has joined #archiveteam-bs |
20:36
🔗
|
|
icedice has joined #archiveteam-bs |
20:53
🔗
|
|
BnARobin has quit IRC (Remote host closed the connection) |
20:53
🔗
|
|
BnARobin has joined #archiveteam-bs |
20:59
🔗
|
|
BnARobin has quit IRC (Remote host closed the connection) |
20:59
🔗
|
|
BnARobin has joined #archiveteam-bs |
21:05
🔗
|
|
BnARobin has quit IRC (Remote host closed the connection) |
21:05
🔗
|
|
BnARobin has joined #archiveteam-bs |
21:11
🔗
|
|
BnARobin has quit IRC (Remote host closed the connection) |
21:11
🔗
|
|
BnARobin has joined #archiveteam-bs |
21:20
🔗
|
|
BnARobin has quit IRC (Remote host closed the connection) |
21:20
🔗
|
|
BnARobin has joined #archiveteam-bs |
21:23
🔗
|
|
JAA sets mode: +b *!*BnARobin@*.bnaboyz.nl |
21:23
🔗
|
|
BnARobin was kicked by JAA (Please fix your connection.) |
22:02
🔗
|
|
schbirid has quit IRC (Quit: Leaving) |
22:57
🔗
|
|
jschwart has quit IRC (Quit: Konversation terminated!) |
23:07
🔗
|
|
Igloo has joined #archiveteam-bs |
23:07
🔗
|
|
Igloo has quit IRC (Client Quit) |
23:29
🔗
|
|
RichardG_ has joined #archiveteam-bs |
23:29
🔗
|
|
RichardG has quit IRC (Read error: Connection reset by peer) |