Time |
Nickname |
Message |
00:07
🔗
|
godane |
so i couldn't get bluebird card to work with petreon without registering it |
00:07
🔗
|
godane |
and bluebird.com couldn't take my info after i filled everything out |
00:31
🔗
|
SketchCow |
https://archive.org/details/disneynews&tab=collection |
00:31
🔗
|
godane |
i noticed that awhile ago |
00:32
🔗
|
godane |
SketchCow: i'm setting up my patreon page so can get more vhs tapes to digitize |
00:36
🔗
|
|
Mateon1 has joined #archiveteam-bs |
00:36
🔗
|
|
Dimtree has quit IRC (Read error: Operation timed out) |
00:41
🔗
|
|
Dimtree has joined #archiveteam-bs |
00:44
🔗
|
godane |
SketchCow: https://www.patreon.com/godane |
00:45
🔗
|
|
BlueMaxim has joined #archiveteam-bs |
00:54
🔗
|
|
antomatic has joined #archiveteam-bs |
00:54
🔗
|
|
swebb sets mode: +o antomatic |
00:55
🔗
|
|
Dimtree has quit IRC (Read error: Operation timed out) |
01:01
🔗
|
|
Dimtree has joined #archiveteam-bs |
01:42
🔗
|
|
ruunyan has joined #archiveteam-bs |
01:51
🔗
|
|
zyphlar has joined #archiveteam-bs |
03:35
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
03:38
🔗
|
|
dashcloud has joined #archiveteam-bs |
03:48
🔗
|
|
Dimtree has quit IRC (Ping timeout: 506 seconds) |
04:00
🔗
|
|
fie has quit IRC (Quit: Leaving) |
04:10
🔗
|
|
Dimtree has joined #archiveteam-bs |
04:35
🔗
|
|
Sk1d has quit IRC (Ping timeout: 194 seconds) |
04:41
🔗
|
|
Sk1d has joined #archiveteam-bs |
05:02
🔗
|
|
DFJustin has quit IRC (Remote host closed the connection) |
05:02
🔗
|
|
DFJustin has joined #archiveteam-bs |
05:02
🔗
|
|
swebb sets mode: +o DFJustin |
05:09
🔗
|
|
zyphlar has quit IRC (Quit: Connection closed for inactivity) |
05:29
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
05:33
🔗
|
|
dashcloud has joined #archiveteam-bs |
05:57
🔗
|
|
eprillios has quit IRC (Ping timeout: 506 seconds) |
06:00
🔗
|
|
eprillios has joined #archiveteam-bs |
06:22
🔗
|
|
schbirid has joined #archiveteam-bs |
06:28
🔗
|
|
ruunyan has quit IRC (Quit: meow) |
06:36
🔗
|
|
nyaomi has joined #archiveteam-bs |
06:39
🔗
|
|
zyphlar has joined #archiveteam-bs |
06:52
🔗
|
|
schbirid has quit IRC (Quit: Leaving) |
07:40
🔗
|
|
Dimtree has quit IRC (Read error: Operation timed out) |
07:44
🔗
|
|
Dimtree has joined #archiveteam-bs |
07:48
🔗
|
|
BlueMaxim has quit IRC (Ping timeout: 255 seconds) |
07:49
🔗
|
|
BlueMaxim has joined #archiveteam-bs |
08:46
🔗
|
|
Dimtree has quit IRC (Peace) |
08:51
🔗
|
|
Dimtree has joined #archiveteam-bs |
09:02
🔗
|
|
Dimtree has quit IRC (Read error: Operation timed out) |
09:05
🔗
|
|
Dimtree has joined #archiveteam-bs |
09:29
🔗
|
|
zyphlar has quit IRC (Quit: Connection closed for inactivity) |
10:12
🔗
|
JAA |
My Dead Format scraper isn't even close to done yet, but it already discovered 10.9k users (out of 12.3k total according to the homepage). :-) |
11:01
🔗
|
|
pizzaiolo has joined #archiveteam-bs |
11:06
🔗
|
|
pizzaiolo has quit IRC (Client Quit) |
11:08
🔗
|
|
pizzaiolo has joined #archiveteam-bs |
11:13
🔗
|
|
pizzaiolo has quit IRC (Client Quit) |
11:24
🔗
|
|
BlueMaxim has quit IRC (Read error: Connection reset by peer) |
12:28
🔗
|
godane |
here are all the tapes on archive.org that i digitize so far: https://pastebin.com/SAzZth7J |
12:32
🔗
|
Aoede |
nice job! |
12:33
🔗
|
godane |
i have a patreon page to get money to buy tapes off ebay: https://www.patreon.com/godane |
12:42
🔗
|
|
Mateon1 has quit IRC (Ping timeout: 255 seconds) |
13:58
🔗
|
joepie91 |
JAA: make sure you doublecheck that it's actually getting all results :) |
13:58
🔗
|
joepie91 |
JAA: the scraper I wrote was for a search that allowed like 50 results max, so the variance in letter usage made quite an impact |
13:58
🔗
|
joepie91 |
if you can get more results out of your target, the adapting thing might indeed not be necessary |
14:00
🔗
|
|
Mateon1 has joined #archiveteam-bs |
14:37
🔗
|
|
Stilett0- has joined #archiveteam-bs |
14:55
🔗
|
|
RichardG has quit IRC (Ping timeout: 255 seconds) |
14:57
🔗
|
godane |
my twitter account: https://twitter.com/ArchiveGodane |
14:57
🔗
|
|
RichardG has joined #archiveteam-bs |
14:57
🔗
|
godane |
i put a twit out to help get my patreon campaign going |
15:01
🔗
|
godane |
i hope when SketchCow gets better he can retweet my campaign |
15:02
🔗
|
godane |
i really suck at social networking stuff anyways |
15:18
🔗
|
|
Fusl has quit IRC (Ping timeout: 250 seconds) |
15:26
🔗
|
JAA |
joepie91: The problem isn't that certain search terms don't work. I could just make 26 queries for a* through z* and handle the pagination. But that would be extremely slow because it takes the server a very long time to retrieve those records from the database. |
15:27
🔗
|
JAA |
Also, searches for bla*, blac*, and black* are almost equally slow. But searching for blacka*, blackb*, etc. obviously won't find records with the word "black". So I can't really go too deep either. |
15:36
🔗
|
|
Fletcher has quit IRC (Read error: Operation timed out) |
15:41
🔗
|
|
schbirid has joined #archiveteam-bs |
15:43
🔗
|
JAA |
I rewrote my scraper earlier today. It now uses aiohttp and multiple connections. In less than three hours, it has already surpassed the progress my other script has made since yesterday. |
15:44
🔗
|
schbirid |
and i keep getting cockblocked by wpull bugs :( |
15:44
🔗
|
|
Fletcher has joined #archiveteam-bs |
15:45
🔗
|
JAA |
Yeah, I'm pretty glad I didn't use wpull for this one. |
15:53
🔗
|
|
icedice has joined #archiveteam-bs |
16:01
🔗
|
|
Fletcher has quit IRC (Remote host closed the connection) |
16:02
🔗
|
|
cf has quit IRC (segfaulted) |
16:06
🔗
|
|
cf has joined #archiveteam-bs |
16:17
🔗
|
|
Jonison has joined #archiveteam-bs |
16:51
🔗
|
|
schbirid has quit IRC (Quit: Leaving) |
16:58
🔗
|
JAA |
Hmm, I guess I should've used multiple aiohttp sessions. |
17:07
🔗
|
|
icedice has quit IRC (Quit: Leaving) |
17:07
🔗
|
|
Stilett0- is now known as Stiletto |
17:16
🔗
|
|
brayden has quit IRC (Read error: Connection reset by peer) |
17:45
🔗
|
|
icedice has joined #archiveteam-bs |
20:03
🔗
|
|
icedice has quit IRC (Ping timeout: 260 seconds) |
21:02
🔗
|
|
Jonison has quit IRC (Read error: Connection reset by peer) |
22:23
🔗
|
mundus |
https://i.mundus.xyz/2ZEbM7.png |
22:23
🔗
|
|
dashcloud has quit IRC (Read error: Operation timed out) |
22:25
🔗
|
|
dashcloud has joined #archiveteam-bs |
22:28
🔗
|
|
Mateon1 has quit IRC (Read error: Operation timed out) |
22:30
🔗
|
|
Mateon1 has joined #archiveteam-bs |
22:32
🔗
|
Frogging |
mundus: lol |
22:32
🔗
|
mundus |
yep |
22:37
🔗
|
Frogging |
mundus: maybe put a robots.txt so google doesn't crawl it |
22:38
🔗
|
mundus |
good idea |
22:44
🔗
|
|
RichardG has quit IRC (Read error: Connection reset by peer) |
22:50
🔗
|
|
RichardG has joined #archiveteam-bs |
23:15
🔗
|
joepie91 |
the future is here: https://twitter.com/a_antonellis/status/912428669230043136 |
23:16
🔗
|
|
RichardG has quit IRC (Read error: Operation timed out) |
23:16
🔗
|
|
RichardG has joined #archiveteam-bs |
23:27
🔗
|
astrid |
that's heckin rad |
23:38
🔗
|
|
icedice has joined #archiveteam-bs |
23:43
🔗
|
|
Mateon1 has quit IRC (Ping timeout: 245 seconds) |
23:43
🔗
|
|
Mateon1 has joined #archiveteam-bs |
23:49
🔗
|
|
BlueMaxim has joined #archiveteam-bs |
23:55
🔗
|
|
Asparagir has joined #archiveteam-bs |