[00:07] so i couldn't get bluebird card to work with petreon without registering it [00:07] and bluebird.com couldn't take my info after i filled everything out [00:31] https://archive.org/details/disneynews&tab=collection [00:31] i noticed that awhile ago [00:32] SketchCow: i'm setting up my patreon page so can get more vhs tapes to digitize [00:36] *** Mateon1 has joined #archiveteam-bs [00:36] *** Dimtree has quit IRC (Read error: Operation timed out) [00:41] *** Dimtree has joined #archiveteam-bs [00:44] SketchCow: https://www.patreon.com/godane [00:45] *** BlueMaxim has joined #archiveteam-bs [00:54] *** antomatic has joined #archiveteam-bs [00:54] *** swebb sets mode: +o antomatic [00:55] *** Dimtree has quit IRC (Read error: Operation timed out) [01:01] *** Dimtree has joined #archiveteam-bs [01:42] *** ruunyan has joined #archiveteam-bs [01:51] *** zyphlar has joined #archiveteam-bs [03:35] *** dashcloud has quit IRC (Read error: Operation timed out) [03:38] *** dashcloud has joined #archiveteam-bs [03:48] *** Dimtree has quit IRC (Ping timeout: 506 seconds) [04:00] *** fie has quit IRC (Quit: Leaving) [04:10] *** Dimtree has joined #archiveteam-bs [04:35] *** Sk1d has quit IRC (Ping timeout: 194 seconds) [04:41] *** Sk1d has joined #archiveteam-bs [05:02] *** DFJustin has quit IRC (Remote host closed the connection) [05:02] *** DFJustin has joined #archiveteam-bs [05:02] *** swebb sets mode: +o DFJustin [05:09] *** zyphlar has quit IRC (Quit: Connection closed for inactivity) [05:29] *** dashcloud has quit IRC (Read error: Operation timed out) [05:33] *** dashcloud has joined #archiveteam-bs [05:57] *** eprillios has quit IRC (Ping timeout: 506 seconds) [06:00] *** eprillios has joined #archiveteam-bs [06:22] *** schbirid has joined #archiveteam-bs [06:28] *** ruunyan has quit IRC (Quit: meow) [06:36] *** nyaomi has joined #archiveteam-bs [06:39] *** zyphlar has joined #archiveteam-bs [06:52] *** schbirid has quit IRC (Quit: Leaving) [07:40] *** Dimtree has quit IRC (Read error: Operation timed out) [07:44] *** Dimtree has joined #archiveteam-bs [07:48] *** BlueMaxim has quit IRC (Ping timeout: 255 seconds) [07:49] *** BlueMaxim has joined #archiveteam-bs [08:46] *** Dimtree has quit IRC (Peace) [08:51] *** Dimtree has joined #archiveteam-bs [09:02] *** Dimtree has quit IRC (Read error: Operation timed out) [09:05] *** Dimtree has joined #archiveteam-bs [09:29] *** zyphlar has quit IRC (Quit: Connection closed for inactivity) [10:12] My Dead Format scraper isn't even close to done yet, but it already discovered 10.9k users (out of 12.3k total according to the homepage). :-) [11:01] *** pizzaiolo has joined #archiveteam-bs [11:06] *** pizzaiolo has quit IRC (Client Quit) [11:08] *** pizzaiolo has joined #archiveteam-bs [11:13] *** pizzaiolo has quit IRC (Client Quit) [11:24] *** BlueMaxim has quit IRC (Read error: Connection reset by peer) [12:28] here are all the tapes on archive.org that i digitize so far: https://pastebin.com/SAzZth7J [12:32] nice job! [12:33] i have a patreon page to get money to buy tapes off ebay: https://www.patreon.com/godane [12:42] *** Mateon1 has quit IRC (Ping timeout: 255 seconds) [13:58] JAA: make sure you doublecheck that it's actually getting all results :) [13:58] JAA: the scraper I wrote was for a search that allowed like 50 results max, so the variance in letter usage made quite an impact [13:58] if you can get more results out of your target, the adapting thing might indeed not be necessary [14:00] *** Mateon1 has joined #archiveteam-bs [14:37] *** Stilett0- has joined #archiveteam-bs [14:55] *** RichardG has quit IRC (Ping timeout: 255 seconds) [14:57] my twitter account: https://twitter.com/ArchiveGodane [14:57] *** RichardG has joined #archiveteam-bs [14:57] i put a twit out to help get my patreon campaign going [15:01] i hope when SketchCow gets better he can retweet my campaign [15:02] i really suck at social networking stuff anyways [15:18] *** Fusl has quit IRC (Ping timeout: 250 seconds) [15:26] joepie91: The problem isn't that certain search terms don't work. I could just make 26 queries for a* through z* and handle the pagination. But that would be extremely slow because it takes the server a very long time to retrieve those records from the database. [15:27] Also, searches for bla*, blac*, and black* are almost equally slow. But searching for blacka*, blackb*, etc. obviously won't find records with the word "black". So I can't really go too deep either. [15:36] *** Fletcher has quit IRC (Read error: Operation timed out) [15:41] *** schbirid has joined #archiveteam-bs [15:43] I rewrote my scraper earlier today. It now uses aiohttp and multiple connections. In less than three hours, it has already surpassed the progress my other script has made since yesterday. [15:44] and i keep getting cockblocked by wpull bugs :( [15:44] *** Fletcher has joined #archiveteam-bs [15:45] Yeah, I'm pretty glad I didn't use wpull for this one. [15:53] *** icedice has joined #archiveteam-bs [16:01] *** Fletcher has quit IRC (Remote host closed the connection) [16:02] *** cf has quit IRC (segfaulted) [16:06] *** cf has joined #archiveteam-bs [16:17] *** Jonison has joined #archiveteam-bs [16:51] *** schbirid has quit IRC (Quit: Leaving) [16:58] Hmm, I guess I should've used multiple aiohttp sessions. [17:07] *** icedice has quit IRC (Quit: Leaving) [17:07] *** Stilett0- is now known as Stiletto [17:16] *** brayden has quit IRC (Read error: Connection reset by peer) [17:45] *** icedice has joined #archiveteam-bs [20:03] *** icedice has quit IRC (Ping timeout: 260 seconds) [21:02] *** Jonison has quit IRC (Read error: Connection reset by peer) [22:23] https://i.mundus.xyz/2ZEbM7.png [22:23] *** dashcloud has quit IRC (Read error: Operation timed out) [22:25] *** dashcloud has joined #archiveteam-bs [22:28] *** Mateon1 has quit IRC (Read error: Operation timed out) [22:30] *** Mateon1 has joined #archiveteam-bs [22:32] mundus: lol [22:32] yep [22:37] mundus: maybe put a robots.txt so google doesn't crawl it [22:38] good idea [22:44] *** RichardG has quit IRC (Read error: Connection reset by peer) [22:50] *** RichardG has joined #archiveteam-bs [23:15] the future is here: https://twitter.com/a_antonellis/status/912428669230043136 [23:16] *** RichardG has quit IRC (Read error: Operation timed out) [23:16] *** RichardG has joined #archiveteam-bs [23:27] that's heckin rad [23:38] *** icedice has joined #archiveteam-bs [23:43] *** Mateon1 has quit IRC (Ping timeout: 245 seconds) [23:43] *** Mateon1 has joined #archiveteam-bs [23:49] *** BlueMaxim has joined #archiveteam-bs [23:55] *** Asparagir has joined #archiveteam-bs