[00:27] weak spambot [00:29] Quick though, invited me to #JAAisanigger17 within seconds after the first kickban. [00:29] Well, probably that's scripted as well [00:31] plenty of spamming at #internetarchive right now [00:35] *** Pixi has quit IRC (Ping timeout: 255 seconds) [00:38] *** Pixi has joined #archiveteam-bs [00:38] t2t2: Yeah, we should try to get someone to spread the ops there. [00:39] JAA: The person running that youtube show apparently isn't the person spamming in the IRC channels. [00:40] Weird [00:40] "Hey dude, can you promote my show on EFNet please?" [00:41] nah, I went onto the stream's live chat and he answered with something about some retard doing it [00:42] Ah [00:42] And then someone said "If you're wondering why you're getting spammed this is why : https://pastebin.com/5YD0uLsS" [00:43] I don't have time to try to understand what's going on in that cancerous discussion right now though [00:45] oh, it's a botnet. https://pastebin.com/GvqPpgMn [00:54] so i got a new vcr and i can capture from it [00:54] the only bad news is i its not keeping track of time on tape [01:17] so the cbs wizard of oz tape doesn't get picture on this vcr either [01:17] its a 1988 cbs airing of wizard of oz [01:53] *** ld1 has quit IRC (Quit: ld1) [01:57] *** ld1 has joined #archiveteam-bs [02:19] hah, i'm getting the weirdest leechers on [MY-RAW] Belle and Sebastian (名犬ジョリィ) TV 01-52 (DVD 640x480 AVC_AAC) [02:20] dual audio: japanese and chinese, no subs [02:20] just got a leecher from Iran [02:22] so i'm grabbing Motor Sports back issues [02:22] gotten Oman, Russia, Kuwait, Saudi Arabia, UAE [02:22] https://www.motorsportmagazine.com/archive/issues/1925 [02:22] https://media.motorsportmagazine.com/archive/january-1925/full/2.jpg [02:26] you're not running into "You have now viewed 0/10 free articles" etc? [02:26] or is that only on articles so new? [02:27] oh, i guess it's hard to do that with images [02:27] there are not 10 free articles with images [02:27] its all free [02:28] nice [02:36] Not really difficult to implement that, but certainly more work than with HTML pages. [02:37] *** C4K3_ is now known as C4K3 [03:18] *** bwn has quit IRC (Read error: Connection reset by peer) [03:27] *** bwn has joined #archiveteam-bs [03:58] *** bwn has quit IRC (Quit) [04:15] *** bwn has joined #archiveteam-bs [04:30] *** ndiddy_ has quit IRC () [04:54] *** dashcloud has quit IRC (Read error: Operation timed out) [04:57] *** dashcloud has joined #archiveteam-bs [04:57] *** Valentine has quit IRC (Read error: Connection reset by peer) [05:00] *** qw3rty114 has joined #archiveteam-bs [05:02] *** Valentine has joined #archiveteam-bs [05:05] *** qw3rty113 has quit IRC (Read error: Operation timed out) [05:07] *** kimmer1 has quit IRC (Remote host closed the connection) [05:07] *** kimmer1 has joined #archiveteam-bs [05:17] *** prb has quit IRC () [05:21] *** kimmer12 has joined #archiveteam-bs [05:28] *** kimmer1 has quit IRC (Read error: Operation timed out) [05:53] *** kimmer12 has quit IRC (Remote host closed the connection) [05:53] *** kimmer1 has joined #archiveteam-bs [05:55] *** Rai-chan has joined #archiveteam-bs [05:55] *** purplebot has joined #archiveteam-bs [05:58] *** i0npulse has joined #archiveteam-bs [06:17] *** Asparagir has joined #archiveteam-bs [06:27] *** Pixi has quit IRC (Quit: Pixi) [06:33] *** Asparagir has quit IRC (Asparagir) [06:52] *** robink has quit IRC (Ping timeout: 506 seconds) [07:10] *** prb has joined #archiveteam-bs [07:14] *** prb has quit IRC (Remote host closed the connection!) [07:15] *** prb has joined #archiveteam-bs [07:29] *** dashcloud has quit IRC (Read error: Connection reset by peer) [07:30] *** dashcloud has joined #archiveteam-bs [07:44] *** schbirid has joined #archiveteam-bs [08:24] *** K4k_ has quit IRC (Read error: Connection reset by peer) [08:46] *** ZexaronS has joined #archiveteam-bs [10:21] *** BlueMaxim has quit IRC (Quit: Leaving) [12:12] *** pizzaiolo has joined #archiveteam-bs [12:28] *** schbirid has quit IRC (Ping timeout: 255 seconds) [12:40] *** schbirid has joined #archiveteam-bs [14:30] i'm temped to get this: https://www.ebay.com/itm/LOT-OF-40-POLAROID-PRE-RECORDED-VHS-TAPES-SOLD-AS-BLANKS-80S-90S/202151046847 [14:30] but the only thing i see i want is the 4 tapes with beyond 2000 on them [14:38] SketchCow: when am i getting the label to print to mail back your boxes? [15:15] *** ez has quit IRC (Read error: Operation timed out) [15:15] *** kimmer12 has joined #archiveteam-bs [15:18] *** ez has joined #archiveteam-bs [15:20] I will have the team mail you one. [15:20] 10lb, right [15:20] *** K4k has joined #archiveteam-bs [15:21] *** kimmer1 has quit IRC (Read error: Operation timed out) [15:24] yes [15:24] but just know i have 3 boxes [15:25] also think its weird to mail a shipping lable [15:25] *label [15:30] *** Pixi has joined #archiveteam-bs [15:49] *** dashcloud has quit IRC (Read error: Operation timed out) [15:49] *** dashcloud has joined #archiveteam-bs [15:49] *** Stilett0 has quit IRC (Read error: Operation timed out) [16:14] *** Stilett0 has joined #archiveteam-bs [16:35] *** dashcloud has quit IRC (Quit: No Ping reply in 180 seconds.) [16:37] *** dashcloud has joined #archiveteam-bs [17:03] *** kimmer12 has quit IRC (Ping timeout: 633 seconds) [17:12] *** Mateon1 has quit IRC (Ping timeout: 255 seconds) [17:12] *** Mateon1 has joined #archiveteam-bs [17:46] *** kimmer1 has joined #archiveteam-bs [17:53] *** Asparagir has joined #archiveteam-bs [18:01] *** Pixi has quit IRC (Quit: Pixi) [18:02] *** Pixi has joined #archiveteam-bs [18:04] *** jschwart has joined #archiveteam-bs [18:38] *** dashcloud has quit IRC (Read error: Connection reset by peer) [18:38] *** dashcloud has joined #archiveteam-bs [18:50] *** ola_norsk has joined #archiveteam-bs [18:57] *** pizzaiolo has quit IRC (Remote host closed the connection) [19:09] *** icedice has joined #archiveteam-bs [19:10] *** Pixi has quit IRC (Quit: Pixi) [19:10] *** Pixi has joined #archiveteam-bs [19:11] *** svchfoo1 has quit IRC (Remote host closed the connection) [19:13] Is there a particular reason why we don't have an auto-opper bot in #archiveteam (arguably the most important of our channels)? [19:27] *** svchfoo1 has joined #archiveteam-bs [19:28] *** svchfoo3 sets mode: +o svchfoo1 [19:29] *** svchfoo3 has quit IRC (Remote host closed the connection) [19:36] *** svchfoo3 has joined #archiveteam-bs [19:36] *** svchfoo1 sets mode: +o svchfoo3 [19:39] man the 'We will always have this youtube channel' is kind of kicking me in the nuts .. https://youtu.be/t5_dSCu_mLY [19:42] for a 7-10 old youtube channel, what could be a very estimated size at 720p videos? [19:43] depends how many videos there are [19:43] a 10 year old channel could have very many videos, very few, or anywhere in between :p [19:43] And the average video length [19:43] that too [19:46] ~620 [19:47] *** dashcloud has quit IRC (Read error: Operation timed out) [19:47] i don't know how exact youtube search is https://www.youtube.com/results?search_query=theangrygrandpashow [19:48] eh, who knows. it's sometimes very inexact and unpredictable in my experience [19:48] (i was kind of expecting it to be more, but i see there's subchannels like 'granpa's corner') [19:48] aye [19:48] the sorting order changes the number and content of results [19:48] it's ridiculous [19:51] *** dashcloud has joined #archiveteam-bs [19:51] one tip courtesy of google search say to hover over username for a couple of seconds, but that seems to be no longer a feature on YT [19:52] You should be able to find out by finding the channel's "all videos" playlist [19:53] Select "Play All" under the "Videos" tab of the channel page [19:53] *** icedice has quit IRC (Read error: Connection reset by peer) [19:53] *** icedice has joined #archiveteam-bs [19:54] oh good tip [19:54] ah ty! (https://www.youtube.com/user/TheAngryGrandpaShow/videos?view=57&flow=list?&ab_channel=TheAngryGrandpaShow) ..i guessing youtube-dl could count them with 'dryrun' [19:55] *** icedice2 has joined #archiveteam-bs [19:56] *** dd0a13f37 has joined #archiveteam-bs [19:59] *** icedice has quit IRC (Ping timeout: 248 seconds) [19:59] seems to really be "just" 621.. [20:00] for that specific channel [20:01] e.g one youtube user can have multiple channels, right? [20:10] or rather, one google+ user can [20:12] that 'all videos' playlist tip will do though [20:18] It's damn worrysome that people seems to think Youtube (and google for that matter) is guaranteed to stick around forever [20:29] *** Smiley has quit IRC (Remote host closed the connection) [20:29] *** Smiley has joined #archiveteam-bs [20:29] *** dashcloud has quit IRC (Read error: Operation timed out) [20:33] *** dashcloud has joined #archiveteam-bs [20:42] I HAVE A CHANNEL CALLED #godane-archivebox on irc.efnet.org [20:42] LETS BUILD SOFTWARE TO MAKE A CACHE/ARCHIVE BOX FOR HOME USERS [20:45] a NAS ? [20:45] sounds exiting [20:46] not a nas but a router with storage [20:46] so a kind of 'preseeded proxy' then? [20:46] yes [20:46] good stuff [20:46] i guess like the idea of a squid server to save on bandwidth [20:47] aye, not too bad an idea [20:54] *** BlueMaxim has joined #archiveteam-bs [21:05] *** icedice2 has quit IRC (Ping timeout: 633 seconds) [21:05] *** Asparagir has quit IRC (Asparagir) [21:40] so looks like i did 96k items this year so far [21:40] last year i did 449,998 items [22:12] who's behind archive.is, and more importantly, why does it block waybackbackmachine? [22:13] it's pissing me off [22:13] is it just cloudflare's fault? [22:15] They don't block WM, they block all bots. [22:16] And they have a quite complex robots.txt. [22:16] I think that individual saves should actually be allowed. [22:16] But WM's robots.txt parsing is quite crude AFAIK. [22:16] *** pizzaiolo has joined #archiveteam-bs [22:17] aye, before i got the cloudflare error, i was getting robots.txt error when trying to archive an item [22:17] im pretty sure of that [22:17] Yeah [22:18] Some months ago, I noticed that if a site has Disallow: * + Allow: /something/, i.e. allowing robots only in a specific part of the site, WM doesn't handle that correctly and blocks everything. [22:18] Not sure if that's still the case, but something similar might be what caused that. [22:19] And CloudFlare's just annoying. But we're getting closer to circumventing that automatically. My Python implementation of joepie91's cracker seems to work correctly. [22:20] (I won't release it until I have proper test coverage though.) [22:21] anyway to (ab)use it's seemingly adherance to 'memento api' ? [22:21] "archive.is supports MementoWeb API. More info can be found here" (http://mementoweb.org/depot/native/archiveis/) [22:22] ...whatever that is [22:24] *** mls_ has joined #archiveteam-bs [22:26] doesn't look much like icelandic to me https://who.is/whois/archive.is [22:28] unless it's shared outwards, it's basically a glorified url shortener with content [22:32] *** jschwart has quit IRC (Quit: Konversation terminated!) [22:32] *** dd0a13f37 has quit IRC (Quit: Connection closed for inactivity) [22:35] 'archive.is is "your personal Wayback Machine".' [22:35] trouble is, more and more people seem to use it [22:38] and if it's not going to any serious wayback machine(s), it could all go away by cancellation of hosting or domain [22:41] i think IA has memento api as well. Any way to check whether or not archive.is is keepin all the goodies? [22:43] *** mls_ has quit IRC (Quit: Page closed) [22:48] Anyway, maybe they do share. I don't know. "personal waybackmachines" sounds akin to saving warcs on a usb stick though [22:53] *** Atom has quit IRC (Read error: Operation timed out) [23:21] *** sep332 has quit IRC (Read error: Operation timed out) [23:24] *** Atom has joined #archiveteam-bs [23:26] *** ndiddy has joined #archiveteam-bs [23:43] *** atlogbot has quit IRC (Read error: Operation timed out) [23:43] *** swebb has quit IRC (Ping timeout: 246 seconds) [23:56] *** swebb has joined #archiveteam-bs [23:56] *** svchfoo1 sets mode: +o swebb [23:58] *** atlogbot has joined #archiveteam-bs