[00:47] *** af10b3e5e has quit IRC (Quit: https://i.imgur.com/xacQ09F.mp4) [01:03] *** d5f4a3622 has joined #archiveteam-bs [01:21] *** thuban3 has joined #archiveteam-bs [01:24] *** thuban2 has quit IRC (Read error: Operation timed out) [01:33] *** qw3rty_ has quit IRC (Read error: Connection reset by peer) [01:34] *** qw3rty_ has joined #archiveteam-bs [01:38] *** HP_Archiv has quit IRC (Ping timeout: 276 seconds) [02:05] *** HP_Archiv has joined #archiveteam-bs [02:35] is there any active/ongoing projects to archive flash games? [02:38] *** godane has joined #archiveteam-bs [02:39] my latest scans : https://www.patreon.com/posts/digitize-for-01-33454656 [03:07] wow looks like the wayback machine has some of Apple's IPSWs... trying to actually get one gives me gateway timeout though [03:07] I guess for those file sizes they need to send a gnome to get it off cold storage [03:07] *** kiska has quit IRC (Remote host closed the connection) [03:07] *** Flashfire has quit IRC (Remote host closed the connection) [03:08] *** kiska has joined #archiveteam-bs [03:08] *** Flashfire has joined #archiveteam-bs [03:08] *** svchfoo1 sets mode: +o kiska [03:08] *** svchfoo3 sets mode: +o kiska [03:17] *** shrines has joined #archiveteam-bs [03:28] ok Apple's CDN stopped giving me files [03:28] my remaining URLs all give 403 now [03:46] *** shrines has quit IRC (Quit: Page closed) [04:13] *** DogsRNice has quit IRC (Read error: Connection reset by peer) [04:27] *** qw3rty__ has joined #archiveteam-bs [04:31] *** odemgi_ has joined #archiveteam-bs [04:31] *** qw3rty_ has quit IRC (Ping timeout: 276 seconds) [04:34] *** odemgi has quit IRC (Ping timeout: 276 seconds) [04:47] *** duh has quit IRC (Quit: http://quassel-irc.org - Chat comfortably. Anywhere.) [04:48] *** legoktm has joined #archiveteam-bs [04:57] *** cerca has quit IRC (Remote host closed the connection) [05:40] Maybe someone can help - Wim Kayzer is a Dutch documentarian, interviewer and writer. 
How do I capture this video/interview into AB? https://www.npostart.nl/boeken/16-03-2014/VPWON_1210470 [05:41] There is not much video content of him outside of his famous 'A Glorious Accident' and 'Of Beauty and Consolation' from VPRO in the late 90's, https://youtu.be/RVrnn7QW6Jg [05:43] Any ideas or suggestions are welcome [05:49] !o https://mubi.com/films/a-glorious-accident [05:55] *** af10b3e5e has joined #archiveteam-bs [05:55] *** d5f4a3622 has quit IRC (Read error: Connection reset by peer) [06:06] HP_Archiv: you want to be in #archivebot [06:06] and what does !o stand for? [06:07] Yeah, I thought that was the syntax argument for the channel. What's the function again for submitting links into AB? [06:08] And noted [06:08] Hp_Archiv: That's a video in a custom player, which Firefox tells me needs the W3C DRM [06:08] I had to Noscript whitelist several domains to get it to play [06:08] Can't imagine it would do too well in AB or the WBM [06:09] Hm. What's the best way to get the video into AB then? There are add that play before the interview starts, and I also tried the FF router. Video is not easily accessible like you said [06:10] Could get the video URL, download and convert and then put on Archive.org? [06:10] ads* that play [06:10] if it has DRM it would be hard to do something useful once you download it [06:11] So that implies content like this is at risk of not being captured? [06:11] that's why DRM is the devil [06:12] ^^ [06:12] Well, VPRO does have several YT channels. I suppose there's a chance they uploaded this interview there, maybe. [06:13] Hard enough searching through Dutch-only sites to find references of him, etc. [06:16] Here's the higher level part of the site, which outlinks to the video page, https://www.uitzendinggemist.net/aflevering/259482/Vpro_Boeken.html# [06:16] No chance of capturing any of this content? 
[06:26] *** britmob has quit IRC (Read error: Connection reset by peer) [06:28] *** britmob has joined #archiveteam-bs [07:00] *** HP_Archiv has quit IRC (Ping timeout: 276 seconds) [07:04] *** HP_Archiv has joined #archiveteam-bs [07:10] JAA, how are we handling vampirefreaks going away? I got a login but it's all fucking js so I don't know how to handle the crawl... [07:43] *** ShellyRol has quit IRC (Read error: Connection reset by peer) [07:45] *** ShellyRol has joined #archiveteam-bs [07:45] *** qw3rty_ has joined #archiveteam-bs [07:46] *** qw3rty has joined #archiveteam-bs [07:47] *** Flashfire has quit IRC (Read error: Connection reset by peer) [07:47] *** Dallas has quit IRC (Read error: Connection reset by peer) [07:47] *** marked1 has quit IRC (Read error: Connection reset by peer) [07:47] *** benjinsmi has joined #archiveteam-bs [07:48] *** qw3rty__ has quit IRC (Ping timeout: 276 seconds) [07:48] *** OrIdow6 has quit IRC (Ping timeout: 276 seconds) [07:48] *** atphoenix has quit IRC (Ping timeout: 276 seconds) [07:48] *** benjins has quit IRC (Ping timeout: 276 seconds) [07:49] *** atphoenix has joined #archiveteam-bs [07:50] *** qw3rty_ has quit IRC (Ping timeout: 276 seconds) [07:50] *** marked1 has joined #archiveteam-bs [07:51] vampirefreaks is in #lastbyte (on hackint) [07:52] *** marked10 has joined #archiveteam-bs [07:59] *** marked1 has quit IRC (Ping timeout: 492 seconds) [08:00] *** marked101 has joined #archiveteam-bs [08:07] *** marked10 has quit IRC (Ping timeout: 492 seconds) [08:07] *** marked101 is now known as marked10 [08:08] *** marked108 has joined #archiveteam-bs [08:09] *** marked108 has quit IRC (Client Quit) [08:10] *** marked104 has joined #archiveteam-bs [08:11] *** marked104 has quit IRC (Client Quit) [08:12] *** marked105 has joined #archiveteam-bs [08:13] *** marked105 has quit IRC (Client Quit) [08:14] *** marked103 has joined #archiveteam-bs [08:15] *** marked103 has quit IRC (Client Quit) [08:16] *** marked10 has quit 
IRC (Ping timeout: 492 seconds) [08:30] *** HP_Archiv has quit IRC (Read error: Operation timed out) [08:33] *** OrIdow6 has joined #archiveteam-bs [09:03] *** OrIdow6 has quit IRC (Ping timeout: 276 seconds) [09:35] *** HP_Archiv has joined #archiveteam-bs [09:35] *** OrIdow6 has joined #archiveteam-bs [10:41] *** arktek has quit IRC (Ping timeout: 622 seconds) [10:58] *** arktek has joined #archiveteam-bs [11:09] *** OrIdow6 has quit IRC (Ping timeout: 276 seconds) [11:29] *** BlueMax has quit IRC (Read error: Connection reset by peer) [11:35] *** HP_Archiv has quit IRC (Read error: Operation timed out) [11:41] *** OrIdow6 has joined #archiveteam-bs [12:28] *** LowLevelM has joined #archiveteam-bs [12:48] *** HP_Archiv has joined #archiveteam-bs [12:49] *** X-Scale` has joined #archiveteam-bs [12:56] *** X-Scale has quit IRC (Ping timeout: 610 seconds) [12:56] *** X-Scale` is now known as X-Scale [13:01] *** X-Scale` has joined #archiveteam-bs [13:07] *** X-Scale has quit IRC (Ping timeout: 610 seconds) [13:07] *** X-Scale` is now known as X-Scale [13:22] odemgi_: arkiver's looking into it. And yeah, #lastbyte on hackint. [15:00] trying to archive iOS ipsw files, requesting from multiple locations, I'm hitting seriously diminished returns by now [15:00] http://nicolas17.s3.amazonaws.com/ipsw-torrent-status.txt here's the list of what I'm still missing [15:04] I think they deleted the files but some are still in CDN caches, so you try the same link repeatedly it might eventually work when you hit a server that still has it, and if that doesn't work then requesting from a different location hits a different set of caches and might have a chance [15:06] but I seem to have hit a wall now [15:32] *** igloo25 has joined #archiveteam-bs [15:34] igloo25: meet kiska [15:35] Hello? [15:35] greetings! 
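The retry idea described at 15:04 (request the same link repeatedly in the hope of hitting a CDN server that still has the file cached) could be sketched roughly as below. This is only an illustration, not the script actually used in the channel: `http_fetch`, `fetch_with_retries`, the attempt count and the delay are all hypothetical names and values, and a real IPSW download would stream to disk rather than read the whole body into memory.

```python
import time
import urllib.request


def http_fetch(url, timeout=30):
    """Fetch url once; return the body bytes, or None on any HTTP/network error."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.read()
    except OSError:
        # urllib.error.URLError/HTTPError and socket timeouts are all OSError subclasses.
        return None


def fetch_with_retries(url, fetch=http_fetch, attempts=5, delay=0.0):
    """Call fetch(url) up to `attempts` times.

    Returns (body, attempt_number) on the first success,
    or (None, attempts) if every attempt failed.
    """
    for attempt in range(1, attempts + 1):
        body = fetch(url)
        if body is not None:
            return body, attempt
        time.sleep(delay)  # pause before trying again (hypothetical default of 0)
    return None, attempts
```

Passing a different `fetch` callable (for example one that goes through a proxy in another region) would correspond to the "requesting from a different location" step; that part is left out here because the log does not say how it was done.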
[16:19] *** Mayonaise has quit IRC (Read error: Operation timed out) [16:33] *** britmob has quit IRC (Read error: Connection reset by peer) [16:33] *** marked10 has joined #archiveteam-bs [16:35] *** britmob has joined #archiveteam-bs [16:54] *** Mayonaise has joined #archiveteam-bs [16:59] *** MaximeleG has joined #archiveteam-bs [17:05] *** MaximeleG has quit IRC (Quit: MaximeleG) [17:07] *** MaximeleG has joined #archiveteam-bs [17:07] *** thuban3 has quit IRC (Read error: Connection reset by peer) [17:09] *** thuban3 has joined #archiveteam-bs [17:18] nicolas17, this wiki has hash values for some of the firmware: https://www.theiphonewiki.com/wiki/Firmware/Apple_TV/12.x [17:18] if you trust the hash files or can cross reference them elsewhere, you should be able to use alternative sources to backfill the missing ipsw files. [17:22] *** DogsRNice has joined #archiveteam-bs [17:24] *** MaximeleG has quit IRC (Quit: MaximeleG) [17:30] *** godane1 has joined #archiveteam-bs [17:31] *** godane has quit IRC (Read error: Connection reset by peer) [17:35] *** thuban4 has joined #archiveteam-bs [17:39] Ryz, mr_archiv I have added https://www.youtube.com/channel/UC-s_Z-t5hMPyiocz7wrAGkg studiodaily YT channel to youtubearchive. 
[17:39] Please also check to see if there are related Twitter/FB/Instagram pages [17:41] *** thuban3 has quit IRC (Read error: Operation timed out) [17:50] *** Atom__ has quit IRC (Quit: Atom__) [17:54] atphoenix: I was using hashes from ipsw.me so far [17:54] HP_Archiv: youtube-dl has an extractor for npo; you could rip the video(s) and upload to IA manually [17:56] nicolas17, see PM [17:58] *** thuban has joined #archiveteam-bs [18:01] *** thuban4 has quit IRC (Ping timeout: 276 seconds) [18:29] *** marked108 has joined #archiveteam-bs [18:29] *** asdf01012 has joined #archiveteam-bs [18:30] *** marked10 has quit IRC (Read error: Operation timed out) [18:30] *** marked108 is now known as marked10 [18:32] *** asdf0101 has quit IRC (Read error: Operation timed out) [18:32] *** asdf01012 is now known as asdf0101 [18:43] @atphoenix, Here is what I found: [18:43] https://twitter.com/StudioDaily [18:43] https://www.facebook.com/studiodaily [18:43] I also found a linkedin link on their website but I get redirected to a login page. [18:43] https://www.facebook.com/studiodaily [18:44] http://www.linkedin.com/groups?gid=1796059 [18:44] I must have missed the copy button the first time. 
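The hash cross-referencing suggested at 17:18 (check a file obtained from an alternative source against a published hash before trusting it) might look something like the sketch below. The function names are hypothetical, and SHA-1 as the default algorithm is an assumption; the log does not say which hash type the wiki or ipsw.me publish.

```python
import hashlib


def file_digest(path, algorithm="sha1", chunk_size=1 << 20):
    """Hash a file in chunks so multi-gigabyte firmware images need not fit in memory."""
    h = hashlib.new(algorithm)
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def matches_expected(path, expected_hex, algorithm="sha1"):
    """Compare the file's digest with a published hex digest, ignoring case and whitespace."""
    return file_digest(path, algorithm) == expected_hex.strip().lower()
```

A truncated download would fail this check as well, since its digest would differ from the published one.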
[18:51] *** marked10 is now known as marked1 [18:51] *** DiscantX has joined #archiveteam-bs [18:58] Hello mr_archiv, I believe we got 'em all, except their LinkedIn profile, that website is very notorious for their anti-scraping capabilities [19:02] !ig 8lg70qyuqkfn2kf215v51anqp ^https?://www\.michael\.fabricant\.mp\.co\.uk/wp-content/cache/wpfc-minified/ [19:02] Ugh [19:08] *** superkuh has quit IRC (Quit: the neuronal action potential is an electrical manipulation of reversible abrupt phase changes in the lipid bilaye) [19:40] *** qw3rty has quit IRC (Read error: Operation timed out) [20:02] *** superkuh has joined #archiveteam-bs [20:23] *** qw3rty has joined #archiveteam-bs [20:26] *** jodizzle has quit IRC (Read error: Operation timed out) [20:26] *** Frogging has quit IRC (Read error: Operation timed out) [20:26] *** Frogging has joined #archiveteam-bs [20:26] *** girst has quit IRC (Read error: Operation timed out) [20:26] *** jodizzle has joined #archiveteam-bs [20:26] *** Mayonaise has quit IRC (Read error: Operation timed out) [20:26] *** dxrt_ has quit IRC (Read error: Operation timed out) [20:26] *** Craigle has quit IRC (Read error: Operation timed out) [20:27] *** systwiALT has joined #archiveteam-bs [20:27] *** PurpleSym has quit IRC (Read error: Operation timed out) [20:27] *** anarchat has quit IRC (Read error: Operation timed out) [20:27] *** anarcat has joined #archiveteam-bs [20:27] *** paul2520 has quit IRC (Read error: Operation timed out) [20:28] *** Mayonaise has joined #archiveteam-bs [20:28] *** Auctus has quit IRC (Read error: Operation timed out) [20:28] *** Auctus has joined #archiveteam-bs [20:28] *** Raccoon` has joined #archiveteam-bs [20:28] *** asdf0101 has quit IRC (Read error: Operation timed out) [20:29] *** brayden has quit IRC (Read error: Operation timed out) [20:29] *** LowLevelM has quit IRC (Read error: Operation timed out) [20:29] *** atphoeni1 has joined #archiveteam-bs [20:29] *** luckcolor has quit IRC (Read error: 
Operation timed out) [20:29] *** gtwy has quit IRC (Read error: Operation timed out) [20:29] *** keith20 has quit IRC (Read error: Operation timed out) [20:29] *** luckcolor has joined #archiveteam-bs [20:30] *** girst has joined #archiveteam-bs [20:30] *** klg has quit IRC (Read error: Operation timed out) [20:30] *** atphoenix has quit IRC (Read error: Operation timed out) [20:30] *** atphoeni1 is now known as atphoenix [20:31] *** arktek has quit IRC (Read error: Operation timed out) [20:31] *** HP_Archiv has quit IRC (Read error: Operation timed out) [20:32] *** Raccoon has quit IRC (Read error: Operation timed out) [20:32] *** Raccoon` is now known as Raccoon [20:33] *** Tenebrae has quit IRC (Ping timeout: 864 seconds) [20:33] *** systwi has quit IRC (Read error: Operation timed out) [20:34] *** HP_Archiv has joined #archiveteam-bs [20:36] *** kiska has quit IRC (Ping timeout: 622 seconds) [20:38] *** Wingy has quit IRC (Read error: Operation timed out) [20:40] *** MrRadar2 has quit IRC (Read error: Operation timed out) [20:50] *** klg has joined #archiveteam-bs [20:51] ~/win 135 [20:51] Oops [21:00] *** godane1 has quit IRC (Ping timeout: 255 seconds) [21:06] *** Tenebrae has joined #archiveteam-bs [21:07] *** gtwy has joined #archiveteam-bs [21:07] *** arktek has joined #archiveteam-bs [21:07] *** asdf0101 has joined #archiveteam-bs [21:07] *** LowLevelM has joined #archiveteam-bs [21:07] *** brayden has joined #archiveteam-bs [21:07] *** dxrt_ has joined #archiveteam-bs [21:08] *** keith20 has joined #archiveteam-bs [21:08] *** MrRadar2 has joined #archiveteam-bs [21:08] *** svchfoo1 sets mode: +o dxrt_ [21:08] *** svchfoo3 sets mode: +o dxrt_ [21:09] *** Wingy has joined #archiveteam-bs [21:14] *** Sokar has joined #archiveteam-bs [21:14] *** godane has joined #archiveteam-bs [21:15] *** atphoenix has quit IRC (Read error: Connection reset by peer) [21:15] *** PurpleSym has joined #archiveteam-bs [21:16] *** Frogging has quit IRC (Read error: Connection 
reset by peer) [21:16] *** atphoenix has joined #archiveteam-bs [21:16] *** svchfoo3 sets mode: +o PurpleSym [21:16] *** svchfoo1 sets mode: +o PurpleSym [21:17] *** thuban has quit IRC (Ping timeout: 276 seconds) [21:17] *** paul2520 has joined #archiveteam-bs [21:17] *** thuban has joined #archiveteam-bs [21:18] *** Frogging has joined #archiveteam-bs [21:18] *** qw3rty_ has joined #archiveteam-bs [21:18] *** HP_Archiv has quit IRC (Ping timeout: 276 seconds) [21:18] *** qw3rty has quit IRC (Ping timeout: 276 seconds) [21:20] *** anarcat has quit IRC (Ping timeout: 276 seconds) [21:21] *** HP_Archiv has joined #archiveteam-bs [21:23] *** Raccoon has quit IRC (Ping timeout: 276 seconds) [21:24] *** Raccoon has joined #archiveteam-bs [21:24] *** OrIdow6 has quit IRC (Ping timeout: 276 seconds) [21:26] *** qw3rty_ has quit IRC (Read error: Connection reset by peer) [21:26] *** anarcat has joined #archiveteam-bs [21:27] *** qw3rty has joined #archiveteam-bs [21:39] *** OrIdow6 has joined #archiveteam-bs [21:46] *** thuban1 has joined #archiveteam-bs [21:47] *** Ctrl has joined #archiveteam-bs [21:51] *** thuban has quit IRC (Read error: Operation timed out) [21:55] *** kiska has joined #archiveteam-bs [21:56] *** svchfoo3 sets mode: +o kiska [21:56] *** svchfoo1 sets mode: +o kiska [21:59] *** BlueMax has joined #archiveteam-bs [22:02] *** HP_Archiv has quit IRC (Quit: Leaving) [22:06] *** Craigle has joined #archiveteam-bs [22:24] SketchCow: If that AB backlog is more or less constant, don't worry about it IMO. I'm working on the new system. [22:33] Well, I mostly pass it along just to keep people informed since people use FOS when they're not part of the inner circle. [22:33] I do agree the real solution is to completely avoid FOS and upload directly, especially for things that don't depend on inside-archive bandwidth. 
[22:34] Good things FOS is for: Downloading things from the archive at ludicrous speed, analyzing them, uploading things like screenshots, listings, summaries back into the item. [22:34] Things FOS is not so good for: time-critical reflection of mass data into the archive's stores, unless no other option is there (people who FTP me bundles, for example). [22:35] Yeah [22:35] FOS was outgrown years ago. Glad to have it there and tinker with the speed issues, because it helps me learn some basic engineering, but yeah, a distributed cohesive bouncer is the way to go if we're going to make Archivebot work [22:36] *** DiscantX has quit IRC (Remote host closed the connection) [22:38] DFJustin: I think it's been above 2 TB per day on average in the last months. Overall, AB has archived 1.2 PB since it was launched in 2014. [22:39] And 12 billion URLs. [22:48] *** PotcFdk has joined #archiveteam-bs [22:50] One more stat: AB accounts for about 5 % of the WBM ingestion in terms of WARC size. [22:50] whew [22:50] *** antomati_ has joined #archiveteam-bs [22:50] *** qw3rty_ has joined #archiveteam-bs [22:51] *** fredgido_ has joined #archiveteam-bs [22:51] *** ranma_ has joined #archiveteam-bs [22:51] *** SmileyG has joined #archiveteam-bs [22:53] *** benjins has joined #archiveteam-bs [22:54] *** Fionera_ has joined #archiveteam-bs [22:54] *** sknebel_ has joined #archiveteam-bs [22:55] *** qw3rty has quit IRC (irc.efnet.nl efnet.deic.eu) [22:55] *** britmob has quit IRC (irc.efnet.nl efnet.deic.eu) [22:55] *** X-Scale has quit IRC (irc.efnet.nl efnet.deic.eu) [22:55] *** benjinsmi has quit IRC (irc.efnet.nl efnet.deic.eu) [22:55] *** ShellyRol has quit IRC (irc.efnet.nl efnet.deic.eu) [22:55] *** dashcloud has quit IRC (irc.efnet.nl efnet.deic.eu) [22:55] *** ranma has quit IRC (irc.efnet.nl efnet.deic.eu) [22:55] *** sknebel has quit IRC (irc.efnet.nl efnet.deic.eu) [22:55] *** Smiley has quit IRC (irc.efnet.nl efnet.deic.eu) [22:55] *** Kenshin has quit IRC (irc.efnet.nl 
efnet.deic.eu) [22:55] *** actually_ has quit IRC (irc.efnet.nl efnet.deic.eu) [22:55] *** Polylith has quit IRC (irc.efnet.nl efnet.deic.eu) [22:55] *** Fionera has quit IRC (irc.efnet.nl efnet.deic.eu) [22:55] *** fredgido has quit IRC (irc.efnet.nl efnet.deic.eu) [22:55] *** chfoo has quit IRC (irc.efnet.nl efnet.deic.eu) [22:55] *** betamax has quit IRC (irc.efnet.nl efnet.deic.eu) [22:55] *** antomatic has quit IRC (irc.efnet.nl efnet.deic.eu) [22:55] *** ctrl_ has quit IRC (irc.efnet.nl efnet.deic.eu) [22:56] *** betamax_ has joined #archiveteam-bs [22:56] *** chfoo_ has joined #archiveteam-bs [22:56] *** britmob_ has joined #archiveteam-bs [22:58] *** RKenshin has joined #archiveteam-bs [22:58] *** obskyr has joined #archiveteam-bs [22:58] *** PotcFdk has quit IRC (Quit: ~'o'/) [23:04] I found some of my remaining ipsw files in WBM [23:05] *** X-Scale` has joined #archiveteam-bs [23:07] *** Polylith_ has joined #archiveteam-bs [23:08] hmm or not... truncated file [23:08] *** thuban1 has quit IRC (Read error: Connection reset by peer) [23:09] *** thuban1 has joined #archiveteam-bs [23:10] *** dashcloud has joined #archiveteam-bs [23:10] *** PotcFdk has joined #archiveteam-bs [23:10] *** RKenshin is now known as Kenshin [23:12] *** ShellyRol has joined #archiveteam-bs [23:17] *** PotcFdk has quit IRC (Quit: ~'o'/) [23:21] *** PotcFdk has joined #archiveteam-bs [23:24] I tried a few more, all truncated at different <1GB sizes [23:47] *** chfoo_ is now known as chfoo [23:52] Do we have a spin channel yet [23:57] Is there any more than just https://www.spin.com/ and their social media stuff to archive? [23:58] can anyone view this page without a phone : https://reader.magzter.com/preview/03wsivyumv87sg39wplimdc219750/21975 [23:59] i'm trying to get wget to make website think its android
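For the last question (making a site believe the client is an Android device), the usual first step is sending a mobile User-Agent header. A minimal sketch follows, assuming the site only looks at that header, which the log does not confirm; the `ANDROID_UA` string and `build_mobile_request` are illustrative, not taken from the conversation.

```python
import urllib.request

# Example Android Chrome User-Agent string; any current mobile UA could be substituted.
ANDROID_UA = (
    "Mozilla/5.0 (Linux; Android 10; Pixel 3) AppleWebKit/537.36 "
    "(KHTML, like Gecko) Chrome/80.0.3987.99 Mobile Safari/537.36"
)


def build_mobile_request(url, user_agent=ANDROID_UA):
    """Build a request object that presents the given User-Agent header."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})
```

The request can then be passed to `urllib.request.urlopen`. With wget itself, the `--user-agent` option sets the same header; a site that also checks cookies, JavaScript or screen size would need more than this.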