[00:05] *** julientm has quit IRC (Remote host closed the connection) [00:06] *** julientm has joined #archiveteam-bs [00:08] *** julientm has quit IRC (Remote host closed the connection) [00:11] *** julientm has joined #archiveteam-bs [00:29] *** julientm has quit IRC (Remote host closed the connection) [00:32] *** julientm has joined #archiveteam-bs [01:11] *** bitBaron has quit IRC (My computer has gone to sleep. 😴😪ZZZzzz…) [01:16] *** bitBaron has joined #archiveteam-bs [01:32] *** bitBaron has quit IRC (Quit: My computer has gone to sleep. 😴😪ZZZzzz…) [01:46] *** qw3rty117 has quit IRC (Read error: Connection reset by peer) [01:51] *** killsushi has joined #archiveteam-bs [02:04] *** qw3rty117 has joined #archiveteam-bs [02:06] *** killsushi has quit IRC (Quit: Leaving) [02:34] *** julientm has quit IRC (Read error: Operation timed out) [02:44] *** SimpBrain has quit IRC (Remote host closed the connection) [02:45] *** SimpBrain has joined #archiveteam-bs [02:45] Just a word of note, from the byte range that JAA has, it looks like that specific blog wasn't downloaded because it was after tumblr [02:45] tumblr's deadline* [02:49] *** bitBaron has joined #archiveteam-bs [02:56] *** m007a83 has quit IRC (Read error: Connection reset by peer) [03:00] *** BlueMax has joined #archiveteam-bs [03:01] *** m007a83 has joined #archiveteam-bs [03:42] *** wp494 has quit IRC (Read error: Operation timed out) [03:42] *** bitBaron has quit IRC (Quit: My computer has gone to sleep. 😴😪ZZZzzz…) [03:44] *** wp494 has joined #archiveteam-bs [03:56] *** VADemon has quit IRC (Quit: left4dead) [04:05] *** superkuh has quit IRC (Quit: the neuronal action potential is an electrical manipulation of reversible abrupt phase changes in the lipid bilaye) [04:07] *** Despatche has joined #archiveteam-bs [04:33] *** ndiddy has quit IRC () [04:36] *** qw3rty118 has joined #archiveteam-bs [04:40] *** qw3rty117 has quit IRC (Read error: Operation timed out) [04:45] *** deevious1 has joined #archiveteam-bs [04:46] *** deevious has quit IRC (Ping timeout: 252 seconds) [04:46] *** deevious1 is now known as deevious [04:47] *** wyatt8740 has quit IRC (Ping timeout: 255 seconds) [04:52] *** odemgi_ has joined #archiveteam-bs [04:54] *** odemgi has quit IRC (Ping timeout: 252 seconds) [05:00] *** odemg has quit IRC (Ping timeout: 615 seconds) [05:07] *** odemg has joined #archiveteam-bs [05:16] *** m007a83 has quit IRC (Quit: Fuck you Comcast) [05:16] *** m007a83 has joined #archiveteam-bs [06:07] *** turnkit has quit IRC (Ping timeout: 360 seconds) [06:09] *** turnkit has joined #archiveteam-bs [06:28] *** m007a83 has quit IRC (Read error: Connection reset by peer) [06:30] *** m007a83 has joined #archiveteam-bs [06:39] *** Despatche has quit IRC (Remote host closed the connection) [06:41] *** Despatche has joined #archiveteam-bs [06:42] *** Despatche has quit IRC (Read error: Connection reset by peer) [06:42] *** Despatche has joined #archiveteam-bs [06:50] *** Despatche has quit IRC (Connection reset by deer) [06:53] *** deevious1 has joined #archiveteam-bs [06:55] *** deevious has quit IRC (Ping timeout: 252 seconds) [06:55] *** deevious1 is now known as deevious [07:45] *** Stiletto has joined #archiveteam-bs [07:45] *** Stilett0 has quit IRC (Ping timeout: 252 seconds) [08:03] *** svchfoo3 has left [08:20] *** deevious has quit IRC (Ping timeout: 252 seconds) [08:47] *** wyatt8740 has joined #archiveteam-bs [08:58] *** deevious has joined #archiveteam-bs [09:47] *** SimpBrain has quit IRC (Read error: Operation timed out) [09:53] *** c4rc4s has quit IRC (Remote host closed the connection) [09:55] *** SimpBrain has joined #archiveteam-bs [09:56] *** c4rc4s has joined #archiveteam-bs [09:58] *** S1mpbrain has joined #archiveteam-bs [09:58] *** SimpBrain has quit IRC (Read error: Connection reset by peer) [10:49] dashcloud: ok [10:50] whats funny is i question what cause the audio sync issue cause i was able to digitize the rest of your warbirds tapes without issue [10:53] looks like asus pvr-416 is limited to svideo [11:16] For [[ArchiveBot/Knowledge preservation initiatives/list]], is there any reason to not include the many language sites of https://wikisource.org/wiki/Main_Page [11:22] JAA: I’m going to stop fixing wpull now until someone merged or rejected my pending pull requests. Too many branches and too many failing tests (locally). [11:27] *** bitBaron has joined #archiveteam-bs [11:32] *** BlueMax has quit IRC (Quit: Leaving) [12:17] *** bitBaron has quit IRC (Quit: My computer has gone to sleep. 😴😪ZZZzzz…) [12:32] *** kiska1 has quit IRC (Read error: Operation timed out) [12:35] *** kiska1 has joined #archiveteam-bs [12:41] *** wp494 has quit IRC (Read error: Operation timed out) [12:42] *** wp494 has joined #archiveteam-bs [12:42] *** bitBaron has joined #archiveteam-bs [12:58] *** superkuh has joined #archiveteam-bs [13:29] *** VADemon has joined #archiveteam-bs [13:36] *** icedice has joined #archiveteam-bs [14:01] *** bitBaron has quit IRC (Quit: My computer has gone to sleep. 😴😪ZZZzzz…) [14:05] *** S1mpbrain has quit IRC (Read error: Connection reset by peer) [14:12] *** SimpBrain has joined #archiveteam-bs [14:14] *** bitBaron has joined #archiveteam-bs [14:16] *** deevious has quit IRC (Quit: deevious) [14:32] *** omarroth has joined #archiveteam-bs [14:54] *** deevious has joined #archiveteam-bs [14:54] *** LFlare has quit IRC (Ping timeout: 268 seconds) [15:03] *** Stiletto has quit IRC (Ping timeout: 246 seconds) [15:04] *** Stiletto has joined #archiveteam-bs [15:04] *** deevious has quit IRC (Quit: deevious) [15:37] *** deevious has joined #archiveteam-bs [16:05] *** deevious has quit IRC (Remote host closed the connection) [16:06] *** bitBaron has quit IRC (My computer has gone to sleep. 😴😪ZZZzzz…) [16:21] *** fredgido has quit IRC (Ping timeout: 600 seconds) [16:40] *** bitBaron has joined #archiveteam-bs [16:42] *** evul has left Textual IRC Client: www.textualapp.com [16:45] eientei95: add them if you want [16:45] *** Moder112 has joined #archiveteam-bs [16:46] Well [16:46] Thanks to your help, JAA, I was able to download exactly what I wanted [16:47] but unfortunately neither webarchive player or warcproxy were able to open the smaller file [16:47] properly [16:48] https://imgur.com/a/BArTzpy [16:48] this is the error I got [16:48] It's in polish so I'll just translate [16:48] Moder112: Nice. I don't know too much about playback. I only have experience with pywb in that regard. [16:49] the error says that the website was moved [16:49] and the console says it's a tunnel error or something [16:49] I'd try pywb but it won't work for some reason [16:49] Oh [16:49] it just acts like it's finished [16:50] but then does nothing [16:50] We didn't grab those assets for every blog. [16:50] So yeah, those are not contained in the archives. [16:50] no but like [16:50] I can't browse any of the pages [16:50] all of them give the same error [16:50] and something has to be in there [16:50] it's 800Mb [16:51] Hmm, it should work, even if the pages look awful due to the missing CSS etc. [16:51] yeah [16:51] but none of them work [16:51] well [16:51] We moved the media stuff towards the end of the grab, so you might need to look for tumblr_static [16:52] yeah the tumblr static folder is in there iirc [16:52] The other option is to use WBM, that should give you all the content without having to do warc downloading [16:52] WBM? [16:52] Oh yeah right, we grabbed the media separately. [16:53] Might be a painful process to find those corresponding archives. [16:53] WayBack Machine [16:53] wayback machine has very little of this blog saved [16:53] The WBM has everything we did save. [16:53] All WARCs in our collections get ingested into the WBM. [16:53] Unfortunately it does look like we didn't get this blog [16:54] ah okay [16:54] well [16:54] there's got to be something in that warc though [16:55] I was looking for some specific stuff concerning a few cancelled fangames this person was working on [16:55] Yeah, 34k URLs according to your screenshot. [16:55] Which blog is this? [16:55] it's http://seashelbby.tumblr.com [16:56] the url seems to have been taken over for spam purposes [16:57] Does this mean something https://web.archive.org/web/20181210021124/http://seashelbby.tumblr.com/ [16:57] https://web.archive.org/web/20181210021152/http://seashelbby.tumblr.com/archive looks pretty good to me. [16:57] Some posts are lacking images on that page, but the actual post pages have everything at least in the few samples I just tested. [16:58] oh yeah, I guess I'm dumb then [16:58] Well, the scrolling on /archive doesn't seem to work. [16:58] But https://web.archive.org/web/20181210021146/http://seashelbby.tumblr.com/page/2 etc. does. [16:58] Huh? WBM should have rewritten the /archive endpoints [16:59] *** micro has quit IRC (Ping timeout: 246 seconds) [16:59] WBM is awful at handling anything JS-related. [16:59] This also works https://web.archive.org/web/20181210021451/http://seashelbby.tumblr.com/archive/2015/8 [16:59] Well I think I can just browse it through * [16:59] well anyway thanks for the help [16:59] Yes, monthly archives should work as long as there aren't too many posts within a month. [16:59] I was about to say we didn't really get it, but it looks like that was the first blogs we queued up [17:00] Oh wait, scrolling does work. [17:01] I didn't realize the dumps were instantly incorporated into the whole WB [17:01] I feel kinda dumb now [17:02] Everything that we do is ingested into WBM [17:02] And if you want, Google+ will be soon™ [17:03] well I'll just fuck off and let my pc work as part of the G+ scrape botnet [17:03] sorry for wasting your guys' time [17:03] That is fine, we're here to answer questions [17:04] That, and if you want a local archive, your questions are still very relevant. [17:05] *** micro has joined #archiveteam-bs [17:05] *** omarroth has quit IRC (Ping timeout: 268 seconds) [17:05] well I was going to include the warc as part of the archive I'm compiling as navigating it on WB is kinda difficult, but I guess I'll just look through it myself for anything relevant [17:05] anyway, thanks for the help [17:06] *** Moder112 has left [17:06] You're welcome [17:07] For anymore tumblr questions, can I get you to join #tumbledown ? [17:07] Since that is where the project was, and we are still there [17:10] (They left.) [17:10] ... [17:11] Hey, they stayed around long enough to actually have a conversation. That's better than 90 % of the webchat users. :-) [17:12] That is very true [17:17] SketchCow: it's uploaded, but I probably did it wrong https://archive.org/details/BL-Spare-Rib-Archive [17:22] *** fredgido has joined #archiveteam-bs [17:30] *** ndiddy has joined #archiveteam-bs [17:42] *** julientm has joined #archiveteam-bs [18:24] *** lindalap has joined #archiveteam-bs [19:21] JAA: r/WPD and r/Gory (those names exactly, not abbreviations) seem to be new "replacements" [19:24] lindalap: #shreddit by the way. [19:26] Thanks [19:27] *** Despatche has joined #archiveteam-bs [19:45] *** Stilett0 has joined #archiveteam-bs [19:47] *** Stiletto has quit IRC (Read error: Operation timed out) [19:57] IT IS TEXTFILES.COM DAY [20:45] *** BlueMax has joined #archiveteam-bs [21:01] *** m007a83 has quit IRC (Ping timeout: 252 seconds) [21:36] *** icedice2 has joined #archiveteam-bs [21:41] *** icedice has quit IRC (Read error: Operation timed out) [21:45] *** wp494 has quit IRC (Ping timeout: 615 seconds) [21:45] *** wp494 has joined #archiveteam-bs [22:47] *** dw_ has joined #archiveteam-bs [23:15] *** Despatche has quit IRC (Remote host closed the connection) [23:15] *** Despatche has joined #archiveteam-bs [23:24] *** ColdIce has joined #archiveteam-bs [23:45] *** dw_ has left