[00:06] *** Aranje has quit IRC (Read error: Connection reset by peer)
[00:07] *** Aranje has joined #archiveteam
[00:29] *** BlueMaxim is now known as BlueMax
[01:26] *** mistym has quit IRC (Remote host closed the connection)
[01:41] *** mistym has joined #archiveteam
[01:45] *** aaaaaaaaa has joined #archiveteam
[01:48] *** philpem has quit IRC (Ping timeout: 272 seconds)
[02:06] *** mistym has quit IRC (Remote host closed the connection)
[02:07] *** mistym has joined #archiveteam
[02:07] *** mistym has quit IRC (Remote host closed the connection)
[02:13] Special Message
[02:18] *** zenguy_pc has quit IRC (Ping timeout: 480 seconds)
[02:20] *** ionpulse has joined #archiveteam
[02:34] *** zenguy_pc has joined #archiveteam
[02:39] nico: yeah?
[03:03] *** Ymgve has quit IRC ()
[03:21] *** dashcloud has quit IRC (Read error: Connection reset by peer)
[03:22] *** dashcloud has joined #archiveteam
[04:09] *** primus104 has quit IRC (Leaving.)
[04:58] *** aaaaaaaaa has quit IRC (Leaving)
[05:12] *** logchfoo starts logging #archiveteam at Thu Nov 20 05:12:25 2014
[05:12] *** logchfoo has joined #archiveteam
[05:21] *** mistym has joined #archiveteam
[06:00] *** vertice32 has joined #archiveteam
[06:01] hey all, I maintain a longstanding community site for queer and questioning teens that we have decided to sunset
[06:02] what’s the best tool to use to build a warc of a drupal 5.x site?
[06:11] *** wp494 has quit IRC (Read error: Connection reset by peer)
[06:11] *** wp494 has joined #archiveteam
[06:12] wget (or wpull) is usually a good choice if the website has a good structure and is not too big
[06:14] heritrix for complex and advanced set up. #archivebot for small to medium sites if you want it in archive.org
[06:15] if it's a website that you want us to crawl, let us know
[06:52] the website is http://www.oasisjournals.com
[06:52] it’s been running for 19 years
[07:28] wow, thanks for running it that long
[08:08] *** mistym has quit IRC (Remote host closed the connection)
[08:14] *** garyrh has quit IRC (Remote host closed the connection)
[08:19] *** signius has quit IRC (Ping timeout: 480 seconds)
[08:29] *** signius has joined #archiveteam
[08:34] *** garyrh has joined #archiveteam
[08:46] yeah. history is important. which is why i didn’t want to just turn it off
[08:46] and the site served its purpose. gay teenagers don’t need somewhere else to talk anymore
[08:47] and it got creepier and creepier running that site as we got older and older
[08:54] *** schbirid has joined #archiveteam
[08:54] if you go to the channel #archivebot and post the site, the guys there should be able to kick off a grab of it, or at least point you in the right direction
[08:54] vertice32: otherwise, we've got these on the wiki if you'd want to do it yourself: http://www.archiveteam.org/index.php?title=Wget_with_WARC_output http://www.archiveteam.org/index.php?title=Wget#Creating_WARC_with_wget
[09:02] *** primus104 has joined #archiveteam
[09:42] *** primus104 has quit IRC (Leaving.)
[09:46] *** primus104 has joined #archiveteam
[09:47] *** primus104 has quit IRC (Client Quit)
[09:54] vertice32: is your aim to keep maintaining a static version of it?
[09:56] A static export is really a feature that big CMSes should support better. I've searched for one for Joomla and the landscape is very sad, no idea about Drupal. And we don't really offer any tool to serve your own WARC backup on your site
[10:05] Yeah, we normally recommend putting your WARC backup on Archive.org, and then it can be imported into the Wayback Machine
[10:06] I believe
[10:06] *** filippo__ has joined #archiveteam
[10:07] *** filippo_ has quit IRC (Ping timeout: 378 seconds)
[10:07] *** filippo__ is now known as filippo_
[10:09] *** BlueMax has quit IRC (Quit: Leaving)
[10:12] *** APerti has quit IRC (ircd.choopa.net ircd.shaw.ca)
[10:12] *** godane has quit IRC (ircd.choopa.net ircd.shaw.ca)
[10:12] *** Ravenloft has quit IRC (ircd.choopa.net ircd.shaw.ca)
[10:12] *** Smiley has quit IRC (ircd.choopa.net ircd.shaw.ca)
[10:12] *** SadDM has quit IRC (ircd.choopa.net ircd.shaw.ca)
[10:12] *** Sellyme has quit IRC (ircd.choopa.net ircd.shaw.ca)
[10:13] *** APerti has joined #archiveteam
[10:13] *** godane has joined #archiveteam
[10:13] *** Ravenloft has joined #archiveteam
[10:13] *** Smiley has joined #archiveteam
[10:13] *** SadDM has joined #archiveteam
[10:13] *** Sellyme has joined #archiveteam
[10:30] *** APerti has quit IRC (ircd.choopa.net ircd.shaw.ca)
[10:30] *** godane has quit IRC (ircd.choopa.net ircd.shaw.ca)
[10:30] *** Ravenloft has quit IRC (ircd.choopa.net ircd.shaw.ca)
[10:30] *** Smiley has quit IRC (ircd.choopa.net ircd.shaw.ca)
[10:30] *** SadDM has quit IRC (ircd.choopa.net ircd.shaw.ca)
[10:30] *** Sellyme has quit IRC (ircd.choopa.net ircd.shaw.ca)
[10:32] *** APerti has joined #archiveteam
[10:32] *** godane has joined #archiveteam
[10:32] *** Ravenloft has joined #archiveteam
[10:32] *** Smiley has joined #archiveteam
[10:32] *** SadDM has joined #archiveteam
[10:32] *** Sellyme has joined #archiveteam
[10:33] Nemo_bis, danneh_ : i want to replace the front page, and put the warc on archive.org
[10:33] linking to it
[10:34] Nemo_bis: the irony of this is, i used to be a core drupal developer, and i’m sure something like that exists somewhere, but a crawled copy is actually preferable to me in this situation
[10:35] *** dashcloud has quit IRC (Quit: No Ping reply in 180 seconds.)
[10:37] *** dashcloud has joined #archiveteam
[10:48] *** kris33 has joined #archiveteam
[10:56] *** Emcy has quit IRC (Ping timeout: 480 seconds)
[11:02] *** Emcy has joined #archiveteam
[11:14] *** primus104 has joined #archiveteam
[11:23] *** signius has quit IRC (Ping timeout: 480 seconds)
[11:32] *** signius has joined #archiveteam
[11:33] *** ruukasu has quit IRC (Ping timeout: 265 seconds)
[11:40] *** Ymgve has joined #archiveteam
[11:51] *** ruukasu has joined #archiveteam
[11:58] *** ruukasu has quit IRC (Ping timeout: 265 seconds)
[12:57] *** xk_id has quit IRC (Remote host closed the connection)
[12:58] *** arbin_ has joined #archiveteam
[12:59] *** arbin has quit IRC (Read error: Operation timed out)
[13:07] *** human39 has joined #archiveteam
[13:18] *** ruukasu has joined #archiveteam
[13:24] *** APerti has quit IRC (Ping timeout: 378 seconds)
[13:24] *** APerti has joined #archiveteam
[13:52] *** primus104 has quit IRC (Leaving.)
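Annotation on the 06:12 suggestion to use wget for building a WARC: a minimal sketch of what such a crawl command might look like, assembled in Python so the flags can be reviewed before anything runs. The function name `build_wget_warc_command`, the output name `oasisjournals`, and the one-second wait are illustrative assumptions, not something anyone in the channel specified. The flags themselves are standard wget options, and the WARC ones need wget 1.14 or newer.

```python
import shlex


def build_wget_warc_command(start_url, warc_name, wait_seconds=1):
    """Return an argv list for a recursive wget crawl that also writes a WARC.

    Hypothetical helper: the name and defaults are illustrative only.
    """
    return [
        "wget",
        "--mirror",                  # recursive crawl, unlimited depth, timestamping
        "--page-requisites",         # also fetch images, CSS and scripts
        f"--wait={wait_seconds}",    # pause between requests to go easy on the server
        f"--warc-file={warc_name}",  # wget writes <warc_name>.warc.gz
        "--warc-cdx",                # also write a CDX index next to the WARC
        start_url,
    ]


if __name__ == "__main__":
    argv = build_wget_warc_command("http://www.oasisjournals.com/", "oasisjournals")
    # Print the command instead of running it, so it can be checked first.
    print(shlex.join(argv))
```

Passing the returned list to `subprocess.run(argv, check=True)` would start the crawl. A real grab would likely also need reject rules for the site's login and calendar-style URLs.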
[14:21] *** Aranje has quit IRC (Read error: Connection reset by peer)
[14:22] *** Aranje has joined #archiveteam
[14:29] *** K4k has joined #archiveteam
[14:33] *** Ravenloft has quit IRC (Ping timeout: 378 seconds)
[14:47] *** ruukasu has quit IRC (Quit: WeeChat 1.0.1)
[14:47] *** sankin has joined #archiveteam
[14:47] *** ruukasu has joined #archiveteam
[14:59] *** APerti has quit IRC (Ping timeout: 606 seconds)
[15:09] *** xk_id has joined #archiveteam
[15:09] *** xk_id has quit IRC (Remote host closed the connection)
[15:13] *** ruukasu has quit IRC (Quit: WeeChat 1.0.1)
[15:14] *** xk_id has joined #archiveteam
[15:15] *** ruukasu has joined #archiveteam
[15:16] i'm grabbing world.kbs.co.kr/english news pages
[15:16] *** xk_id has quit IRC (Read error: Operation timed out)
[15:17] the news articles go back to mid 2000
[15:25] *** xk_id has joined #archiveteam
[15:36] drupal sites tend to behave badly with pull because they don't return 404 for a lot of bogus urls
[15:36] with wpull
[15:39] *** aaaaaaaaa has joined #archiveteam
[15:54] *** the_fox has quit IRC (Read error: Connection reset by peer)
[15:56] *** the_fox has joined #archiveteam
[15:58] i'm finally getting missing episodes of cbs evening news for 2013-12
[16:00] DFJustin: what do they return?
[16:09] 200
[16:13] *** ruukasu has quit IRC (Ping timeout: 265 seconds)
[16:20] *** ruukasu has joined #archiveteam
[16:20] *** mistym has joined #archiveteam
[16:25] joepie91 wants me to relay that all bitcasa properties should be archived, https://pdf.yt/d/kdIrOFZdTzelLwY9 because they're financially insolvent
[16:25] they tried to get the court to strike that info, but judge rejected
[16:25] that link is the ruling, with the list of phrases bitcasa wanted struck
[16:26] (sorry for duplicate, I failed to follow his directions haha)
[16:36] *** bebzol has joined #archiveteam
[16:38] *** primus104 has joined #archiveteam
[16:39] arkiver: hi! i see that tracker for ownlog-grab has been set up
[16:40] what's next? Should I change it on github sources and run a few warrior instances?
[16:40] *** xk_id has quit IRC (Remote host closed the connection)
[16:54] We recently surveyed 231 developers and found that more than half of them still store and manage data on the device. Really? In 2014?
[16:54] With so many apps competing for on-device storage space, this no longer makes sense. And what happens when your users upgrade to a new device?
[16:54] how can you even write something like that
[16:56] sorry, mega -bs
[17:01] bebzol: you should talk to chfoo if you have a question about the tracker
[17:01] and you should also inform SketchCow (Jason Scott) about your project
[17:06] bebzol: if the scripts are ready and we get some sort of approval for rsync, i'll put it in the warrior
[17:18] i'm uploading 1964 newspapers of svoboda
[17:25] *** mistym has quit IRC (Remote host closed the connection)
[17:39] *** danneh_ has quit IRC (Ping timeout: 633 seconds)
[17:39] *** xk_id has joined #archiveteam
[17:42] *** Diesel_ has joined #archiveteam
[17:43] *** Diesel- has quit IRC (Read error: Connection reset by peer)
[17:44] *** danneh_ has joined #archiveteam
[18:20] *** mistym has joined #archiveteam
[18:20] *** mistym has quit IRC (Connection closed)
[18:24] *** APerti has joined #archiveteam
[18:24] *** mistym has joined #archiveteam
[18:31] According to a now deleted post, the admin with a history of nuking site databases he's in charge of has now stepped down from FurAffinity.
[18:31] So, assuming the post was truthful and the admin didn't slip in any back doors, it appears that FA is now out of imminent danger.
[18:37] aka perfect time to make a good archive
[18:39] Completely agree. Even without him, FA is still a poorly managed site with an increasingly discontent user base. But now there's no need to move quickly. We can take our time.
[18:39] :)\
[18:39] also
[18:39] did anybody take care of bitcasa yet
[18:50] Like, with weapons?
[18:50] of ass destruction?
[19:00] *** Jonimus has quit IRC (Write error: Broken pipe)
[19:01] *** Jonimus has joined #archiveteam
[19:17] SketchCow: weapons of mass archival
[19:17] SketchCow: bitcasa is insolvent, likely that they disappear very soon
[19:21] it did not look like real proof imo
[19:22] schbirid: ?
[19:22] https://pdf.yt/d/kdIrOFZdTzelLwY9 <- ?
[19:22] schbirid: yes?
[19:22] where does it say that bitcasa is insolvent?
[19:23] “Insolvent, and with no new venture funding in sight;”
[19:23] between 6 and 7 on page 1
[19:23] err
[19:23] page 2, sorry
[19:24] those are quotes from that other case which they seek to censor, unless i misunderstand it all
[19:24] hmmm
[19:24] not sure how big this paste is...
[19:24] oh ffs can't copy from that site properly
[19:24] schbirid: wha? they're arguing that these phrases should be sealed in the documents about the class action suit against bitcasa
[19:25] "the other case"?
[19:25] that case i meant
[19:25] yes?
[19:25] yes
[19:25] ...
[19:25] I don't understand what's unclear - bitcasa tries to get the statements about their insolvency in the class action case sealed
[19:25] how does this not show that bitcasa is insolvent?
[19:26] were those statements written by them or are they allegations?
[19:26] defendant Bitcasa, Inc. filed an administrative motion for leave to file under seal certain proposed redactions in (1) its opposition to a preliminary injunction and (2) a declaration filed in support of its opposition
[19:26] allegations...
[19:26] evidently not
[19:26] which the judge refused to blank...
[19:26] leaning towards them being truthful
[19:26] according to that sentence, these statements originate from bitcasa's opposition to the preliminary injunction
[19:27] thus, bitcasa or their representation made the statements
[19:27] lets -bs
[19:52] *** mistym_ has joined #archiveteam
[19:54] ^
[19:54] *** mistym has quit IRC (Ping timeout: 246 seconds)
[20:07] *** T31M has joined #archiveteam
[20:10] *** BlueMaxim has joined #archiveteam
[20:10] *** ruukasu has quit IRC (Ping timeout: 265 seconds)
[20:23] *** T31M has quit IRC (Quit: Leaving)
[20:25] *** mistym_ has quit IRC (Remote host closed the connection)
[20:28] *** Ravenloft has joined #archiveteam
[20:59] *** APerti has quit IRC (Read error: Operation timed out)
[20:59] *** dashcloud has quit IRC (Read error: Connection reset by peer)
[21:00] *** dashcloud has joined #archiveteam
[21:06] *** lukeman has quit IRC (Read error: Operation timed out)
[21:07] *** chazchaz has quit IRC (Read error: Operation timed out)
[21:07] *** warthurt has quit IRC (Read error: Operation timed out)
[21:09] *** Laverne has quit IRC (Ping timeout: 369 seconds)
[21:09] *** swebb has quit IRC (Read error: Operation timed out)
[21:09] *** RealMarc has quit IRC (Read error: Operation timed out)
[21:09] *** RealMarc has joined #archiveteam
[21:10] *** swebb has joined #archiveteam
[21:10] *** Cameron_D has quit IRC (Read error: Operation timed out)
[21:10] *** Laverne has joined #archiveteam
[21:10] *** dcmorton has quit IRC (Read error: Operation timed out)
[21:10] *** Cameron_D has joined #archiveteam
[21:10] *** lukeman has joined #archiveteam
[21:12] *** dcmorton has joined #archiveteam
[21:12] *** lemonkey has quit IRC (Read error: Operation timed out)
[21:14] *** JonimusP has joined #archiveteam
[21:14] *** lemonkey has joined #archiveteam
[21:15] *** warthurto has joined #archiveteam
[21:16] *** Meeh has joined #archiveteam
[21:17] *** BlueMaxim has quit IRC (Read error: Operation timed out)
[21:17] *** BlueMaxim has joined #archiveteam
[21:19] *** chazchaz has joined #archiveteam
[21:24] *** Meeh_ has quit IRC (Ping timeout: 1221 seconds)
[21:24] *** Jonimus has quit IRC (Ping timeout: 1221 seconds)
[21:40] *** K4k has quit IRC (Read error: Operation timed out)
[21:47] *** cbb has joined #archiveteam
[21:48] *** sankin has quit IRC (Leaving.)
[21:54] *** philpem has joined #archiveteam
[22:01] *** mistym has joined #archiveteam
[22:07] *** ruukasu has joined #archiveteam
[22:20] *** human39 has quit IRC (Leaving)
[22:49] *** godane has quit IRC (Ping timeout: 378 seconds)
[23:00] *** godane has joined #archiveteam
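Annotation on the 15:36 to 16:09 exchange (Drupal sites answering 200 instead of 404 for bogus URLs): a rough sketch of how one might probe for that behaviour before starting a crawl. The idea is to request a path that almost certainly does not exist and see whether the server still answers 200. The names `bogus_url`, `looks_like_soft_404`, and `http_status` and the `archive-probe-` path prefix are made up for illustration, and this is only one possible check, not how wpull or ArchiveBot handle it.

```python
import urllib.error
import urllib.request
import uuid
from urllib.parse import urljoin


def bogus_url(base_url):
    """Build a URL on the same host that should not exist (random path)."""
    return urljoin(base_url, "/archive-probe-" + uuid.uuid4().hex)


def http_status(url, timeout=10):
    """Return the HTTP status code for url, including error codes like 404."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        return err.code


def looks_like_soft_404(base_url, fetch_status=http_status):
    """True if the site answers 200 for a page that should be missing.

    fetch_status is any callable taking a URL and returning an int status,
    so the check can be exercised without touching the network.
    """
    return fetch_status(bogus_url(base_url)) == 200
```

If the probe comes back true, a crawler cannot rely on status codes to prune junk links on that site, so ignore patterns or a depth limit become more important.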