[00:13] *** BlueMaxim has joined #archiveteam-bs [00:36] *** ris has quit IRC () [00:42] *** JesseW has quit IRC (Ping timeout: 370 seconds) [00:47] *** JesseW has joined #archiveteam-bs [00:55] jeez I've been using heroku for a year now and I only now learned about heroku fork [00:55] * yipdw yay [00:55] this makes the couple-hundred-a-month more palatable [00:56] What's the plan for the kinja blogs? 😐 [00:57] I don't know but [00:57] you're not going to get them into archivebot unless one of the following happens: (1) someone adds incremental log upload to wpull; (2) someone adds a pipeline with an assload of disk space (where "assload" can be read as, say, >= 2 TB) [00:57] incremental metawarc creation is probably more accurate [00:57] wpull doesn't upload [00:59] alternatives: (3) someone combs the sitemaps and splits them up (Kinja blogs seem to be pretty good about that); (4) this is what godane is already doing so I say let him do his job, he's good at it [00:59] Is he doing the kinja user blogs too? [01:00] I don't know, ask him [01:01] *** JesseW has quit IRC (Ping timeout: 370 seconds) [01:02] hey all [01:04] i'm moving crap from one drive to other drives so i can start doing gawker stuff again [01:06] Are you doing the kinja user blogs? [01:07] not yet [01:22] i got my old hard drive finally format to ext4 [01:23] i'm hoping that there is not more weird system freeze when using this hard drive with ext4 [01:23] vs ntfs with ntfs-3g [01:25] And I finally got 1/3 of the way through archiving a big site with only a month left [01:26] *** j08nY has quit IRC (Remote host closed the connection) [01:41] godane: are you using a recent version? it's strange your computer would freeze with ntfs-3g since it's not a kernel driver, but a userspace one (with ext4, you'll run into permission problems if you swap the drives between computers) [01:43] i only have one computer using linux [01:43] a part of me think its cause my slax live scripts use overlay [01:44] and overlay and ntfs-3g both use fuse kernel module [01:44] maybe [01:58] *** schbirid has quit IRC (Ping timeout: 258 seconds) [02:01] *** JesseW has joined #archiveteam-bs [02:03] so i found a newspaper i could grab at some point [02:05] *** vitzli has joined #archiveteam-bs [02:06] do I need --phantom-js for twitter thread in #archivebot? [02:12] *** schbirid has joined #archiveteam-bs [02:20] it is a good idea [02:21] thank you, will do next time :) [02:24] i'm going after deadspin.com [02:26] Wait, you don't mean a print newspaper, do you? [02:29] *** JesseW has quit IRC (Ping timeout: 370 seconds) [02:32] yes [02:32] scans of print newspaper [03:02] *** VADemon has quit IRC (left4dead) [03:03] *** DoomTay has quit IRC (Quit: Page closed) [03:04] *** DoomTay has joined #archiveteam-bs [03:29] *** dashcloud has quit IRC (Read error: Operation timed out) [03:35] *** dashcloud has joined #archiveteam-bs [03:37] *** Swizzle has quit IRC (Read error: Operation timed out) [04:05] *** Sk1d has joined #archiveteam-bs [04:21] *** vitzli has quit IRC (Quit: Leaving) [04:41] *** hook54321 has quit IRC (Quit: Connection closed for inactivity) [04:43] *** tomwsmf-a has quit IRC (Ping timeout: 258 seconds) [04:49] *** JesseW has joined #archiveteam-bs [05:10] *** DoomTay has quit IRC (Quit: Page closed) [05:44] *** JesseW has quit IRC (Quit: Leaving.) [05:44] *** JesseW has joined #archiveteam-bs [06:32] *** JesseW has quit IRC (Ping timeout: 370 seconds) [08:17] *** dashcloud has quit IRC (Read error: Connection reset by peer) [08:17] *** dashcloud has joined #archiveteam-bs [09:41] hey cfoo, I just looked in the changelog for wpull and it says scripting support was removed? https://wpull.readthedocs.io/en/master/changelog.html#id1 "Removed: Lua scripting support and "its Python counterpart (--lua-script and --python-script) [09:41] am I mistaken or does this mean I can't use the python hooks to control it now? [09:46] hah I didn't notice that before [09:47] Ctrl-S___: "Scripting is now done using plugin interface via --plugin-script." [09:48] thanks. [09:49] Now i can go back to trying to helloworld with the hooks knowing they are supposed to work [10:00] *** dashcloud has quit IRC (Read error: Connection reset by peer) [10:02] *** dashcloud has joined #archiveteam-bs [10:03] Greetings from a plane [10:03] Hi SketchCow [10:13] Going to do a little hackery on a shaky plane. [10:14] https://de.flightaware.com/live/flight/JAL4 (Me) [10:15] *** whopper has joined #archiveteam-bs [10:50] *** dashcloud has quit IRC (Read error: Operation timed out) [10:50] *** dashcloud has joined #archiveteam-bs [10:58] *** Honno has joined #archiveteam-bs [11:17] *** Swizzle has joined #archiveteam-bs [12:11] *** Honno_ has joined #archiveteam-bs [12:12] *** ndiddy has quit IRC (Read error: Connection reset by peer) [12:12] *** zhongfu has quit IRC (Quit: No Ping reply in 180 seconds.) [12:14] *** DFJustin has quit IRC (Remote host closed the connection) [12:16] *** is-_ has joined #archiveteam-bs [12:16] *** wp494_ has joined #archiveteam-bs [12:17] *** vtyl has quit IRC (Read error: Connection reset by peer) [12:22] *** BlueMaxim has quit IRC (Quit: Leaving) [12:22] *** i0npulse has quit IRC (hub.efnet.us hub.dk) [12:22] *** pikhq has quit IRC (hub.efnet.us hub.dk) [12:22] *** Dark_Star has quit IRC (hub.efnet.us hub.dk) [12:22] *** PotcFdk has quit IRC (hub.efnet.us hub.dk) [12:22] *** coretx has quit IRC (hub.efnet.us hub.dk) [12:22] *** JordanJ2 has quit IRC (hub.efnet.us hub.dk) [12:22] *** altlabel has quit IRC (hub.efnet.us hub.dk) [12:22] *** Coderjoe has quit IRC (hub.efnet.us hub.dk) [12:22] *** luckcolor has quit IRC (hub.efnet.us hub.dk) [12:22] *** chfoo has quit IRC (hub.efnet.us hub.dk) [12:22] *** ring has quit IRC (hub.efnet.us hub.dk) [12:22] *** dan- has quit IRC (hub.efnet.us hub.dk) [12:22] *** Lord_Nigh has quit IRC (hub.efnet.us hub.dk) [12:22] *** SilSte has quit IRC (hub.efnet.us hub.dk) [12:22] *** Fletcher has quit IRC (hub.efnet.us hub.dk) [12:22] *** decay has quit IRC (hub.efnet.us hub.dk) [12:22] *** brayden has quit IRC (hub.efnet.us hub.dk) [12:22] *** wp494 has quit IRC (hub.efnet.us hub.dk) [12:22] *** xmc has quit IRC (hub.efnet.us hub.dk) [12:22] *** Baljem_ has quit IRC (hub.efnet.us hub.dk) [12:22] *** MrRadar has quit IRC (hub.efnet.us hub.dk) [12:22] *** Kenshin has quit IRC (hub.efnet.us hub.dk) [12:22] *** chazchaz_ has quit IRC (hub.efnet.us hub.dk) [12:22] *** Famicoma1 has quit IRC (hub.efnet.us hub.dk) [12:22] *** alfie has quit IRC (hub.efnet.us hub.dk) [12:22] *** joepie91 has quit IRC (hub.efnet.us hub.dk) [12:22] *** dxrt- has quit IRC (hub.efnet.us hub.dk) [12:22] *** jk[SVP] has quit IRC (hub.efnet.us hub.dk) [12:22] *** FalconK has quit IRC (Ping timeout: 276 seconds) [12:23] *** is- has quit IRC (Read error: Operation timed out) [12:23] *** Honno has quit IRC (Read error: Operation timed out) [12:23] *** is-_ is now known as is- [12:26] *** Start has quit IRC (Read error: Connection timed out) [12:30] Digging through the mail backlog. [12:36] *** jk[[SVP]] has joined #archiveteam-bs [12:38] *** jk[[SVP]] is now known as jk[SVP] [12:39] *** lytv has joined #archiveteam-bs [12:40] *** i0npulse has joined #archiveteam-bs [12:40] *** Coderjoe has joined #archiveteam-bs [12:40] *** luckcolor has joined #archiveteam-bs [12:40] *** chfoo has joined #archiveteam-bs [12:40] *** ring has joined #archiveteam-bs [12:40] *** Lord_Nigh has joined #archiveteam-bs [12:40] *** dan- has joined #archiveteam-bs [12:40] *** SilSte has joined #archiveteam-bs [12:40] *** Fletcher has joined #archiveteam-bs [12:40] *** decay has joined #archiveteam-bs [12:40] *** xmc has joined #archiveteam-bs [12:40] *** pikhq has joined #archiveteam-bs [12:40] *** Baljem_ has joined #archiveteam-bs [12:40] *** Dark_Star has joined #archiveteam-bs [12:40] *** MrRadar has joined #archiveteam-bs [12:40] *** Kenshin has joined #archiveteam-bs [12:40] *** PotcFdk has joined #archiveteam-bs [12:40] *** chazchaz_ has joined #archiveteam-bs [12:40] *** Famicoma1 has joined #archiveteam-bs [12:40] *** alfie has joined #archiveteam-bs [12:40] *** coretx has joined #archiveteam-bs [12:40] *** JordanJ2 has joined #archiveteam-bs [12:40] *** altlabel has joined #archiveteam-bs [12:40] *** joepie91 has joined #archiveteam-bs [12:40] *** hub.dk sets mode: +oooo Fletcher xmc Kenshin joepie91 [12:40] *** dxrt- has joined #archiveteam-bs [12:40] *** hub.dk sets mode: +o dxrt- [12:40] *** swebb sets mode: +o xmc [12:40] *** Fletcher_ sets mode: +o Fletcher [12:41] *** FalconK has joined #archiveteam-bs [12:42] *** Famicoma1 has quit IRC (Remote host closed the connection) [12:43] *** i0npulse has quit IRC (Remote host closed the connection) [12:44] *** signius_ has quit IRC (Ping timeout: 1208 seconds) [12:53] *** midas sets mode: +o Baljem_ [12:53] *** midas sets mode: +o joepie91 [12:53] *** signius has joined #archiveteam-bs [12:59] *** Start has joined #archiveteam-bs [13:01] *** DFJustin has joined #archiveteam-bs [13:01] *** swebb sets mode: +o DFJustin [13:01] *** slyphic has quit IRC (Read error: Operation timed out) [13:02] *** zhongfu has joined #archiveteam-bs [13:02] *** slyphic has joined #archiveteam-bs [13:04] *** i0npulse has joined #archiveteam-bs [13:13] *** VADemon has joined #archiveteam-bs [13:14] *** DFJustin has quit IRC (Read error: Connection reset by peer) [13:15] *** DFJustin has joined #archiveteam-bs [13:15] *** swebb sets mode: +o DFJustin [13:16] *** zhongfu has quit IRC (Quit: No Ping reply in 180 seconds.) [13:17] *** brayden has joined #archiveteam-bs [13:17] *** swebb sets mode: +o brayden [13:19] *** zhongfu has joined #archiveteam-bs [14:14] *** ndiddy has joined #archiveteam-bs [14:29] *** Famicoma1 has joined #archiveteam-bs [14:58] *** dashcloud has quit IRC (Read error: Operation timed out) [15:03] *** dashcloud has joined #archiveteam-bs [15:43] *** JesseW has joined #archiveteam-bs [15:44] *** DoomTay has joined #archiveteam-bs [15:50] *** JesseW has quit IRC (Ping timeout: 370 seconds) [16:25] Peak Hosting, an Oregon-based data center service provider, has filed for bankruptcy following the loss of a customer that was responsible for 80 percent of its revenue, Oregon Live reported, citing the company’s bankruptcy filing. [16:26] Ow [16:26] New project? [16:30] no idea, I don't know if they host anything of particular value that has been abandoned by its owners [16:31] Well that one guy may have been hosting a LOT of things if he alone resulted in that much revenue [16:33] joepie91, any idea of ASN? [16:34] DoomTay: the big customer was a gaming company [16:34] HCross: not a clue [16:35] http://www.thewhir.com/web-hosting-news/report-data-center-provider-peak-hosting-files-for-bankruptcy [16:35] Will do some digging around [16:36] AS33529 hmm [16:41] *** j08nY has joined #archiveteam-bs [17:00] *** Honno_ has quit IRC (Leaving) [17:00] *** Honno has joined #archiveteam-bs [17:05] *** sep332_ has joined #archiveteam-bs [17:07] *** vitzli has joined #archiveteam-bs [17:08] *** sep332 has quit IRC (Read error: Operation timed out) [17:11] IPs Originated (v4): 78,336 [17:11] well [17:58] *** anjacks0n has joined #archiveteam-bs [18:00] *** anjacks0n has quit IRC (anjacks0n) [18:01] *** anjacks0n has joined #archiveteam-bs [18:03] *** anjacks0n has quit IRC (anjacks0n) [18:06] *** ndiddy has quit IRC (Read error: Connection reset by peer) [18:08] *** anjacks0n has joined #archiveteam-bs [18:12] *** anjacks0n has quit IRC (anjacks0n) [18:13] *** tomwsmf-a has joined #archiveteam-bs [18:22] *** espes__ has quit IRC (Ping timeout: 244 seconds) [18:22] *** espes__ has joined #archiveteam-bs [18:27] *** Swizzle has quit IRC (Quit: Leaving) [18:30] *** ris has joined #archiveteam-bs [18:33] *** SDragon has left [18:44] *** dashcloud has quit IRC (Read error: Operation timed out) [18:47] *** dashcloud has joined #archiveteam-bs [18:53] from what i can see, it does not host many domains [18:53] I am making a list. [19:00] yeah, from my tests not a lot was on port 80 on those ranges [19:07] http://pastebin.com/fGLsTsed [19:16] *** vitzli has quit IRC (Leaving) [19:17] *** ndiddy has joined #archiveteam-bs [19:19] *** tomwsmf-a has quit IRC (Read error: Operation timed out) [19:22] aaaaand I just got 22TB hard drives for <400€ [19:23] whaaat [19:24] fffffff [19:24] is it 44 drives? ;) [19:24] 2x wd green 4 tb, 2x ghst 3tb, 1x hgst 4 tb [19:24] And here I was thinking it was going to be ~$1000 [19:24] that's 18tb? [19:25] ok, oly 18tb, I lost that last auction in the last second [19:25] but still [19:25] yeah, solid price [19:25] Last second losses are just evil [19:25] but I still have 3 unused wd Enterprise drives [19:25] yeah [19:26] but, meh, I got more than I expected, so no complains. [19:26] I actually dont have any use for them [19:26] besides IA.BAK [19:26] Make a machine to serve as an ArchiveBot pipeline [19:26] and that takes ages to fill up, b/c only 50m down [19:27] DoomTay: I am thinking about building a server and colocating it, but bw is so damn expensive... and my home connection sucks for this. 50down 10 up. [19:27] Ha, here I have 55mb down, 5mb up [19:28] or does anyone know a good colo provider for 2 HE and some 20 TB BW? [19:28] in Europe. [19:28] *** anjacks0n has joined #archiveteam-bs [19:29] *** dashcloud has quit IRC (Read error: Operation timed out) [19:35] Medowar, https://www.online.net/en/datacenter they arent amazing though [19:37] 11U are a bit too much. I am contacting core-backbone and see, if we can work out something. http://www.core-backbone.com/housing-2u/ [19:37] *** dashcloud has joined #archiveteam-bs [19:38] ooo. Contact https://www.m247.com/business/colocation and see what they can do [19:38] its UK though, so not cheap. Ask about their RO DC though [19:38] UK right now is too risky with the BREXIT coming up [19:38] true true. try https://www.m247.ro/en/business-colocation.php [19:38] *** tomwsmf-a has joined #archiveteam-bs [19:38] hm, how does Western digital advance RMA work. Can I literally have them send me a new drive before I have to unplug the old one? [19:40] yes and no. Depending on the drive type and if you are buisness or private customer [19:41] *** Aranje has quit IRC (Quit: Three sheets to the wind) [19:41] private, red 3tb. going through it, it seems to give me the option to do advance, but I haven't actually tried to go through with it as it doesn't really need doing just yet [19:41] id say unlikely [19:42] from personal experience: green/red/black/blue no, Enterprise Storage: yes [19:43] but also dependent on the merchant/distributor. Ingram Micro does it. [20:09] *** nickname_ has joined #archiveteam-bs [20:09] *** wp494_ is now known as wp494 [20:22] *** sep332 has joined #archiveteam-bs [20:26] *** sep332_ has quit IRC (Read error: Operation timed out) [20:32] have NPR podcasts been archived? [20:32] RE series? [20:32] WD RE [20:33] or is RE still consumer [20:37] I just realized that the Geocities Site Builder is proably lost forever [20:37] haven't been able to even find any screenshots of it [20:38] er, probably [20:39] ranma: RE is Enterprise Storage [20:40] *** dashcloud has quit IRC (Read error: Operation timed out) [20:41] prosumer? or full on enterprise? [20:42] wd treats it as enterprise. But is dependent on the merchant/distributor [20:42] *** dashcloud has joined #archiveteam-bs [20:46] jspiros: dang, you're probably right [20:48] *** Honno has quit IRC (Read error: Operation timed out) [20:53] *** schbirid has quit IRC (Quit: Leaving) [21:27] *** dashcloud has quit IRC (Read error: Operation timed out) [21:30] *** dashcloud has joined #archiveteam-bs [21:32] Does anyone know how to convert or play a *.wax file or *.rpm (not the package, the audio format) [21:42] Never mind, the original file is lost to time... [21:46] Is it? [21:46] Let me see [21:49] *** BlueMaxim has joined #archiveteam-bs [21:49] DoomTay: Certain portions of this program (http://www.npr.org/programs/morning-edition/2005/12/22/12927340/?showDate=2005-12-22) are only availible in *.wax and *.rpm format [21:50] Trying to play the *.wax file gives an error in windows media player [21:51] I managed to trace one RPM to http://download.npr.org/real.npr.na-central/npr/me/2005/12/20051222_me_13.rm [21:52] maybe that will work in VLC? I doubt it [21:52] VLC opened it, played me 2 seconds of gobbeldygook [21:52] same here, both vlc and mpv [21:52] *** anjacks0n has quit IRC (anjacks0n) [21:52] same here [21:52] What .wax file? I just skimmed the source code for that page and nothing [21:53] When it says only availible in "archive formats" it gives you two choices for format, and the *.wax file is the "Windows" button [21:55] it should play in realplayer though [21:56] i'll try realplayer [22:00] *** tomwsmf-a has quit IRC (Read error: Operation timed out) [22:05] *** anjacks0n has joined #archiveteam-bs [22:07] *** anjacks0n has quit IRC (Client Quit) [22:10] *** anjacks0n has joined #archiveteam-bs [22:26] godane, found some more newspapers for you http://feweek.co.uk/downloads/ [22:55] so looks like i really only have worry about gizmodo, deadspin, and jalopnik for gawker's sites [23:05] *** anjacks0n has quit IRC (anjacks0n) [23:06] *** JW_work has quit IRC (Read error: Operation timed out) [23:15] *** JW_work has joined #archiveteam-bs [23:16] *** fie has quit IRC (Leaving) [23:56] amazingly after much configuration, the *.rpm file worked in a Windows Server 2008 installation of RealPlayer [23:56] yow [23:56] however the Data Execution Prevention prevents me from using it [23:56] what version of RealPlayer? [23:56] so i am just going to make another virtual machine of windows 7 [23:56] dashcloud: latest version from their site [23:57] still is compatible with vista it seems