[00:14] so i got twilight 58 to mount
[00:14] i had to do this: mount TWILIGHT58B.ISO tmp -t iso9660
[00:15] old archlinux forum post helped me with this: https://bbs.archlinux.org/viewtopic.php?id=79401
[00:18] i'm uploading twilight iso 58 and 59 now
[00:24] SketchCow: you're going to be getting 6 hours of NBC Decision 78
[00:26] *** BlueMaxim has joined #archiveteam-bs
[01:29] *** nickname_ has quit IRC (Read error: Connection reset by peer)
[02:12] *** bwn_ has quit IRC (Read error: Operation timed out)
[02:19] *** bwn has joined #archiveteam-bs
[02:30] *** zenguy has joined #archiveteam-bs
[03:14] *** Stiletto has joined #archiveteam-bs
[04:38] *** dashcloud has quit IRC (Read error: Operation timed out)
[04:42] *** dashcloud has joined #archiveteam-bs
[04:56] *** Sk1d has quit IRC (Ping timeout: 194 seconds)
[05:02] *** Sk1d has joined #archiveteam-bs
[05:52] *** bwn_ has joined #archiveteam-bs
[05:54] *** bwn has quit IRC (Quit: Quit)
[06:09] *** Honno has joined #archiveteam-bs
[06:51] *** VADemon has quit IRC (Quit: left4dead)
[07:32] *** metalcamp has joined #archiveteam-bs
[07:47] *** schbirid has joined #archiveteam-bs
[07:54] *** Fletcher has quit IRC (Read error: Connection reset by peer)
[08:05] *** Fletcher has joined #archiveteam-bs
[08:06] *** Fletcher_ sets mode: +o Fletcher
[08:16] *** Stiletto has quit IRC (Read error: Operation timed out)
[09:24] *** bwn_ has quit IRC (Read error: Operation timed out)
[09:24] *** antomati_ is now known as antomatic
[09:38] *** bwn_ has joined #archiveteam-bs
[09:50] *** Medowar has joined #archiveteam-bs
[10:00] hey, anyone who's written warrior scripts, should i be including everything needed to view the page in each pageload or is it acceptable to just do a single grab of global assets (header images, site logos, etc)?
[10:01] I'd like to minimise the disk and network usage
[10:10] *** vitzli has joined #archiveteam-bs
[10:11] *** Stiletto has joined #archiveteam-bs
[10:11] *** arkiver2 has joined #archiveteam-bs
[10:13] Ctrl-S___: I did not write any, but duplicate links should be grabbed once. That is how wget works anyway
[10:14] okay, thanks. RE wget grabbing once: not when you're running it a thousand times
[10:14] use wpull
[10:14] i am
[10:15] I'm splitting up the job into ranges
[10:16] Will the WARC be screwy if i grab global assets separately from the normal range jobs?
[10:16] or can i just merge them all back together?
[10:26] *** arkiver3 has joined #archiveteam-bs
[10:26] *** arkiver2 has quit IRC (Ping timeout: 244 seconds)
[10:28] *** arkiver3 has quit IRC (Client Quit)
[10:33] Does the Archive Bot in case of PDFs grab the links in the PDF?
[10:33] I feel it is even more critical to have references from whitepapers than on sites.
[10:34] *** Stiletto has quit IRC (Read error: Operation timed out)
[11:29] IPFS is beautiful. Incorporating a lot of stuff I thought of and worked on for private stuff since 2008. Back then no one would use it even if there was a block chain. But hey...
[11:30] Storing a simple javascript enabled page that makes application like qr code generator, or markdown renderer for any file, ... a lot of power in small space, omnipresent. And that is beauty.
[12:03] Anyone in here up for running a NewsGrabber pipeline?
[12:04] how much b/w we talking?
[12:05] @ HCross2
[12:07] I would say, until we get more workers online, tops of 1TB per day or so
[12:07] Under 1tb per day
[12:10] It will use CPU though
[12:13] youch, ok
[12:13] i don't have that much room right now :/
[12:14] What do you have? There is something else you could do
[12:15] small b/w, small spare cpu atm
[12:16] :D
[12:16] 50Mb down, 3 up
[12:16] but if i max it 24/7 there'll be trouble (and a moaning wife)
[12:16] Don't even have real wall time free atm, moved house 2 weeks ago, still recovering
[12:18] HCross: what are we looking at in terms of RAM usage
[12:23] I got 100M with 20/20 at least working.
[12:23] Did not try speedtest, so it might have been limit of other side. I have storage problem and single core though, so I guess would be useless.
[12:24] Smiley: Trickle at 80% of actual speed helps ;)
[12:25] HCross: Can you talk a little bit more about this? Never ran a pipe, and no idea what is the work flow, how much intermediate storage is needed for this exact task, etc.
[12:26] Can potentially have multiple places with FTTB
[12:27] I never worked on anything much, never got to a software firm (except accounting and gov oriented software they are next to non-existent around), would like to "get in the loop".
[12:33] Kazzy, I ran it on 4GB, could get away with less though
[12:33] Domestic connections might not be good, as file sizes can get bit
[12:33] big
[12:33] anyone remind me the port for the warrior?
[12:34] i have a dedibox sitting idle, let me know what you need ^^
[12:34] Smiley: 8001
[12:34] hmmm crap, must have changed it
[12:34] vbox?
[12:34] yah
[12:34] headless
[12:34] Kazzy, cool
[12:34] i know how to fix, just been awhile since i ran it (as i moved house)
[12:34] come into #newsgrabber and I'll get you going
[12:34] there's port config somewhere, should be able to open it up and have a look
[12:35] oh yeah drat it might be on its own IP too lol
[12:40] *** Meroje has joined #archiveteam-bs
[12:59] *** bsmith093 has quit IRC (Ping timeout: 244 seconds)
[13:15] *** Honno has quit IRC (Quit: Leaving)
[13:17] *** bsmith093 has joined #archiveteam-bs
[13:21] About Archivebot: If I wanted a site that does have /blog/ and that should be included in crawl, will it get autoexcluded?
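Note on the 12:24 suggestion to run trickle at 80% of actual speed: a minimal sketch of the arithmetic, assuming the "50Mb down, 3 up" quoted at 12:16 means megabits per second and that the limiter expects rates in KB/s (as trickle's -d and -u options are understood to). The function name and the 0.8 default are illustrative, not taken from the log.

```python
from typing import Tuple


def capped_rates_kb(down_mbit: float, up_mbit: float,
                    fraction: float = 0.8) -> Tuple[int, int]:
    """Return (download, upload) caps in KB/s at `fraction` of link speed.

    Assumes decimal units: 1 Mbit/s = 1,000,000 bit/s = 125 KB/s.
    """
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")

    def to_kb_per_s(mbit: float) -> float:
        # Mbit/s -> bit/s -> byte/s -> KB/s
        return mbit * 1_000_000 / 8 / 1000

    return (round(to_kb_per_s(down_mbit) * fraction),
            round(to_kb_per_s(up_mbit) * fraction))


# Figures from the 12:16 message, assumed to be Mbit/s.
down_kb, up_kb = capped_rates_kb(50, 3)
print(down_kb, up_kb)
```

The two numbers would then be passed as the download and upload limits of whatever rate limiter wraps the grab.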
[13:24] *** dashcloud has quit IRC (Read error: Operation timed out)
[13:27] I do recommend checking this one out, some very useful info there, for those more close to hardware/software (I did also put that for archival): http://danluu.com/
[13:27] *** dashcloud has joined #archiveteam-bs
[13:49] *** Honno has joined #archiveteam-bs
[13:54] *** VADemon has joined #archiveteam-bs
[13:58] *** Stiletto has joined #archiveteam-bs
[14:21] https://programmers.stackexchange.com/questions/315810/is-there-a-good-reason-to-run-32-bit-software-instead-of-64-bit-on-64-bit-machin
[14:23] Yoshimura: surprisingly good accepted answer
[14:24] pretty much nails it
[14:25] Yeah, it is related to warrior.
[14:26] I was ... looking at porting everything to 64. But we can run 32bit on 64, including in Docker.
[14:27] Potentially having only one way to run stuff (in docker) (might be bashed for that idea). But overall using 64bit system maybe.
[15:18] *** BlueMaxim has quit IRC (Quit: Leaving)
[15:48] *** schbirid has quit IRC (Quit: Leaving)
[16:34] *** RichardG has quit IRC (Ping timeout: 633 seconds)
[16:36] Packer.io sucks. Cannot specify not to mount disk. Size 0 means smallest possible.
[16:37] If I use default sata, it also mounts iso as sata, which is again, not what I want.
[17:24] *** RichardG has joined #archiveteam-bs
[17:46] *** dashcloud has quit IRC (hub.efnet.us irc.colosolutions.net)
[17:46] *** zenguy has quit IRC (hub.efnet.us irc.colosolutions.net)
[17:46] *** jspiros has quit IRC (hub.efnet.us irc.colosolutions.net)
[17:46] *** yakfish has quit IRC (hub.efnet.us irc.colosolutions.net)
[17:46] *** matthusb- has quit IRC (hub.efnet.us irc.colosolutions.net)
[17:46] *** SadDM has quit IRC (hub.efnet.us irc.colosolutions.net)
[17:49] *** zenguy has joined #archiveteam-bs
[17:49] *** jspiros has joined #archiveteam-bs
[17:49] *** dashcloud has joined #archiveteam-bs
[17:49] *** matthusby has joined #archiveteam-bs
[17:49] *** yakfish has joined #archiveteam-bs
[17:50] *** SadDM has joined #archiveteam-bs
[17:50] *** swebb sets mode: +o SadDM
[18:20] *** vitzli has quit IRC (Quit: Leaving)
[18:42] *** bwn_ has quit IRC (Read error: Operation timed out)
[19:03] *** bwn_ has joined #archiveteam-bs
[19:31] Do you think it counts as 'fair use' when background music in a video game you're recording yourself playing gets flagged as copyrighted? (example: Legend of Zelda theme on the NES, or the Green Hill Zone music in Sonic the Hedgehog)
[19:32] Because youtube is pissing me off right no
[19:32] *now
[19:33] https://www.youtube.com/watch?v=7WvHYgQ7EHM&t=3m44s flagged
[19:33] :\
[19:33] s-video capture from a console I modded and will be selling off soo
[19:34] *soon
[19:35] should have used archive.org
[19:35] heh, will post it there too probably.
[19:36] I have a webm off of youtube on dropbox
[19:36] https://dl.dropboxusercontent.com/u/57311112/sonic1-svideo-em2861-chipset-capture.webm
[19:38] so i'm up to 2008-06-10 with funny or die archives
[19:39] I just cannot get packer running, yipdw_
[19:39] It creates a harddisk, no idea how to delete it, but if one wants stream optimized it does not provide any config.
[19:40] Tried deleting but complains about nonexistent controller. Just having one useless drive.
[19:41] *** Smiley has quit IRC (Ping timeout: 244 seconds)
[19:45] *** Smiley has joined #archiveteam-bs
[20:11] *** pwnsrv_ has joined #archiveteam-bs
[20:13] *** pwnsrv has quit IRC (Ping timeout: 250 seconds)
[20:17] *** bwn_ is now known as bwn
[20:58] *** Medowar has quit IRC (Quit: Connection closed for inactivity)
[21:04] SketchCow: i uploaded this video to your FOS : http://www.imdb.com/title/tt0169005/
[21:05] filename: Alien Abduction Tape 1983 [final].avi
[21:18] *** arkiver2 has joined #archiveteam-bs
[21:21] does anyone in northern california have a little server room with gigabit that I can use, I have data stuck in my ADSL blackhole
[21:21] I sent a hard drive to Delimiter to use their slot hosting service but I'm not sure there's anyone on the other end of the line
[21:22] ivan`, have you emailed Mike or Mark?
[21:23] HCross: I have not. I will if I don't hear from them on Monday
[21:23] Probably the best bet
[21:23] thanks
[21:23] Need email addresses?
[21:23] yes please
[21:30] Mark's quick to fix issues the moment you complain somewhere remotely public, if emails don't end up working
[21:32] heh
[21:32] is there anyone else who provides a service like that
[21:32] delimiter slot hosting i mean
[21:39] mr-b: macstadium told me they would accept a USB-powered drive and leave it attached for no monthly fee
[21:39] it comes out to $79/mo with unmetered gigabit
[21:39] but I don't know how happy they'd be about swapping drives back and forth
[21:40] seems cheap for real unmetered gig
[21:52] *** Honno has quit IRC (Read error: Operation timed out)
[21:54] yipdw_: Packer sucks, terrible.
[21:55] Would have to file a lot of issues and wait or just do it on my own in a while
[22:39] *** arkiver2 has quit IRC (Ping timeout: 244 seconds)
[23:19] *** atrocity has joined #archiveteam-bs
[23:19] awkward when you reboot and forget to restart your IRC client for like a day...
[23:30] *** dashcloud has quit IRC (Read error: Operation timed out)
[23:33] *** BlueMaxim has joined #archiveteam-bs
[23:34] *** dashcloud has joined #archiveteam-bs
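Note on the 10:00 to 10:16 exchange about splitting a grab into ranges and fetching global assets once: a rough sketch of how such a job plan might look. Everything here is hypothetical (the function names, the job dictionaries, the example URLs); it is not how any warrior project does it, and it does not answer whether the resulting WARCs merge cleanly, which the log leaves open.

```python
from typing import Dict, Iterator, List, Tuple


def split_ranges(first_id: int, last_id: int,
                 size: int) -> Iterator[Tuple[int, int]]:
    """Yield inclusive (start, end) ID ranges of at most `size` items."""
    if size < 1:
        raise ValueError("size must be at least 1")
    start = first_id
    while start <= last_id:
        end = min(start + size - 1, last_id)
        yield (start, end)
        start = end + 1


def plan_jobs(first_id: int, last_id: int, size: int,
              global_assets: List[str]) -> List[Dict]:
    """Build one shared-assets job followed by one job per ID range."""
    jobs: List[Dict] = []
    if global_assets:
        # dict.fromkeys drops duplicate URLs while keeping first-seen order,
        # so each shared asset is listed for download only once.
        jobs.append({"kind": "assets",
                     "urls": list(dict.fromkeys(global_assets))})
    for start, end in split_ranges(first_id, last_id, size):
        jobs.append({"kind": "range", "start": start, "end": end})
    return jobs


# Hypothetical example: 10 page IDs in chunks of 4, two shared assets.
for job in plan_jobs(1, 10, 4, ["https://example.org/logo.png",
                                "https://example.org/header.jpg",
                                "https://example.org/logo.png"]):
    print(job)
```

The range jobs would then skip the shared asset URLs, which is the disk and network saving asked about at 10:01.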