#archiveteam-bs 2016-09-26,Mon

↑back Search

Time Nickname Message
00:00 πŸ”— kvieta has joined #archiveteam-bs
00:23 πŸ”— arkiver HCross: well, flickr is currently paused because I need to have a look at a problem with WARCs being too smal
00:25 πŸ”— dashcloud has joined #archiveteam-bs
00:30 πŸ”— BlueMaxim has joined #archiveteam-bs
00:44 πŸ”— kvieta has quit IRC (Ping timeout: 370 seconds)
00:47 πŸ”— kvieta has joined #archiveteam-bs
01:21 πŸ”— kvieta has quit IRC (Read error: Operation timed out)
01:51 πŸ”— kvieta has joined #archiveteam-bs
03:52 πŸ”— wp494 has quit IRC (Read error: Connection reset by peer)
04:04 πŸ”— ndiddy has quit IRC (Ping timeout: 244 seconds)
04:12 πŸ”— mutoso_ has quit IRC (Read error: Connection reset by peer)
04:17 πŸ”— mutoso has joined #archiveteam-bs
04:21 πŸ”— yipdw huh neat, you can use colons in sqlite table names, and pretty much every sqlite tool that isn't the sqlite shell breaks in awesome ways
04:23 πŸ”— xmc deeelightful
04:43 πŸ”— Sk1d has quit IRC (Ping timeout: 194 seconds)
04:49 πŸ”— Sk1d has joined #archiveteam-bs
04:55 πŸ”— ranma has this been backed up? http://www.therobotsvoice.com/
04:55 πŸ”— ranma it's site that has shitty 15-short-paged articles
04:56 πŸ”— ranma stumbled across it linked from a stackexchange post on swearing from Firefly
05:05 πŸ”— ranma http://www.therobotsvoice.com/2010/11/fireflys_15_best_uses_of_chinese_profanity.php
05:05 πŸ”— ranma "I’ve given this site formerly known as Topless Robot three years of my life and hard work, and I wouldn’t trade them. I hoped that covering the subjects and culture that I love would sustain the site. For three years, it has β€” the three years it took to make The Force Awakens, no less. But all things must end. Today is the The Robot’s Voice’s final day of publication. After
05:05 πŸ”— ranma years of trying, we couldn’t make this work financially..."
05:06 πŸ”— ranma Thank you for reading the site, supporting it and creating a community here over the years. I spent more time each day with our regular commenters than I did with my own wife or family, so even though I don’t actually know all your real names, I’ll miss you. Sly, Timely, Abraxas, FakeAss, Gallen, Polk, Mindbender, Zoidberg, Canadian Scott, GrimlockPrime, and everyone else…I’ll
05:06 πŸ”— ranma never forget you. I stayed up until the early hours of the morning, created social media posts on weekends, ran from dinner tables when news happened, and generally made TR/TRV the focus of my life. You got 100 percent of me, like it or not. And I hope you did..."
05:06 πŸ”— ranma etc etc
05:07 πŸ”— ranma not sure if worth backing up
05:24 πŸ”— yipdw I'd argue it's more "worth backing up" than the latest leak of NSA documents or whatever
05:24 πŸ”— yipdw every nerd on the Internet gets off on saving a copy of those and then never reading them
05:25 πŸ”— ranma lol
05:25 πŸ”— yipdw fanworks though, they don't get much
05:25 πŸ”— ranma i presume this community had just a small reach
05:25 πŸ”— yipdw so in the long run we end up with thousands of copies of unknown integrity of one thing and significantly incomplete copies of everything else
05:25 πŸ”— yipdw so I threw that site into archivebot since that's what it was made for
05:26 πŸ”— ranma will try to keep that in mind!
05:27 πŸ”— yipdw also just for full disclosure, yes, I have a copy of the wikileaks insurance file
05:27 πŸ”— yipdw I too get off on that stuff
05:28 πŸ”— ranma i just don't want to throw EVERYTHING at archivebot
05:28 πŸ”— ranma still gauging what's worth time etc
05:28 πŸ”— ranma time, space
05:28 πŸ”— yipdw I might whine about it a lot but really it's better to just throw something in
05:28 πŸ”— yipdw we do have some limits like github/bitbucket links just making it a mess
05:29 πŸ”— ranma http://archive.fart.website/archivebot/viewer/ <-can incomplete URLs be searched?
05:29 πŸ”— yipdw hostnames only
05:30 πŸ”— yipdw but if you throw in "tumblr" you'll get all hostnames matching tumblr
05:30 πŸ”— ranma ah,yeah, that's what i was wondering
05:30 πŸ”— ranma if "digitalocean" would return all domains
05:30 πŸ”— ranma and subdomains
05:31 πŸ”— ranma is it not good at searching backups? or is everything backed up not necessarily tracked there?
05:31 πŸ”— ranma i'd assume digitalocean, linode (for their guides) have been backed up
05:31 πŸ”— yipdw that's just archivebot's catalog
05:31 πŸ”— yipdw there's a ton of other stuff that isn't in there
05:32 πŸ”— yipdw Warrior projects, works from other AT members, everything else in IA, ...
05:34 πŸ”— ranma how good is archivebot at backing up sites with dynamic "next page/more posts" buttons? https://www.digitalocean.com/community/tutorials
05:35 πŸ”— ranma at the end of the page is a js button "load more results"
05:35 πŸ”— yipdw it's not going to work
05:35 πŸ”— ranma damn
05:35 πŸ”— yipdw phantomjs mode just scrolls, there's no "click this button" function
05:36 πŸ”— yipdw if that button is actually an <a> you might have luck with phantomjs
05:36 πŸ”— yipdw I'm not sure
05:37 πŸ”— ranma <a class="load-more-results" href="javascript:void(0);">Load More Results</a>
05:41 πŸ”— ranma is there a way to archive site.com/dir2
05:41 πŸ”— ranma and site.com/dir2/sub1 site.com/dir2/sub2
05:41 πŸ”— ranma but not traverse back to site.com
05:41 πŸ”— ranma and not backup site.com/dir1, etc, linked from site.com
05:42 πŸ”— yipdw yes, !a https://site.com/dir2/
05:43 πŸ”— ranma does !ao only backup site.com/dir2/index.html + images/resources?
05:43 πŸ”— ranma or does it still spider
05:43 πŸ”— yipdw it's page plus prerequisites
05:43 πŸ”— yipdw https://archivebot.readthedocs.io/en/latest/commands.html#archiveonly
05:44 πŸ”— ranma it didn't make much sense to me :<
05:44 πŸ”— * ranma holds onto his butt and feeds archivebot something
05:47 πŸ”— ranma lol @ kebsonsecurity
05:47 πŸ”— ranma was just reading about their DDoS
06:32 πŸ”— Aranje has quit IRC (Quit: Three sheets to the wind)
06:50 πŸ”— wp494 has joined #archiveteam-bs
06:54 πŸ”— fie has joined #archiveteam-bs
07:08 πŸ”— HCross2 ranma: same few days as OVH got hit with 1.5Tbps
07:11 πŸ”— ranma wheee
07:11 πŸ”— ranma and the company i work for is banking on IoT
07:30 πŸ”— ravetcofx has quit IRC (Read error: Operation timed out)
08:05 πŸ”— xmc sets mode: +o yipdw
08:40 πŸ”— GE has joined #archiveteam-bs
09:00 πŸ”— midas they like DDoSing?
09:03 πŸ”— ranma their store salespeople are a bag of dicks, so i don't have much sympathy
09:05 πŸ”— ranma not implying i'm an aggressor. just don't like babysitting them
09:41 πŸ”— kurt has joined #archiveteam-bs
10:18 πŸ”— GE has quit IRC (Remote host closed the connection)
11:07 πŸ”— GE has joined #archiveteam-bs
11:47 πŸ”— kyounko has quit IRC (Read error: Operation timed out)
12:08 πŸ”— BlueMaxim has quit IRC (Quit: Leaving)
12:22 πŸ”— GE has quit IRC (Remote host closed the connection)
13:59 πŸ”— GE has joined #archiveteam-bs
14:34 πŸ”— Start has quit IRC (Quit: Disconnected.)
14:34 πŸ”— Start has joined #archiveteam-bs
14:35 πŸ”— Start has quit IRC (Client Quit)
14:45 πŸ”— achip has joined #archiveteam-bs
15:29 πŸ”— kurt has quit IRC (Remote host closed the connection)
16:15 πŸ”— VADemon has joined #archiveteam-bs
17:24 πŸ”— Swizzle has quit IRC (Quit: Leaving)
17:50 πŸ”— GE has quit IRC (Quit: zzz)
17:50 πŸ”— GE has joined #archiveteam-bs
18:03 πŸ”— VADemon has quit IRC (Read error: Operation timed out)
18:04 πŸ”— godane i'm at 889k items now
18:11 πŸ”— yipdw whoa https://mosh.org/
18:12 πŸ”— xmc yeah mosh is super nifty
18:12 πŸ”— yipdw I need to try this
18:12 πŸ”— xmc highly recommended
18:12 πŸ”— yipdw intermittent connectivity is the rule for me now and I'd love something that doesn't broken-pipe on me every time
18:13 πŸ”— xmc not sure how its prediction works with how fish likes to redraw the command line in various colors
18:13 πŸ”— xmc but it's worth a shot
18:13 πŸ”— yipdw I need to figure out how to make mosh work with siped
18:13 πŸ”— yipdw er spiped
18:13 πŸ”— Frogging mosh is amazing
18:14 πŸ”— yipdw maybe just tell mosh to connect via spiped and have networking work out the rest
18:14 πŸ”— Frogging the one issue I have with it is that it breaks scrolling, so you'll probably want to use tmux/screen with it
18:17 πŸ”— xmc i regularly use mosh on airplane wifi. it makes it tolerable.
18:18 πŸ”— yipdw oh nice, I guess mosh uses SSH to establish the initial connection and start mosh-server
18:18 πŸ”— yipdw so my existing spipe ProxyCommands work fine
18:20 πŸ”— yipdw oh my god this is amazing
18:21 πŸ”— xmc :D
18:22 πŸ”— yipdw my build server runs in online.net's Paris datacenter and you get some noticable lag on the Chicago -> Paris hop
18:22 πŸ”— yipdw but not here
18:28 πŸ”— yipdw this probably also means I can go back to using irssi
18:29 πŸ”— Frogging I would suggest trying out Weechat
18:29 πŸ”— xmc or irssi
18:32 πŸ”— ravetcofx has joined #archiveteam-bs
18:32 πŸ”— yipdw over the years I've come to know irssi fairly well so that's why
18:32 πŸ”— yipdw one of these days I'll try a new client
18:33 πŸ”— Frogging I used irssi up until I got a bouncer with multiple networks, and it didn't let connect to the same hostname multiple times. I wonder if they fixed that
18:33 πŸ”— Frogging didn't let me*
18:34 πŸ”— yipdw or I can be really obstinate and reinstall ircii
18:35 πŸ”— Frogging this is kind of neat :p http://tools.suckless.org/ii/
18:35 πŸ”— yipdw that kinda reminds me of trying to use Plan9
18:35 πŸ”— yipdw cool ideas in theory but I couldn't really integrate them comfortably
18:37 πŸ”— yipdw on the other hand, ii would probably make a good bot substrate
19:21 πŸ”— HCross2 godane: I'll buy you a cake when you get to one million items
19:57 πŸ”— SketchCow has joined #archiveteam-bs
19:57 πŸ”— midas sets mode: +o SketchCow
19:57 πŸ”— swebb sets mode: +o SketchCow
20:04 πŸ”— godane that should be around december sometime
20:05 πŸ”— godane based on me pushing for about 50k items a month
20:32 πŸ”— Stiletto has quit IRC ()
20:38 πŸ”— RichardG has quit IRC (Quit: Keyboard not found, press F1 to continue)
20:40 πŸ”— acridAxid has quit IRC (Quit: marauder)
20:41 πŸ”— acridAxid has joined #archiveteam-bs
20:51 πŸ”— Stiletto has joined #archiveteam-bs
21:03 πŸ”— ndiddy has joined #archiveteam-bs
21:05 πŸ”— computerf has quit IRC (Read error: Operation timed out)
21:08 πŸ”— computerf has joined #archiveteam-bs
21:08 πŸ”— RichardG has joined #archiveteam-bs
21:24 πŸ”— computerf has quit IRC (Read error: Operation timed out)
21:35 πŸ”— computerf has joined #archiveteam-bs
21:41 πŸ”— ndiddy has quit IRC (Quit: Leaving)
21:51 πŸ”— yeoldetoa has quit IRC (Remote host closed the connection)
21:52 πŸ”— kristian_ has joined #archiveteam-bs
22:22 πŸ”— BlueMaxim has joined #archiveteam-bs
22:33 πŸ”— Start has joined #archiveteam-bs
22:35 πŸ”— achip has quit IRC (Read error: Operation timed out)
22:37 πŸ”— GE has quit IRC (hub.efnet.us irc.Prison.NET)
23:07 πŸ”— SketchCow Hi Jason,
23:07 πŸ”— SketchCow I read that your group is archiving Gawker. I'm a documentary producer, have created events and series for History Channel, National Geographic, MTV and others, and produce feature/festival documentaries for Smithsonian Network etc.
23:07 πŸ”— SketchCow I am currently in pre-productions on a film about why Gawker and what you're doing are important.
23:07 πŸ”— SketchCow May we get on the phone so that I can tell you a bit more about the project?
23:07 πŸ”— SketchCow My goal is to interview you and document you and your volunteers saving history.
23:07 πŸ”— SketchCow ...
23:07 πŸ”— SketchCow My intention is to not respond, unless someone thinks different.
23:09 πŸ”— xmc approved βœ“
23:25 πŸ”— arkiver yeah, sounds interesting
23:25 πŸ”— xmc no i mean not responding is meeting with my approval
23:26 πŸ”— arkiver I think it sounds interesting
23:27 πŸ”— arkiver If he's positive about us, it would be nice to have us in a documentary
23:28 πŸ”— xmc i've been burned enough times
23:30 πŸ”— SketchCow Not responding.
23:30 πŸ”— SketchCow "Gawker" + "Documentary" = hellscape
23:31 πŸ”— arkiver :( ok
23:31 πŸ”— arkiver if it's only about gawker then not
23:32 πŸ”— arkiver but maybe he wants to do something more about web history in general too
23:32 πŸ”— xmc from the description it's a gawker documentary
23:32 πŸ”— SketchCow No.
23:32 πŸ”— SketchCow This will be a gawker documentary
23:32 πŸ”— arkiver oh well
23:32 πŸ”— arkiver I'm off anyway
23:32 πŸ”— arkiver have a good day all!
23:32 πŸ”— * arkiver zzzzzzz
23:32 πŸ”— SketchCow I'd rather watch my eye going through a spaghetti strainer with my remaining eye than be involved in anything glorifying gawker
23:33 πŸ”— arkiver ^ got it
23:33 πŸ”— SketchCow Hi Jason,
23:33 πŸ”— SketchCow I read that your group is archiving Gawker. I'm a documentary producer, have created events and series for History Channel, National Geographic, MTV and others, and produce feature/festival documentaries for Smithsonian Network etc.
23:33 πŸ”— SketchCow <3
23:33 πŸ”— SketchCow Added disk check to pipeline, by the way, sleepy
23:33 πŸ”— arkiver yes, saw it!
23:33 πŸ”— arkiver Looks awesome :D
23:33 πŸ”— arkiver thanks
23:52 πŸ”— RichardG has quit IRC (Quit: Keyboard not found, press F1 to continue)

irclogger-viewer