#archiveteam-bs 2016-06-08,Wed

↑back Search

Time Nickname Message
00:08 🔗 BlueMaxim has joined #archiveteam-bs
01:04 🔗 ranma has quit IRC (Read error: Operation timed out)
01:04 🔗 ranma_ has joined #archiveteam-bs
01:05 🔗 ranma_ is now known as ranma
01:14 🔗 JesseW has joined #archiveteam-bs
01:38 🔗 Ravenloft has quit IRC (Ping timeout: 244 seconds)
02:20 🔗 xmc has joined #archiveteam-bs
02:20 🔗 swebb sets mode: +o xmc
02:30 🔗 ranma what's the difference between having the bot back up a site vs waiting for archive.org crawl a site?
02:30 🔗 ranma is the bot more likely to grab all of the assets?
02:36 🔗 Stiletto has quit IRC (Read error: Operation timed out)
02:42 🔗 JesseW ranma: The Internet Archive does a wide variety of different sorts of crawls -- which ones were you thinking of?
02:43 🔗 JesseW The only one that is available to the general public is http://web.archive.org/save/ which only saves a single page at a time (it doesn't follow any links to other pages).
02:44 🔗 JesseW They also have the Archive-It service, which they sell access and support for to large institutions so the institutions can run crawls of specific sites or lists of sites they are interested in.
02:45 🔗 JesseW They also regularly crawl various lists of popular or otherwise important sites.
02:45 🔗 JesseW But those lists are not generally public, or subject to public suggestions.
02:46 🔗 JesseW They also re-check URLs they previously archived, although I don't know on what schedule.
02:46 🔗 JesseW And there are probably other crawls, too.
02:52 🔗 yipdw at this point IA has tools that are more likely to get more of a page
02:52 🔗 yipdw e.g. ivan has pointed out to me several IA tools that use Chromium as a crawler, which we could adapt for ArchiveBot
02:52 🔗 yipdw really it's just an augment
02:55 🔗 JesseW hm, I didn't know IA was actively using Chromium-using tools
02:55 🔗 JesseW Do you remember any links?
02:55 🔗 yipdw they're in the #archivebot logs
02:56 🔗 yipdw I don't know if they're actively being used
02:56 🔗 yipdw but they are in the internetarchive github account
02:56 🔗 JesseW Ah, ok.
02:57 🔗 bsmith093 has quit IRC (Ping timeout: 370 seconds)
02:58 🔗 bsmith093 has joined #archiveteam-bs
03:20 🔗 fie has joined #archiveteam-bs
03:52 🔗 BlueMaxim has quit IRC (Read error: Operation timed out)
03:53 🔗 BlueMaxim has joined #archiveteam-bs
04:00 🔗 Honno has joined #archiveteam-bs
04:15 🔗 Honno_ has joined #archiveteam-bs
04:17 🔗 Sk1d has quit IRC (Ping timeout: 250 seconds)
04:20 🔗 Honno has quit IRC (Ping timeout: 492 seconds)
04:28 🔗 Sk1d has joined #archiveteam-bs
04:28 🔗 Sk1d has quit IRC (Connection closed)
04:32 🔗 fie has quit IRC (Ping timeout: 244 seconds)
04:50 🔗 fie has joined #archiveteam-bs
05:05 🔗 Honno__ has joined #archiveteam-bs
05:07 🔗 Honno has joined #archiveteam-bs
05:08 🔗 jspiros has quit IRC (Read error: Operation timed out)
05:08 🔗 jspiros has joined #archiveteam-bs
05:13 🔗 Honno__ has quit IRC (Read error: Operation timed out)
05:17 🔗 Honno_ has quit IRC (Read error: Operation timed out)
05:26 🔗 froakie has joined #archiveteam-bs
05:27 🔗 froakie has quit IRC (Client Quit)
05:32 🔗 Honno_ has joined #archiveteam-bs
05:36 🔗 yakfish has quit IRC (Read error: Operation timed out)
05:39 🔗 yakfish has joined #archiveteam-bs
05:39 🔗 Honno has quit IRC (Read error: Operation timed out)
05:54 🔗 yipdw I feel like usage of python-pip on Debian is intentionally difficult because of some stupid easy_install/pip holy war
05:55 🔗 yipdw all I want is to be able to use pessimistic version constraints on Debian jessie
05:55 🔗 yipdw the version of pip shipped in jessie is too old for that and it's even too old to be upgraded via pip install --upgrade
06:03 🔗 JesseW has quit IRC (Ping timeout: 370 seconds)
06:03 🔗 bwn ugh.. I've had some.. issues with debian jessie as well
06:06 🔗 yipdw when developers of development tools bikeshed, everyone loses
06:07 🔗 yipdw it's too bad there's no way to redirect that misery to said developers
06:08 🔗 yipdw or, rather, in a constructive manner that doesn't involve internet deathmobs
06:09 🔗 tomwsmf-a has quit IRC (Read error: Operation timed out)
06:13 🔗 bwn hah, seems like it usually ends in pitchforks or yet another shed
06:43 🔗 dashcloud has quit IRC (Read error: Connection reset by peer)
06:44 🔗 dashcloud has joined #archiveteam-bs
07:29 🔗 balrog has quit IRC (Read error: Operation timed out)
07:30 🔗 BlueMaxim has quit IRC (Read error: Operation timed out)
07:32 🔗 BlueMaxim has joined #archiveteam-bs
07:35 🔗 balrog has joined #archiveteam-bs
07:35 🔗 swebb sets mode: +o balrog
07:44 🔗 schbirid has joined #archiveteam-bs
07:44 🔗 BlueMaxim has quit IRC (Read error: Operation timed out)
07:45 🔗 BlueMaxim has joined #archiveteam-bs
07:46 🔗 Cameron_D has quit IRC (Ping timeout: 370 seconds)
07:46 🔗 Cameron_D has joined #archiveteam-bs
07:48 🔗 midas i've started hating debian since they switched to systemd
07:48 🔗 midas dont like it.
07:48 🔗 midas at all.
07:56 🔗 schbirid mimimi
08:06 🔗 dashcloud has quit IRC (Read error: Operation timed out)
08:07 🔗 Atluxity systemd is nice
08:11 🔗 midas https://www.reddit.com/r/linux/comments/4lh7yv/systemd_developer_asks_tmux_and_other_programs_to/
08:11 🔗 dashcloud has joined #archiveteam-bs
08:17 🔗 Medowar systemd sucks and is designed bad, but there arent any better options right now. The init system is way too old, the code for it is horrible and it lacks a lot of modern functionality. So systemd is the thing we have to roll with right now, if we like it or not.
08:18 🔗 joepie91 Medowar: that's a non-argument
08:18 🔗 joepie91 the latter doesn't follow from the former
08:18 🔗 joepie91 there's no moratorium on developing better solutions
08:18 🔗 joepie91 so no, "we have to roll with it right now, whether we like it or not" is nonsense
08:19 🔗 joepie91 it is perfectly valid to refuse to use it, ESPECIALLY if it is trying to take over everything and drive out alternatives compatibility-wise, and at the same time get a better thing developed in some way
08:19 🔗 joepie91 that fixes the issues of old systems without introducing the new systemd issues
08:20 🔗 joepie91 going "oh well it's all we have right now so let's just shut up and use it" can be actively harmful
08:23 🔗 schbirid as a user i love most of systemd
08:23 🔗 schbirid journalctl dumping huge coredumps by default is stupid though
08:24 🔗 joepie91 I don't like how journalctl deals with large logs
08:24 🔗 joepie91 seeking to the end can take forever
08:28 🔗 Medowar joepie91: I am coming from a Supercomputer-background. For us, init was the thing that caused most of our problems. It is way too slow, the codebase is horrible to adapt, it is overly complicated, it lacks a lot of functinality.
08:29 🔗 Medowar We are using our own linux images and it was an absolute nightmare to build a new system(even though we based it on scientific linux) for new hardware or new software
08:29 🔗 Medowar optimizing for certain workloads was impossible.
08:30 🔗 Medowar When we implemented systemd, we saw a 9% performance plus.
08:30 🔗 Medowar Which, for our standarts is HUGE.
08:31 🔗 Medowar But yes, it is getting overly complicated, is introducing unnecesary depencies and has a questionable moral
08:31 🔗 Medowar But for us, it was the better choice, so we went with it
08:34 🔗 asie has joined #archiveteam-bs
08:34 🔗 asie hi
08:35 🔗 Medowar hi
08:58 🔗 JW_work1 has quit IRC (Read error: Operation timed out)
09:01 🔗 JW_work has joined #archiveteam-bs
09:20 🔗 jut has joined #archiveteam-bs
10:13 🔗 SN4T14 has quit IRC (west.us.hub irc.mzima.net)
10:13 🔗 xXx_ndidd has quit IRC (west.us.hub irc.mzima.net)
10:13 🔗 mutoso has quit IRC (west.us.hub irc.mzima.net)
10:17 🔗 SN4T14 has joined #archiveteam-bs
10:17 🔗 xXx_ndidd has joined #archiveteam-bs
10:17 🔗 mutoso has joined #archiveteam-bs
10:48 🔗 midas has quit IRC (Read error: Operation timed out)
10:48 🔗 midas has joined #archiveteam-bs
10:58 🔗 Honno__ has joined #archiveteam-bs
10:58 🔗 Honno_ has quit IRC (Read error: Connection reset by peer)
11:03 🔗 antomati_ is now known as antomatic
11:39 🔗 hictooth_ has quit IRC (Remote host closed the connection)
11:56 🔗 dashcloud has quit IRC (Read error: Operation timed out)
12:00 🔗 dashcloud has joined #archiveteam-bs
12:02 🔗 n00bLurke has joined #archiveteam-bs
12:10 🔗 n00bLurke has quit IRC (n00bLurke)
12:21 🔗 arkiver has quit IRC (Ping timeout: 257 seconds)
12:22 🔗 sigkell has quit IRC (Ping timeout: 260 seconds)
12:22 🔗 sigkell_ is now known as sigkell
12:33 🔗 sigkell_ has joined #archiveteam-bs
12:36 🔗 arkiver has joined #archiveteam-bs
12:56 🔗 n00bLurke has joined #archiveteam-bs
13:04 🔗 Honno__ has quit IRC (Ping timeout: 492 seconds)
13:20 🔗 dashcloud has quit IRC (Read error: Operation timed out)
13:24 🔗 dashcloud has joined #archiveteam-bs
13:50 🔗 Boppen has joined #archiveteam-bs
13:52 🔗 Sk1d has joined #archiveteam-bs
13:55 🔗 Boppen has quit IRC (Ping timeout: 190 seconds)
13:58 🔗 Sk1d has quit IRC (Ping timeout: 190 seconds)
13:59 🔗 Sk1d has joined #archiveteam-bs
14:20 🔗 Stiletto has joined #archiveteam-bs
14:21 🔗 Boppen has joined #archiveteam-bs
14:32 🔗 Boppen has quit IRC (Ping timeout: 190 seconds)
14:52 🔗 Cameron_D has quit IRC (Ping timeout: 370 seconds)
14:52 🔗 Cameron_D has joined #archiveteam-bs
14:57 🔗 Boppen has joined #archiveteam-bs
15:05 🔗 Sk1d has quit IRC (Ping timeout: 190 seconds)
15:22 🔗 Stiletto has quit IRC (Read error: Operation timed out)
15:22 🔗 Stiletto has joined #archiveteam-bs
15:30 🔗 Sk1d has joined #archiveteam-bs
15:36 🔗 Boppen has quit IRC (Ping timeout: 190 seconds)
15:38 🔗 Sk1d has quit IRC (Ping timeout: 190 seconds)
15:38 🔗 Boppen has joined #archiveteam-bs
15:39 🔗 Sk1d has joined #archiveteam-bs
15:53 🔗 JesseW has joined #archiveteam-bs
16:03 🔗 VADemon has joined #archiveteam-bs
16:08 🔗 Boppen has quit IRC (Ping timeout: 190 seconds)
16:09 🔗 BlueMaxim has quit IRC (Quit: Leaving)
16:09 🔗 Sk1d has quit IRC (Ping timeout: 190 seconds)
16:23 🔗 JesseW has quit IRC (Ping timeout: 370 seconds)
16:31 🔗 godane Decentralized Web Summit - Live From The Internet Archive : https://www.youtube.com/watch?v=Yth7O6yeZRE
16:37 🔗 Boppen has joined #archiveteam-bs
16:37 🔗 Sk1d has joined #archiveteam-bs
16:42 🔗 SilSte has quit IRC (Ping timeout: 633 seconds)
16:46 🔗 Boppen has quit IRC (Ping timeout: 190 seconds)
16:46 🔗 Boppen has joined #archiveteam-bs
16:46 🔗 Sk1d has quit IRC (Ping timeout: 190 seconds)
16:48 🔗 Sk1d has joined #archiveteam-bs
16:52 🔗 dan- has quit IRC (Ping timeout: 260 seconds)
16:54 🔗 MrRadar godane: Ironcially streaming on Youtube, probably the largest single point of failure in the modern web
16:55 🔗 dan- has joined #archiveteam-bs
17:12 🔗 yakfish has quit IRC (Read error: Operation timed out)
17:15 🔗 Sk1d has quit IRC (Ping timeout: 190 seconds)
17:16 🔗 yakfish has joined #archiveteam-bs
17:19 🔗 Boppen has quit IRC (Ping timeout: 190 seconds)
17:43 🔗 Boppen has joined #archiveteam-bs
17:45 🔗 Sk1d has joined #archiveteam-bs
17:50 🔗 yipdw "Debian packaging is not that hard."
17:50 🔗 yipdw ha
17:53 🔗 Sk1d has quit IRC (Ping timeout: 190 seconds)
17:53 🔗 Boppen has quit IRC (Ping timeout: 190 seconds)
18:09 🔗 midas who said that yipdw ?
18:11 🔗 yipdw https://wiki.debian.org/HowToPackageForDebian
18:12 🔗 midas all lies and slander
18:14 🔗 yipdw well I gotta build them for something so
18:16 🔗 Sk1d has joined #archiveteam-bs
18:17 🔗 Boppen has joined #archiveteam-bs
18:17 🔗 yipdw this is probably easy in the easy case, but I need to have a package install several systemd unit files
18:17 🔗 yipdw the documentation around dh_systemd is uh
18:17 🔗 yipdw it's wiki
18:26 🔗 Boppen has quit IRC (Ping timeout: 190 seconds)
18:27 🔗 Sk1d has quit IRC (hub.se irc.du.se)
18:40 🔗 tomwsmf-a has joined #archiveteam-bs
18:43 🔗 xXx_ndidd has quit IRC (Read error: Operation timed out)
18:50 🔗 dashcloud has quit IRC (Read error: Operation timed out)
18:51 🔗 godane looks like Watch Dogs 2 is SF
18:51 🔗 godane *is in SF
18:52 🔗 Frogging it's been announced?
18:52 🔗 HCross It would be really cool to see the IA in there
18:52 🔗 godane all i can think is a mod to put Internet Archive Building into it
18:52 🔗 godane https://www.youtube.com/watch?v=m2qEYCuFxGs
18:54 🔗 dashcloud has joined #archiveteam-bs
18:55 🔗 Frogging it'd be cool if they did somewhere in Canada
18:55 🔗 Frogging just for a random change :p
19:02 🔗 dashcloud has quit IRC (Read error: Operation timed out)
19:06 🔗 dashcloud has joined #archiveteam-bs
19:15 🔗 ranma cough
19:15 🔗 ranma https://i.imgur.com/HLp8z11.jpg
19:15 🔗 * ranma ducks
19:24 🔗 closure has joined #archiveteam-bs
19:44 🔗 dashcloud has quit IRC (Read error: Operation timed out)
19:48 🔗 jut has quit IRC (Leaving)
19:49 🔗 Simpbra1 has quit IRC (Leaving)
19:49 🔗 mutoso has quit IRC (Read error: Operation timed out)
19:52 🔗 mutoso has joined #archiveteam-bs
20:06 🔗 dashcloud has joined #archiveteam-bs
20:35 🔗 schbirid has quit IRC (Quit: Leaving)
20:59 🔗 dashcloud has quit IRC (Read error: Operation timed out)
21:03 🔗 dashcloud has joined #archiveteam-bs
21:22 🔗 Simpbrain has joined #archiveteam-bs
21:38 🔗 ndiddy has joined #archiveteam-bs
22:01 🔗 HCross Hi all. Got an old PC sat in the corner with 400Gb or so of disk, and was thinking of throwing it on internetarchive.bak - is that still a thing? If so, would a system on a 200/12 domestic home connection work?
22:04 🔗 JW_work 400Gb seems a bit small, but the network connection should be OK. (This is a mostly ignorant opinion, though)
22:05 🔗 HCross yeah, its just an old PC that I got donated from a friend who isnt using it
22:11 🔗 JW_work AFAIK, IA.BAK is still a thing, it's just in hibernation mainly due to needing either lots more donated space and/or significant improvements to ease-of-installation. But I haven't looked into it in detail, so this may be wrong.
22:14 🔗 n00bLurke has quit IRC (n00bLurke)
22:14 🔗 HCross JW_work, thanks. Going to have a look at setting it up and seeing what happens
22:15 🔗 JW_work nice!
22:16 🔗 JW_work I'm interested in doing so too — but I'll probably buy a 1TB drive and just use that.
22:16 🔗 HCross the script is just doing its SSH thing now
22:23 🔗 dashcloud has quit IRC (Read error: Operation timed out)
22:28 🔗 dashcloud has joined #archiveteam-bs
22:35 🔗 xXx_ndidd has joined #archiveteam-bs
22:36 🔗 HCross http://paste.harrycross.me/view/b011721a
22:37 🔗 HCross JW_work, seem to have run into an issue
22:38 🔗 JW_work that does look like a bug :-)
22:38 🔗 JW_work IDK more than that.
22:39 🔗 ndiddy has quit IRC (Read error: Operation timed out)
22:39 🔗 JW_work I may look into it more this evening, but there are likely more knowledgable people here.
22:40 🔗 HCross JW_work, restarted it and it seems to now be downloading away
22:41 🔗 JW_work still worth opening a ticket in the appropriate repo
22:41 🔗 HCross or not
22:42 🔗 HCross nvm, shuf is doing its thing slowly
22:58 🔗 HCross JW_work, now verification is failing all over the place
22:58 🔗 JW_work we should probably move this to #internetarchive.bak
23:24 🔗 yakfish has quit IRC (Read error: Operation timed out)
23:32 🔗 yakfish has joined #archiveteam-bs
23:51 🔗 godane so looks like MBC Newsdesk for 2003-09-11 only has have of the broadcast for some reason
23:53 🔗 godane there a are even pictures of weather forecast thats not in video on there pages for that date

irclogger-viewer