#archiveteam-bs 2020-03-08,Sun

↑back Search

Time Nickname Message
00:05 🔗 Raccoon has joined #archiveteam-bs
00:21 🔗 Ryz Time to work on writing that M&A article soon - I have a list of sources I picked up while constantly archiving those goods~
00:40 🔗 bitbit has quit IRC (Quit: Leaving)
00:55 🔗 Pixi has quit IRC (Read error: Operation timed out)
01:06 🔗 Pixi has joined #archiveteam-bs
01:16 🔗 VADemon has joined #archiveteam-bs
01:42 🔗 pew has quit IRC (Ping timeout: 276 seconds)
01:54 🔗 pew has joined #archiveteam-bs
04:21 🔗 qw3rty__ has joined #archiveteam-bs
04:29 🔗 qw3rty_ has quit IRC (Read error: Operation timed out)
06:28 🔗 HP_Archiv has joined #archiveteam-bs
06:28 🔗 HP_Archiv has quit IRC (Read error: Connection reset by peer)
06:29 🔗 HP_Archiv has joined #archiveteam-bs
06:30 🔗 HP_Archiv has quit IRC (Client Quit)
06:30 🔗 HP_Archiv has joined #archiveteam-bs
06:32 🔗 HP_Archiv has quit IRC (Client Quit)
06:33 🔗 HP_Archiv has joined #archiveteam-bs
06:33 🔗 HP_Archiv has quit IRC (Read error: Connection reset by peer)
06:33 🔗 HP_Archiv has joined #archiveteam-bs
06:33 🔗 HP_Archiv has quit IRC (Client Quit)
07:01 🔗 HP_Archiv has joined #archiveteam-bs
07:01 🔗 HP_Archiv has quit IRC (Read error: Connection reset by peer)
10:07 🔗 BlueMax has quit IRC (Read error: Connection reset by peer)
11:57 🔗 morgandaw has joined #archiveteam-bs
12:03 🔗 MeeDee has quit IRC (Read error: Operation timed out)
13:37 🔗 icedice has joined #archiveteam-bs
13:38 🔗 MeeDee has joined #archiveteam-bs
13:39 🔗 morgandaw has quit IRC (Ping timeout: 276 seconds)
13:50 🔗 VADemon has quit IRC (left4dead)
14:05 🔗 themadpro has joined #archiveteam-bs
14:20 🔗 atphoenix quite the list here https://www.zdnet.com/article/a-list-of-security-conferences-canceled-or-postponed-due-to-coronavirus-concerns/
14:23 🔗 VADemon has joined #archiveteam-bs
14:32 🔗 nicolas17 atphoenix: oh cool a list of cv-related sites
14:32 🔗 nicolas17 for your consideration https://coronainfo.xyz/ I don't know about its reliability
14:44 🔗 bitbit has joined #archiveteam-bs
14:58 🔗 Mateon1 has quit IRC (Read error: Connection reset by peer)
14:58 🔗 Mateon1 has joined #archiveteam-bs
15:30 🔗 themadpro has quit IRC (Quit: This computer has gone to sleep)
15:35 🔗 wyatt8740 has quit IRC (Read error: Operation timed out)
15:37 🔗 themadpro has joined #archiveteam-bs
15:39 🔗 themadpro has quit IRC (Client Quit)
15:41 🔗 wyatt8740 has joined #archiveteam-bs
15:51 🔗 atphoenix another cv-related tech conference change list https://www.zdnet.com/article/sxsw-canceled-due-to-coronavirus-all-the-2020-tech-conference-cancellations-and-travel-bans/
15:52 🔗 themadpro has joined #archiveteam-bs
16:17 🔗 themadpro has quit IRC (Quit: This computer has gone to sleep)
16:23 🔗 themadpro has joined #archiveteam-bs
16:45 🔗 themadpro has quit IRC (Quit: This computer has gone to sleep)
16:56 🔗 themadpro has joined #archiveteam-bs
18:13 🔗 mtntmnky has quit IRC (Remote host closed the connection)
18:13 🔗 mtntmnky has joined #archiveteam-bs
18:27 🔗 Dallas has joined #archiveteam-bs
18:41 🔗 Video_ has joined #archiveteam-bs
19:54 🔗 d5f4a3622 has quit IRC (Read error: Connection reset by peer)
19:56 🔗 d5f4a3622 has joined #archiveteam-bs
20:22 🔗 bitbit has quit IRC (Remote host closed the connection)
20:34 🔗 betamax regarding the discussion on #archiveteam re: BBC Mixital, I was the one who initially made the page on the wiki for it
20:34 🔗 betamax then subsequently realised I just didn't have the time to work on it
20:35 🔗 betamax I don't think the discussion around our approach to archiving it ever got past the "lets decide on a name for the channel" :)
20:35 🔗 betamax a lot will be difficult to get as it's all JS
20:36 🔗 betamax but the fanfic, reports and videos should be do-able
20:37 🔗 OrIdow6 A full archive looks like it would require a lot of human effort
20:37 🔗 betamax cc: themadpro, JAA, PurpleSym (tagging you all as you've edited the wiki page)
20:37 🔗 OrIdow6 Might have to split off into subgroups to work on each of the different channels (which seem to be more or less separate pieces of software, with a few superificial qualities and e.g. the view counter and comments in common)
20:38 🔗 OrIdow6 Which porbably won't happen
20:38 🔗 OrIdow6 *probably
20:38 🔗 betamax shall we create a channel to avoid clogging up -bs? how about #mixdown ?
20:39 🔗 betamax (although I'm trying to avoid becoming to active in this since I have very little free time)
20:44 🔗 themadpro same here
20:45 🔗 themadpro at least for this week
20:45 🔗 themadpro #mixdown does sound nice but hear me out
20:45 🔗 themadpro #missal
20:45 🔗 balrog has quit IRC (Quit: Bye)
20:46 🔗 themadpro btw I couldn't finish the wiki page but in summary they seem to come in three types:
20:48 🔗 themadpro 1) Text
20:48 🔗 themadpro 1) Text, 2) Media (Photo/Video/Artwork), 3) "Interactives" (Moviemakers, level designers and a "dancing robot maker")
20:48 🔗 balrog has joined #archiveteam-bs
20:51 🔗 themadpro The first two categories should be pretty trivial but I fear the third might be particularly challenging
20:52 🔗 themadpro And they knew, the FAQ both mentions how to download your stuff and how to "screen-cap" specifically for one of the interactives
21:01 🔗 OrIdow6 They say that there are millions, but I only count 32350 with the search API
21:02 🔗 themadpro yeah, I don't think it's in the millions, it shouldn't be too heavy of a task
21:03 🔗 Lord_Nigh has quit IRC (Quit: ZNC - http://znc.in)
21:04 🔗 Lord_Nigh has joined #archiveteam-bs
22:17 🔗 JAA phuzion: Thanks, I've thrown it into ArchiveBot. That will not be complete though as in-thread pagination of replies as well as posts on user profiles are done with JS.
22:18 🔗 JAA (That's for http://activedir.org/ for reference.)
22:27 🔗 Video_ has quit IRC (Leaving)
22:29 🔗 wyatt8740 has quit IRC (Read error: Operation timed out)
22:36 🔗 Sanqui where www. and non-www domains exist, which one do we prefer
22:36 🔗 Sanqui and does wayback show when the "other version" is available?
22:37 🔗 atphoenix Sanqui, I have not seen WBM offer alternative equivalent domains even when they exist. IME you need to know about them.
22:37 🔗 Sanqui darn
22:37 🔗 JAA The WBM strips the protocol, any authentication data, and a leading "www." from all URLs when you access it.
22:38 🔗 JAA It's just not very obvious.
22:38 🔗 Sanqui oh!
22:38 🔗 Sanqui that's good to know
22:38 🔗 Sanqui that weakly implies we should omit www. too
22:39 🔗 bitbit has joined #archiveteam-bs
22:39 🔗 JAA For example, https://web.archive.org/web/20200308101216/http://example.org/ can also be accessed as https://web.archive.org/web/20200308101216/something://foo:bar@www.example.org/
22:39 🔗 JAA Which means that in the rare case of the www domain serving different content from the non-www, you get it all mixed on the WBM.
22:40 🔗 Sanqui I know WBM also ignores case, which is arguably a bigger issue.
22:40 🔗 Sanqui And probably http and https.
22:40 🔗 JAA Yep
22:40 🔗 Sanqui But well, as long as the underlying WARCs are sane, it can all be fixed one day.
22:40 🔗 Sanqui I'm going to be stripping out www. then
22:40 🔗 JAA (Narrator: It was never fixed.)
22:40 🔗 Sanqui lol.
22:41 🔗 JAA The case collapsing is a serious issue, cf. Picosong for example.
22:41 🔗 VADemon There was an article in favor of keeping "www" but i cant find it because search engines
22:41 🔗 Sanqui There are articles against keeping www, for keeping www, and for using double www.www.
22:42 🔗 wyatt8740 has joined #archiveteam-bs
22:42 🔗 JAA http://no-www.org/ and https://web.archive.org/web/20081217064517/http://www.www.extra-www.org/ from back in the day. :-)
22:43 🔗 JAA I'm in the former camp.
22:43 🔗 JAA But for archival, well, whatever's appropriate for the particular site. Sometimes there are hard links with a particular spelling in the HTML and the site is served on both, so that then prescribes which to use.
22:43 🔗 JAA If both versions are used, welp.
22:44 🔗 JAA (At least when talking about AB. Custom tools can of course do recursion on a set of domains instead.)
22:47 🔗 Sanqui Yup, yup.
22:47 🔗 Sanqui Cheers.
22:53 🔗 atphoenix betamax, themadpro OrIdow6 : on mixital: #mixedup #mixitdown #mixitnone #unmixed #unmixit
22:53 🔗 bitbit urls should are case sensitive?
22:54 🔗 nicolas17 URLs are case sensitive except for the hostname
22:54 🔗 nicolas17 seems WBM works differently :)
22:54 🔗 atphoenix URL case sensitivity depends on the web server. Some web servers can be configured to be case-agnostic
22:55 🔗 nicolas17 a webserver running on Windows and serving static files will probably be case-insensitive in practice
22:55 🔗 bitbit yikes. I think I messed up with a proxy I wrote a while ago for a workplace then
22:55 🔗 nicolas17 (I think IIS used to allow backslashes in place of slashes too D:)
22:55 🔗 JAA *URLs* are always case sensitive. What the web server does with a particular HTTP request is unrelated to URL syntax.
22:55 🔗 nicolas17 ^
22:56 🔗 themadpro has quit IRC (Quit: This computer has gone to sleep)
22:56 🔗 atphoenix ^that says it better
22:56 🔗 bitbit double www sounds pretty crazy
22:56 🔗 atphoenix and double www is crazy
22:57 🔗 nicolas17 yesterday I reviewed *.kde.org and there's some www -> no www redirects and some the other way
22:58 🔗 nicolas17 er
22:58 🔗 nicolas17 kde domains*
22:58 🔗 JAA Yes, double www is crazy. One should use as many wwws as DNS allows.
22:58 🔗 nicolas17 calligra.org -> www.calligra.org; www.kde.org -> kde.org
22:58 🔗 atphoenix anyhow Apache webservers can do lots of fancy stuff with mod_redirect
22:59 🔗 atphoenix I mean mod_rewrite
23:00 🔗 nicolas17 mod_rewrite is voodoo
23:00 🔗 JAA And this discussion is getting into -ot territory.
23:00 🔗 nicolas17 according to its official documentation
23:00 🔗 bitbit I'm pretty surprised that "we" managed to get the entire world to even accept "https://www." ... it's such an awkward technical thing to have on every billboard, etc.
23:00 🔗 nicolas17 JAA: true
23:13 🔗 betamax in the interests of making progress with Mixital, I've made a channel on hackint: #missitall
23:13 🔗 betamax cc: themadpro, OrIdow6, JAA, PurpleSym
23:14 🔗 JAA On hackint, I hope?
23:14 🔗 betamax yup
23:14 🔗 JAA :-)
23:16 🔗 jamrs has quit IRC (Read error: Operation timed out)
23:21 🔗 BlueMax has joined #archiveteam-bs

irclogger-viewer