[00:08] *** enowaldo has quit IRC (Read error: Operation timed out) [00:11] *** enowaldo has joined #archiveteam-bs [00:21] *** enowaldo has quit IRC (Ping timeout: 265 seconds) [00:22] *** enowaldo has joined #archiveteam-bs [00:33] *** enowaldo has quit IRC (Ping timeout: 265 seconds) [00:33] *** enowaldo has joined #archiveteam-bs [01:10] *** enowaldo has quit IRC (Read error: Operation timed out) [01:12] *** enowaldo has joined #archiveteam-bs [01:42] *** ATrescue2 has joined #archiveteam-bs [01:43] *** enowaldo has quit IRC (Read error: Operation timed out) [01:43] Ok, this one is going to be tricky: http://www.seniorcitizens.9f.com/ [01:44] 9f.com used to be operated by bizhosting.com, but it seems that they no longer support that domain. The nameservers are dead. [01:45] But their web servers still serve the site alright if you send the appropriate Host header: curl -H 'Host: www.seniorcitizens.9f.com' http://64.136.20.35/ [01:45] *** ATrescue3 has quit IRC (Ping timeout: 252 seconds) [01:46] *** enowaldo has joined #archiveteam-bs [01:46] Of course that can be done without a problem, but how to do it properly so there is no illusion that the 9f.com domain still exists? [01:46] SketchCow: ^ Any advice for situations like this? [01:47] Not the first time this comes up of course (expired or banned domains etc.), but I don't think we ever came up with a good solution. [01:50] *** enowaldo has quit IRC (Ping timeout: 265 seconds) [01:51] *** enowaldo has joined #archiveteam-bs [01:52] JAA: grab-site, upload into IA and then just stick around with it until nobody cares anymore [01:53] Yeah [01:54] Can you put metadata in it to indicate as such? [01:54] Even a .txt. [01:55] Yeah, or item description. [01:55] Alright, I'll go with that then. 
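For reference, the curl trick above (talk to the IP directly, but ask for the dead domain in the Host header) can be sketched in Python with only the standard library. This is a minimal sketch, not what grab-site does internally; the helper names `build_pinned_request` and `fetch` are hypothetical, and it assumes plain HTTP, since HTTPS would additionally involve SNI and certificate name checks.

```python
import urllib.request


def build_pinned_request(ip, hostname, path="/"):
    """Build a GET request that connects to `ip` but asks for `hostname`.

    Hypothetical helper for illustration. The URL carries the IP, so no DNS
    lookup for `hostname` is ever attempted; the explicit Host header tells
    the web server which virtual host to serve.
    """
    url = f"http://{ip}{path}"
    # urllib only fills in a Host header itself when none was supplied,
    # so the value given here is what goes on the wire.
    return urllib.request.Request(url, headers={"Host": hostname})


def fetch(ip, hostname, path="/", timeout=30):
    """Send the pinned request and return (status code, body bytes)."""
    req = build_pinned_request(ip, hostname, path)
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return resp.status, resp.read()
```

Calling `fetch("64.136.20.35", "www.seniorcitizens.9f.com")` would be the equivalent of the curl command quoted above. Any capture made this way still records the original hostname, which is why a note in the metadata about the hard-resolved address matters.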
[01:58] *** enowaldo has quit IRC (Read error: Operation timed out) [02:09] *** omglolba- has joined #archiveteam-bs [02:10] *** balrog has quit IRC (Quit: Bye) [02:10] *** omglolbah has quit IRC (Ping timeout: 615 seconds) [02:13] *** enowaldo has joined #archiveteam-bs [02:13] One of you mailed me with a request to move Fusl's stuff to an archiveteam_fire since grab-site's a different thing [02:13] Unless that breaks someone's booty, I'm fine with that [02:13] *** balrog has joined #archiveteam-bs [02:14] yup, JAA did reach out to you and I'm fine with that [02:15] I'm to take it relatively easy over the next week or two, post operation. [02:15] Then I go to the Netherlands [02:15] So I'm now stepping through things [02:16] My big push this week is the sorting hat, which is now a terror on the archive's shitpiles [02:17] *** HashbangI has quit IRC (Remote host closed the connection) [02:17] *** SilSte has quit IRC (Read error: Operation timed out) [02:18] I don't think anyone even noticed that those items are in there. [02:18] Also, relax and get well soon! [02:19] *** SilSte has joined #archiveteam-bs [02:19] *** enowaldo has quit IRC (Read error: Operation timed out) [02:20] Does anyone have the ability to search the justin.tv video archives by channel name? The one video ID I do have doesn't show up in the search that's available (Looking for db_high channel's old videos, I'm desertbus's archivist) [02:24] *** enowaldo has joined #archiveteam-bs [02:29] *** enowaldo has quit IRC (Ping timeout: 252 seconds) [02:30] *** enowaldo has joined #archiveteam-bs [02:31] *** HashbangI has joined #archiveteam-bs [02:36] *** enowaldo has quit IRC (Read error: Operation timed out) [02:46] *** enowaldo has joined #archiveteam-bs [02:49] "Your upload of www.seniorcitizens.9f.com_20190514 ... appears to be spam." Oof. 
[02:59] JAA: propose that the next warc spec add a field to indicate a DNS override [03:00] marked: I don't have the patience to argue about this for half a year until it maybe gets included in the spec draft. [03:01] But I did include a header field in the WARC explaining the situation. The WARC spec already allows you to add any headers as necessary. [03:01] (I went with: "Host-Note: 9f.com nameservers are down, but bizhosting.com's web servers still serve the sites; hard-resolved www.seniorcitizens.9f.com and seniorcitizens.9f.com to 64.136.20.37") [03:02] *** enowaldo has quit IRC (Read error: Operation timed out) [03:04] maybe those future drafters will want to capture what's being done in practice. the need is at least clear already [03:09] *** enowaldo has joined #archiveteam-bs [03:17] *** marked is now known as marked1 [03:18] *** enowaldo has quit IRC (Read error: Operation timed out) [03:26] *** enowaldo has joined #archiveteam-bs [03:43] *** qw3rty111 has joined #archiveteam-bs [03:49] *** qw3rty119 has quit IRC (Read error: Operation timed out) [03:58] *** odemgi_ has joined #archiveteam-bs [04:00] *** enowaldo has quit IRC (Read error: Operation timed out) [04:01] *** odemgi has quit IRC (Ping timeout: 252 seconds) [04:01] *** enowaldo has joined #archiveteam-bs [04:36] *** Despatche has quit IRC (Quit: Read error: Connection reset by deer) [04:40] *** enowaldo has quit IRC (Read error: Operation timed out) [04:40] *** enowaldo has joined #archiveteam-bs [04:50] *** enowaldo has quit IRC (Ping timeout: 268 seconds) [05:02] *** fuzzy8021 has quit IRC (Read error: Connection reset by peer) [05:02] *** fuzzy8021 has joined #archiveteam-bs [05:05] *** Hani111 has joined #archiveteam-bs [05:13] *** enowaldo has joined #archiveteam-bs [05:14] *** Hani has quit IRC (Ping timeout: 615 seconds) [05:14] *** Hani111 is now known as Hani [05:19] *** enowaldo has quit IRC (Read error: Operation timed out) [05:59] *** Zerote_ has joined #archiveteam-bs [06:03] *** 
Despatche has joined #archiveteam-bs [06:15] JAA: that's a decent solution imo [06:15] not very machinable but in the absence of a consensus ... [06:15] idk [06:15] dns override has been a sticking point for us since forever [06:22] There could be a header for lines in the hosts file format. But that would only work for the simple cases. It breaks down already when you resolve all subdomains to a given IP, for example, since there is no such thing as "127.0.0.1 *.example.net" in hosts files. [06:34] *** wyatt8750 has quit IRC (Read error: Connection reset by peer) [07:11] *** Zerote_ has quit IRC (Ping timeout: 252 seconds) [07:13] *** Despatche has quit IRC (Quit: Read error: Connection reset by deer) [07:21] *** enowaldo has joined #archiveteam-bs [07:22] *** Despatche has joined #archiveteam-bs [07:29] *** Zerote_ has joined #archiveteam-bs [07:49] *** wyatt8740 has joined #archiveteam-bs [08:14] *** wyatt8740 has quit IRC (Read error: Operation timed out) [09:18] *** deevious has quit IRC (Quit: deevious) [10:33] *** deevious has joined #archiveteam-bs [11:12] *** enowaldo has quit IRC (Read error: Connection reset by peer) [11:12] *** enowaldo has joined #archiveteam-bs [11:21] *** enowaldo has quit IRC (Remote host closed the connection) [11:22] *** enowaldo has joined #archiveteam-bs [12:02] *** Zerote__ has joined #archiveteam-bs [12:05] *** Zerote_ has quit IRC (Ping timeout: 252 seconds) [12:06] *** enowaldo has quit IRC (Read error: Operation timed out) [12:08] *** enowaldo has joined #archiveteam-bs [12:39] hosts 2.0 [12:49] *** deevious has quit IRC (Quit: deevious) [13:07] *** enowaldo has quit IRC (Read error: Operation timed out) [13:13] *** enowaldo has joined #archiveteam-bs [13:23] *** enowaldo has quit IRC (Ping timeout: 265 seconds) [13:33] *** enowaldo has joined #archiveteam-bs [13:41] *** voltagex has left [13:44] *** deevious has joined #archiveteam-bs [13:52] *** Zerote__ has quit IRC (Ping timeout: 252 seconds) [14:01] *** SketchCow has 
quit IRC (Ping timeout: 252 seconds) [14:15] *** SketchCow has joined #archiveteam-bs [14:15] *** Fusl sets mode: +o SketchCow [14:30] *** Zerote__ has joined #archiveteam-bs [14:32] JAA: I would say cover everything, split the EU election into 28 local election pages and one page for the EU-wide parties [14:36] jut: Seems reasonable. We might want to grab the national party websites as well since there'll be a lot of campaigning on there, but that's easy to integrate. [14:38] So we'd have the following pages: [14:38] - ArchiveBot/2019 European Union parliamentary elections -- Overview linking to everything else [14:38] - ArchiveBot/European Union political parties -- List of all parties on the EU level [14:38] - ArchiveBot/2019 European Union parliamentary elections/Germany -- Anything local relating to the elections [14:38] And ArchiveBot/German political parties can be added to the third one if we get that far. [14:38] s/added to/linked from/ [14:39] Not sure if "European Union political parties" is correct. Is there an adjective for the EU? [14:40] Pan-European [14:40] JAA: it sounds as correct as you're likely to get to me [14:40] jut: That's not only EU. [14:41] otherwise you'd have European Unionian, which is a bit ridiculous :) [14:41] Yeah [14:42] Wikipedia lists just "European", meh. [14:43] In Lithuania we have a thing called "visuomeniniai rinkimu komitetai", basically temporary parties formed just for a single election. Where shall I put them? (5 of them are running in this election) [14:44] i've heard that sort of thing called a "coalition" in english [14:44] I'd put those on ArchiveBot/2019 European Union parliamentary elections/Lithuania. ArchiveBot/Lithuanian political parties is for permanent parties I'd say. [14:44] JAA: normally I'd just go for "EU political parties", except for titles. It's short enough and correct enough. [14:44] astrid: are they formed for the election, or post-election? 
[14:45] if post election, then they work together (in theory) for the term, definitely coalition. [14:45] coalitions are when a couple of existing small political parties team up to win an election [14:45] astrid: that could be the same, in the places where I know how it works the reverse happens: they team up after the election. [14:45] * astrid nod [14:45] but, it's probably a close enough or completely correct term. [14:46] coalitions can also form in parliament if three small parties don't have enough votes to do anything unless they cooperate [14:46] and they tend to be short lived [14:47] The thing I am describing is a thing formed prior to an election basically a very small party the English translation is: [14:47] Public Election Committees [14:48] hm interesting [14:48] I think that's mostly a meaningless phrase in english :) [14:48] i.e. without context [14:48] no i've heard of election committees [14:49] in my experience they tend to be ... yeah, a couple of candidates who share offices [14:49] i guess that's a tiny party [14:49] according to wikipedia, it's a thing in Hong Kong. [14:49] * astrid -> afk [14:50] I think "temporary coalition" might work. But I'm just bikeshedding now and should do some work instead. [14:50] I came across structures like this yesterday while looking into the Philippine elections. They're called "party lists" there. [14:50] oh that rings a bell [14:50] oh, because it's a different thing in the place I vote :) [14:51] (it's the ordered list of members from a party who will become MPs by default if they form government, as opposed to those who become MPs because they're voted in as local representatives.) [14:51] IIRC [14:52] but, maybe different things with the same name in different places. 
[14:52] There are currently 87 Public Election Committees registered with 5991 members, an average of just 68 members [14:53] per committee [14:53] Many of them will probably be difficult to find or don't really have a large online presence anyway. [14:54] * eythian wonders if "temporary coalition" is a good term for it. [14:54] * eythian goes back to real work though [14:55] It really depends, the 5 of them that are running in the ES election should have a reasonable presence [14:55] *EU [14:55] true [14:57] By the way, I'd use the local term on the ArchiveBot/2019 European Union parliamentary elections/Country page plus perhaps an English translation. [15:01] *** phiresky1 has joined #archiveteam-bs [15:02] *** phiresky has quit IRC (Ping timeout: 265 seconds) [15:02] btw if you guys slam the transfer instance, I am pretty sure it will be quite unstable since aws seems to have provisioned that vm on their most loaded node. So ¯\_(ツ)_/¯ [15:03] kiska: We'll make good use of it. :-P [15:03] Oh, another wording question: "EU parliamentary elections" or "... election"? [15:03] Yep, just keep an eye on grafana [15:04] Also it's one election right? Or multiple? [15:04] I like how the Wikipedia page is "2019 European Parliament election" and the text begins with "The next elections to the European Parliament are ...". [15:04] 404: Consistency not found [15:05] Well, I think you could see it as either one giant election or an individual election in each country, I guess. [15:06] Or you could say that a person being elected to the parliament is "an election", in which case it would be plural already in each country. [15:06] ¯\_(ツ)_/¯ [15:12] I guess the answer depends on your political opinions [15:15] For the love of god what the fuck is happening? Why the hell is there a public committee for Rolandas Paksas, he has a party, Tvarka ir teisingumas. [15:16] They are debating against each other, how, why, did I blink? 
[15:17] Oh he left the party 9 months ago, just like he left the presidency youtube.com/watch?v=6zXDo4dL7SU [15:20] What are the correct adjectives for Cyprus, Czech Republic, Luxembourg, and the UK? Cypriot, Czech (Republican?), Luxembourgian, not the slightest idea? [15:21] As in "German political parties" [15:21] Cypriot, Czech are correct [15:22] There's an impressive number of ways how you can get the UK one wrong. lol [15:23] So i will find the VRK online presence and also the online presence of the leader since they are mostly one and the same [15:24] *VRK visuomeniniai rinkimu komitetai [15:24] I'm beginning to think we should've named those pages "Political parties from Germany" etc. instead. That would've been much easier. [15:25] Looks like it's Luxembourgish. [15:26] I never thought that grammar would be a problem :) [15:26] :-P [15:32] JAA: is that grabsite job done already or have you killed it? [15:32] By the way, please keep the country pages organised similarly to the party pages, i.e. one section for each party plus subsections if the list is too long (many subdomains, social media accounts, whatever). [15:32] Fusl_: Looks like it's done. :-) [15:33] that didnt grab very much then i guess [15:33] It's not a big site, just an annoying one that banned AB after a few dozen requests. [15:33] No matter what delay settings, UA, etc. [15:34] 3k responses, 92 MiB [15:50] [15:51] Does that ! need to be escaped? [15:51] Nope [15:51] As long as the section name doesn't contain " -->", it's all fine. [15:52] No actually, just " -->". [15:56] https://www.archiveteam.org/index.php?title=ArchiveBot/2019_European_Union_parliamentary_elections/Lithuania [15:56] is this page ok? [15:57] Yep, looks good, apart from maybe the [[Category:English]]; is there really primarily English content on those sites? 
[15:57] Oh, no [15:59] Okay I will redo the Lithuanian political parties and do the Latvian parties [16:28] https://lsddp.org/portfolios/youtube-video/ :) [16:29] Yeah our parties haven't figured this internet thing out yet [16:37] :-) [16:55] Latvian have a party whose name is literally "The state belongs to who?" [16:56] *Latvians [17:01] Surely there is someone somewhere whose election slogan is "I am the senate". [17:03] Well our candidate whose slogan was "Lithuania is here" was beaten by a candidate from the Polish Russian alliance take from that what you will [17:07] Do you often come across some interesting/crazy shit when doing this? [17:08] Well, the elections I've been covering so far were all in languages I don't even remotely understand, so that never really came up. [17:09] *** wp494 has quit IRC (Ping timeout: 604 seconds) [17:10] Yeah be glad, looking at Lithuanian politics has been interesting™ [17:10] Latvians also have some crazy stuff [17:10] *** wp494 has joined #archiveteam-bs [17:11] JAA: What languages do you speak/read? [17:14] jut: German and English. I understand a decent amount of French and can usually decipher enough of other close Latin languages (i.e. Italian, Spanish) to more or less understand what it's about. [17:15] You? 
[17:16] Lithuanian, English, can read and understand Russian, and with effort some Latvian [17:39] *** PurpleSym has quit IRC (Ping timeout: 252 seconds) [17:41] *** PurpleSym has joined #archiveteam-bs [18:44] *** enowaldo has quit IRC (Read error: Operation timed out) [18:46] *** Hani has quit IRC (Ping timeout: 615 seconds) [18:55] *** Hani has joined #archiveteam-bs [19:37] *** Despatche has quit IRC (Quit: Read error: Connection reset by deer) [19:38] *** JH88 has joined #archiveteam-bs [19:45] *** Despatche has joined #archiveteam-bs [20:35] *** cynthia has quit IRC (Quit: WeeChat 1.5) [22:56] *** Jens has quit IRC (Remote host closed the connection) [22:57] *** Jens has joined #archiveteam-bs [22:59] *** BlueMax has joined #archiveteam-bs [23:11] *** tapos has joined #archiveteam-bs [23:14] *** tapos has quit IRC (Client Quit) [23:17] *** icedice has quit IRC (Quit: Leaving) [23:35] *** Silvan has joined #archiveteam-bs [23:36] *** SilSte has quit IRC (Ping timeout: 615 seconds) [23:54] *** asdf0101 has quit IRC (The Lounge - https://thelounge.chat) [23:58] *** asdf0101 has joined #archiveteam-bs