Wikidata:Requests for permissions/Bot/BorkedBot 8
The following discussion is closed. Please do not modify it. Subsequent comments should be made in a new section. A summary of the conclusions reached follows.
- Approved --Ymblanter (talk) 21:45, 2 November 2022 (UTC)
BorkedBot 8
BorkedBot (talk • contribs • new items • new lexemes • SUL • Block log • User rights log • User rights • xtools)
Operator: BrokenSegue (talk • contribs • logs)
Task/s: Update official website (P856) to HTTPS when appropriate
Code: Not yet ready; I would like approval first. Hopefully my long track record of running bots allows for that.
Function details:
- For all items with official website (P856) where there is exactly one best value and it uses HTTP (not HTTPS):
- Fetch the URL.
- If it redirects to the exact same URL but with HTTPS, then add a new official website (P856) statement with the HTTPS URL and mark it as preferred (copying over any qualifiers from the original best value).
--BrokenSegue (talk) 20:29, 11 October 2022 (UTC)
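The redirect check described under Function details could be sketched roughly as follows. This is only an illustration, since the bot's code was not published at the time of the request: the function names `is_https_upgrade` and `final_url_after_redirects` are hypothetical, and the choices to treat an empty path as `/`, to compare host names case-insensitively, and to ignore fragments are assumptions that the proposal does not spell out.

```python
from urllib.parse import urlsplit
from urllib.request import urlopen


def is_https_upgrade(original_url: str, final_url: str) -> bool:
    """Return True if final_url is original_url with only http -> https changed.

    Assumptions (not stated in the proposal): an empty path equals "/",
    host names compare case-insensitively, and fragments are ignored.
    """
    orig = urlsplit(original_url)
    final = urlsplit(final_url)
    # urlsplit lowercases the scheme for us.
    if orig.scheme != "http" or final.scheme != "https":
        return False
    same_host = orig.netloc.lower() == final.netloc.lower()
    same_path = (orig.path or "/") == (final.path or "/")
    same_query = orig.query == final.query
    return same_host and same_path and same_query


def final_url_after_redirects(url: str, timeout: float = 10.0) -> str:
    """Fetch url (following redirects) and return the URL that was finally served."""
    with urlopen(url, timeout=timeout) as response:
        return response.geturl()


if __name__ == "__main__":
    # Offline examples; no network access is needed for the comparison itself.
    print(is_https_upgrade("http://example.org/", "https://example.org/"))
    print(is_https_upgrade("http://example.org/", "https://www.example.org/"))
```

Keeping the comparison separate from the fetch means the rule "same URL, only the scheme differs" can be tested without network access. A redirect to a different host (for example one that adds `www.`) is rejected by this sketch, which matches the "exact same URL" wording above.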
- Support Dealing with HTTPS is not always straightforward, but your algorithm sounds sound. Infrastruktur (talk) 19:16, 17 October 2022 (UTC)
- Can't we just replace http with https instead of adding a new statement? Otherwise it looks as if there are two websites when it is actually the same website. It is like recording "they changed protocol from HTTP 1.0 to HTTP 1.1, then to SSL, then to TLS 1.0, then to TLS 1.2": information that no one is interested in. --Lockal (talk) 03:51, 19 October 2022 (UTC)
- @Lockal: Going by the letter of the rules, it seems that technically we should add a new statement. I do agree that it's a bit clunky. Maybe it's OK just to modify in place. BrokenSegue (talk) 18:18, 19 October 2022 (UTC)
- Support I think it should just modify in place. Can it also add language of work or name (P407) based on meta tags in the HTML if the original statement lacks it? For those which remain HTTP, it is possible that the website has been converted to a "domain for sale" page. I hope the bot will find such cases! Midleading (talk) 13:49, 2 November 2022 (UTC)
- Identifying that the domain has been taken over for some other use (or is a landing page) is very hard. Identifying language is possible (I've previously investigated using machine learning on the language text). BrokenSegue (talk) 17:56, 2 November 2022 (UTC)
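To make the two options discussed above concrete (adding a new preferred statement versus modifying the existing one in place), here is a minimal sketch on a simplified in-memory model. The dictionaries below are an assumption made for illustration and are not the Wikibase data model or any bot framework's API; `to_https`, `add_preferred_statement` and `modify_in_place` are hypothetical helper names. The case where the original best value already has preferred rank is deliberately left out.

```python
import copy


def to_https(url: str) -> str:
    """Rewrite an http:// URL to https://, leaving the rest untouched."""
    if not url.startswith("http://"):
        raise ValueError(f"not an HTTP URL: {url!r}")
    return "https://" + url[len("http://"):]


def add_preferred_statement(statements: list, index: int) -> list:
    """Option 1: keep the HTTP statement and add a preferred HTTPS one.

    Qualifiers are copied from the original statement, as in the proposal.
    """
    result = copy.deepcopy(statements)
    original = result[index]
    result.append({
        "value": to_https(original["value"]),
        "rank": "preferred",
        "qualifiers": copy.deepcopy(original["qualifiers"]),
    })
    return result


def modify_in_place(statements: list, index: int) -> list:
    """Option 2: change the URL of the existing statement; rank and qualifiers stay."""
    result = copy.deepcopy(statements)
    result[index]["value"] = to_https(result[index]["value"])
    return result


if __name__ == "__main__":
    # Simplified stand-in for one item's official website (P856) statements.
    p856 = [{"value": "http://example.org/", "rank": "normal",
             "qualifiers": {"P407": ["Q1860"]}}]
    print(add_preferred_statement(p856, 0))
    print(modify_in_place(p856, 0))
```

Option 1 leaves two statements on the item, which is the clutter Lockal objects to; option 2 leaves one statement but does not record that the URL ever used HTTP.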