The following discussion is an archived debate. Please do not modify it. To request review of this BRFA, please start a new section at Wikipedia:Bots/Noticeboard. The result of the discussion was  Approved.

New to bots on Wikipedia? Read these primers!

Operator: MichaelMaggs (talk · contribs · SUL · edit count · logs · page moves · block log · rights log · ANI search)

Time filed: 14:22, Friday, January 22, 2021 (UTC)

Function overview: (a) Add new short descriptions to organism articles. (b) Improve some existing moth short descriptions

Automatic, Supervised, or Manual: Automatic, after pre-review

Programming language(s): Pywikibot

Source code available: GitHub

Links to relevant discussions (where appropriate): WikiProject. Also noted on the WP short description page. Not a lot of interest, but there wasn't much for the moths task either, and that was entirely uncontentious.

Edit period(s): One time

Estimated number of pages affected: (a) 210,000 with relevant infobox; (b) 2000 moth articles

Namespace(s): Mainspace

Exclusion compliant: Yes

Already has a bot flag: Yes

Function details: ShortDescBot has successfully completed its addition of new short descriptions to all the moth articles. Next, I want to move on to categories of other organisms. This is a good bot task since non-technical short descriptions complying with WP:HOWTOSD can’t automatically be generated from the usual infoboxes, at least without expensive Lua calls.

Each bot run is based on a single category at some level in the tree that I can manually associate with a suitable common generic name. Sometimes that may be the same as the category name (Category:Butterflies --> "butterfly"), but often not (Category:Poaceae --> "grass" or Category:Onychophorans --> "velvet worm"). The bot then constructs and adds new short descriptions such as "Species of butterfly", "Genus of velvet worms", "Family of grasses" and so on. The text is deliberately simple so that a low error rate (<1%) can be maintained while minimising the number of non-standard articles that the bot has to skip as 'too difficult to parse'. For each category the procedure is:

  1. With the bot in trial mode, write the proposed descriptions to a local spreadsheet; review and repeat until the error rate is sufficiently low
  2. Manually remove from the list any obvious classes of article that the bot will not realistically be able to handle [not had to do this so far in testing]
  3. Re-run the bot in edit mode, making live changes only to the pages in the final corrected list.

The bot won't change existing short descriptions, with one small exception. A new feature this time is the inclusion of "Extinct ..." in the bot-created description of extinct organism articles, and also "Single-species .." in Monotypic genus articles (where that can be done without making the text too long). 2000 or so moth short descriptions of the form "Genus of moths" etc can be improved.

You can see a sample of suggested edits from a variety of categories at User:MichaelMaggs/ShortDesc.

Discussion[edit]

Approved for trial (50 edits). Please provide a link to the relevant contributions and/or diffs when the trial is complete. Primefac (talk) 16:10, 22 January 2021 (UTC)[reply]

Trial complete. The results look good, I think, although I did notice that in a few cases such as Phomatosphaeropsis and Pecoramyces the bot used "Genus of fungi" rather than "Single-species genus of fungi" which would have been better. So far I've not been adding "Single-species ..." to genus articles solely on the basis that the article is in a monotypic-specific category, as categorisations can very often be wrong. But in practice, monotypic categorisation seems to be done carefully, by specialists, and I suspect that using the name of the category will pick up a few more instances that can't be parsed from the lead: things like Wollemia, for example, where the fact that the genus is monotypic is well-hidden in the body of the article but can easily be seen from the category. I'll do that from now on. MichaelMaggs (talk) 15:35, 23 January 2021 (UTC)[reply]

((BAG assistance needed)) Hi, hope it's OK for me to request follow-up, as I haven't heard anything for just over a week now. MichaelMaggs (talk) 17:48, 30 January 2021 (UTC)[reply]

((BAG assistance needed)) Any chance of a conclusion please? I posted the results of the trial two weeks ago so there's been more than ample time for community comments. MichaelMaggs (talk) 09:35, 6 February 2021 (UTC)[reply]

Peter, the question of whether short descriptions should include family details was discussed at some length here before I requested bot approval. The conclusion was they should not, partly because in many fields species are constantly being moved between families and it would require long-term efforts to keep the descriptions up to date. Editors sometimes add them manually, which may in particular cases be OK, but something so complicated and contentious isn't really appropriate for a bot. MichaelMaggs (talk) 11:18, 8 February 2021 (UTC)[reply]

The Earwig Futher discussions here have persuaded me that it's best to avoid both "Single-species" and "Monotypic", and I've changed the bot so that it doesn't make use of either. Are you OK to approve the bot please? MichaelMaggs (talk) 09:48, 12 February 2021 (UTC)[reply]

Agree with abandoning this terminology.  Approved. — The Earwig ⟨talk⟩ 05:21, 14 February 2021 (UTC)[reply]
The above discussion is preserved as an archive of the debate. Please do not modify it. To request review of this BRFA, please start a new section at Wikipedia:Bots/Noticeboard.