User:Mesoderm/sandbox
  • A good place to start is getting all high traffic bio/med articles to B/A quality.
start at Biology
staying in cat_list tree ... is_cat_descendant()
add_to_list() if (get_stats() > min_traffic);
next_article();


Superfund sites
  1. Start with major cities/bioregions/watersheds/etc. and use TOXMAP to generate maps of superfund and TRI sites for articles related to the area in question.
  2. Making articles for each of the sites/events on these maps
  • Usability and information architecture ... we've got a lot of content on Wikipedia, but it's poorly organized, and vast swaths of it are in terrible condition ... we need to start looking at the bigger picture, and consider how this information should be organized. Not just w/ simple things like "portals", but by changing the way we choose to break reality up into individual topics, how we link between them, and article structure, amongst other things. Right now, most of the focus seems to be on how to improve individual articles, without much rhyme or reason to the information architecture at the site or subject level.
  • Categories most commonly edited by user ...
  • User clusters -- which groups of people tend to edit similar articles
  • Sources most commonly used by user (publisher, url, etc.)
  • Avg. chars modified per edit & net (non-template, etc.) chars contributed to project
  • Geostats
  • Categories with poorest sourcing (few scholarly/reputable works, low sources/sentences ratio, etc.)
  • They are machine parse-able and we can make automated tools to analyze articles for their sourcing. This is an extremely useful feature both for improving the quality of the encyclopedia, and by making it more easily and deeply searchable by people who are using Wikipedia as a research tool.