From Fedora Project Wiki


Revision as of 20:43, 24 June 2009

Please add ideas you have for Fedora data mining use cases on the talk page. Those ideas will be compiled into this list.

This page is also used as a notepad for the feasibility of each use case.

General

  • Determine "activity" as a boolean based on wiki edits, translations, mailing list posts, and SCM commits (CVS, git, or whatever else), then determine how many accounts are active as a history over time (graph)
  • Determine which types of "activity" count as "talk" and which count as "action"
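
As a rough sketch of the "activity as a boolean" idea, the snippet below counts how many accounts had at least one event in each month. The event tuples, the source names, and the choice of a calendar month as the activity window are all assumptions made for illustration; real data would have to be collected from each service first.

```python
from collections import defaultdict
from datetime import date

# Hypothetical event records: (account, source, day).
# Real data would come from the wiki, mailing lists, SCM logs, and so on.
events = [
    ("alice", "wiki", date(2009, 5, 3)),
    ("alice", "git", date(2009, 6, 10)),
    ("bob", "mailing-list", date(2009, 6, 12)),
    ("carol", "translation", date(2009, 6, 20)),
]


def active_accounts_by_month(events):
    """Map (year, month) to the set of accounts with at least one event."""
    active = defaultdict(set)
    for account, _source, day in events:
        active[(day.year, day.month)].add(account)
    return dict(active)


def active_counts(events):
    """Return sorted ((year, month), count) pairs, ready for plotting."""
    by_month = active_accounts_by_month(events)
    return [(month, len(accounts)) for month, accounts in sorted(by_month.items())]


print(active_counts(events))
```

An account is "active" in a month if it appears in that month's set, so the boolean falls out of a simple membership test. Splitting "talk" from "action" could reuse the same structure by filtering on the source field.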

Fedora Accounts System

  • History over time of account registrations and signed CLAs
  • History over time of number of members in each group
  • History over time of involvement of people from $COMPANY
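
A "history over time" series can be sketched as a running total per month. The registration dates below are made up, and how they would actually be exported from the accounts system is left open here.

```python
from collections import Counter
from datetime import date
from itertools import accumulate

# Hypothetical registration dates, one per account.
registrations = [
    date(2009, 1, 5),
    date(2009, 1, 20),
    date(2009, 3, 2),
    date(2009, 3, 15),
    date(2009, 3, 30),
]


def cumulative_by_month(dates):
    """Return sorted ((year, month), running_total) pairs."""
    per_month = Counter((d.year, d.month) for d in dates)
    months = sorted(per_month)
    totals = accumulate(per_month[m] for m in months)
    return list(zip(months, totals))


print(cumulative_by_month(registrations))
```

Months without any registration are simply absent from the result, so a plot would need to fill those gaps. The same function would work for signed CLAs or group memberships, given a list of the relevant dates.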

pkgdb/koji

Updates system (bodhi)

  • Update submitters
  • Feedback submitters
  • Most updated packages
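
The three lists above are all frequency counts over update records. A minimal sketch, assuming the updates have already been exported as (package, submitter) pairs; the records and the export format are hypothetical:

```python
from collections import Counter

# Hypothetical update records: (package, submitter).
updates = [
    ("kernel", "alice"),
    ("firefox", "bob"),
    ("kernel", "carol"),
    ("kernel", "alice"),
    ("firefox", "alice"),
]


def most_updated(updates, n=10):
    """Return the n packages with the most updates, as (package, count) pairs."""
    return Counter(package for package, _submitter in updates).most_common(n)


def top_submitters(updates, n=10):
    """Return the n accounts that submitted the most updates."""
    return Counter(submitter for _package, submitter in updates).most_common(n)


print(most_updated(updates, 2))
print(top_submitters(updates, 1))
```

Feedback submitters could be counted the same way from a list of feedback records.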

Mailing lists

Wiki

  • Wiki edits and other actions (page moves, etc.)
  • People who actually use edit summaries
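
To find the people who actually use edit summaries, one option is the share of each user's edits that carry a non-empty summary. This is only a sketch: the (user, summary) pairs are invented, and treating a whitespace-only summary as empty is an assumption.

```python
from collections import defaultdict

# Hypothetical edit records: (user, edit summary).
edits = [
    ("alice", "fix typo"),
    ("alice", ""),
    ("bob", ""),
    ("bob", "  "),
    ("carol", "add category"),
]


def summary_rate(edits):
    """Map each user to the fraction of their edits with a non-empty summary."""
    total = defaultdict(int)
    with_summary = defaultdict(int)
    for user, summary in edits:
        total[user] += 1
        if summary.strip():  # whitespace-only summaries count as empty
            with_summary[user] += 1
    return {user: with_summary[user] / total[user] for user in total}


print(summary_rate(edits))
```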

Fedora Hosted

  • Commits and committers

Non-fedorahosted.org SCMs

Red Hat Bugzilla

Mirrormanager

IRC meetings

Nagios, Zenoss, and other fun infrastructure things

Actual package contents (repoquery)