Calc/To-Dos/Statistical Data Analysis Tool

From Apache OpenOffice Wiki
< Calc‎ | To-Dos
Revision as of 00:41, 24 June 2006 by Kohei (Talk | contribs)

Jump to: navigation, search

The purpose of this page is to outline the requirements of Calc's Statistical Data Analysis Tool (working name), which is yet to be developed by someone with willingness and skills. Feel free to contribute to this page what features you want to see in this tool, how such tool could be developed, or anything else that you need to add.

Goal

Large amount of single or multivariate data require convenient functions to analyze them. There are countless statistical methods which accomplish this. Some of them are already integrated, but many of them are still missing. In most of the cases they produce a one or more values out of one or many vectors of numbers (or categorical variables) which describe those vectors or their relationship between them. The aim is a collection of methods which are easy to use.

Overview single and multivariate data analysis

just some thoughts ....

Graphics

  • histogram
  • density
  • bubble
  • Box and Whiskers Plot
  • Bland and Altman Plot

Models

linear

non-linear

multivariate response variables

  • multivariate tests

Patterns

Cluster analysis

Desired Features

What features do users need as statistical data analysis tools?

  • analysis of one-dimensional data
  • Deviation from the median
  • Variation Ratio
  • Range
  • Pearson r²
  • analysis of multidimensional data
  • graphical representation

Task Breakdown

User Input and Output

How this application needs to be structured...

Third Party Library Integration

R

R can be used as a backend statistical analysis engine. Since we can't ship R with OO.o due to licensing incompatibility (it's released under GPL), the location of its executable or shared library needs to be specified by the user so that OO.o can locate it at run-time.

Personal tools