Difference between revisions of "OpenOffice.org Internship/Projects/2010/Statistical Data Analysis Tool"

From Apache OpenOffice Wiki
Jump to: navigation, search
(Tasks Completed)
Line 40: Line 40:
 
*F-test two-sample for variances
 
*F-test two-sample for variances
 
*Z test- Two samples for means
 
*Z test- Two samples for means
 +
*Implementation of data validation and further testing of the methods already added
 +
*First steps to the analysis methods with charting
 +
  
 
==Current tasks==
 
==Current tasks==
 
Currently working on adding statistical functions to the developed extension. Familiarization with the statistical method and the development is going together with a set of statistical functions have been decided.
 
Currently working on adding statistical functions to the developed extension. Familiarization with the statistical method and the development is going together with a set of statistical functions have been decided.
*Implementation of data validation and further testing of the methods already added--''In progress''
+
*Histogram analysis with chart-''In progress''
*First steps to the analysis methods with charting
+
  
 
==Project status==
 
==Project status==

Revision as of 00:31, 10 September 2010

Overview

Statistical models are widely used in analysis of different types of data currently.Statistical analysis of data acts a major role in decision making under uncertainty and it is widely used in surveys, research,business and science. Hence it is very useful to have a statistical data analysis tool in a data manipulating application like Calc. The main aspect of the statistical data analysis tool project is to provide the user ability of performing various statistical functions to analyze data in openoffice calc in a very user friendly manner.

Project plan

  • Developing a basic calc extension for data analysis--Done
  • Determining the different analysis methods to be included in the tool --In progress
  • Getting familiar with each analysis method and collecting information about --In progress
    • How the data input should be given to the method
    • Other external user given parameters required
    • How the output should be displayed
  • Developing each analysis method under the following steps -- In progress
    • User interface design considering the input and other parameters required
    • Developing the functionality of the method
    • Displaying the output of the analysis
    • Integrating the developed method with the analysis tool ( extension)

Enhancements

  • Addition of more useful analysis methods
  • User interface enhancements
  • Translating extension to other languages

Documentation

  • Statistical data analysis tool documentation (updating the wiki etc.)
  • Code documentation

Tasks Completed

  • The basic calc extension development has been completed together with the UI structure and the development structure of the extension
  • The extension has been reviewed by my mentor and suggested necessary improvements
  • Correlation analysis
  • Covariance analysis
  • Rank and percentile analysis
  • T test
    • Paired t test for means
    • t test assuming equal variances
    • t test assuming unequal variances
  • ANOVA test
    • One-way ANOVA
    • Two-way ANOVA with replications
    • Two-way ANOVA without replications
  • F-test two-sample for variances
  • Z test- Two samples for means
  • Implementation of data validation and further testing of the methods already added
  • First steps to the analysis methods with charting


Current tasks

Currently working on adding statistical functions to the developed extension. Familiarization with the statistical method and the development is going together with a set of statistical functions have been decided.

  • Histogram analysis with chart-In progress

Project status

  • The project is accepted for the OpenOffice summer internship program 2010
  • The project is proceeding with adding new analysis methods
Personal tools