Freie wissenschaftliche Software
List of free statistical software
Open Source & Public Domain Packages with Source Code
For the following packages, source code availability has been checked:
RStudio, written by JJ Allaire, Joe Cheng, Josh Paulson and Paul DiCristina, integrates the comprehensive state-of-the-art statistical package R with a superb user interface, available both as desktop application and as a browser-based server application.
We are impressed especially by the web version of RStudio, which seems to be a great opportunity for research advisors and IT-departments to bring custom R-applications to their intranets without much hand coding. We think that RStudio server has the potential of becoming a very popular linux research application. Anyone interested in supporting a demo server project (we are currently testing this, hardening the rstudio server behind an Apache proxy against attacks is the main point to be considered), please contact Tobias Gabele (tobi at mathsim dot com).
SciGraphica, developed by Adrian E.
Feiguin, is a scientific application for data analysis and technical graphics. It has similarities
with Sigmaplot and pretends to be a clone of the popular commercial (and expensive) application
"Microcal Origin". It fully supplies plotting features for 2D, 3D and polar charts. The aim is to
obtain a fully-featured, cross-plattform, user-friendly, self-growing scientific application. It is
free and open-source, released under the GPL license.
- You can plot functions and manipulate data in worksheets.
- You can open several worksheets and plots and work with them interactively and at the same
- The plots are fully configurable using a control panel dialog.
- The look and feel is completely WYSIWYG.
- Production/Publication quality PostScript output.
- You can interact with the plots double-clicking, dragging and moving objects with the
- Native XML file format.
- You can insert Python expressions in the worksheets.
- Terminal with command-line Python interface for interacting with plots and worksheets
It is completely programmed in C from scratch, using the GTK+ and GtkExtra libraries, and
released under the GPL agreement.
Data manipulation and fitting features are in the roadmap.
Binaries are currently available for several Linux platforms.
R: a programming language and environment for
statistical computing and graphics. Similar to S or S-plus (will run most S code unchanged).
Available for Windows, various Unix flavors
(including Linux), NextStep and Mac. Provides a wide variety of statistical (linear and nonlinear
modelling, classical statistical tests, time-series analysis, classification, clustering, ...) and
graphical techniques, and is highly extensible. Well-designed publication-quality plots can be
produced, including mathematical symbols and formulae where needed.
Don't miss the new RStudio integrated development environment for R!
by Ricco Rakotomalala, University of Lyon, is an open source DATA MINING Windows-software for
academic and research purposes. It proposes several data mining methods from exploratory data
analysis, statistical learning, machine learning and databases area.
gretl: Gnu Regression, Econometrics and
Time-series Library, is a cross-platform software package for econometric analysis, written in the
C programming language. It is is free software. You may redistribute it and/or modify it under the
terms of the GNU General Public License (GPL) as
published by the Free Software Foundation.
- Easy intuitive interface (now in French, Italian, Spanish and Polish as well as English)
- A wide variety of least-squares based estimators, including two-stage least squares and
nonlinear least squares
- Single commands to launch things like augmented Dickey-Fuller test, Chow test for structural
stability, Vector Autoregressions, ARMA estimation
- Output models as LaTeX files, in tabular or equation format
- Integrated scripting language: enter commands either via the gui or via script
- Command loop structure for Monte Carlo simulations and iterative estimation procedures
- GUI controller for fine-tuning Gnuplot graphs
- Link to GNU R for further data
ViSta: a Visual Statistics program for Windows,
Mac and Linux/Unix, developed by Prof. Dr. Forrest W. Young at the University of North
- Dynamic, High-Interaction, Multi-View Graphics: ViSta constructs very-high-interaction, dynamic
graphics that show you multiple views of your data simultaneously. The graphics are designed to
augment your visual intuition so that you can better understand your data.
- See What Your Data Have To Say: ViSta's visually intuitive and computationally intensive
approach to statistical data analysis is designed to clarify the meaning of data so that you can
see what your data have to say.
- Freeware/Open Software: ViSta is free and open. It can be downloaded from the web.
- Platforms: ViSta runs under Windows, on Macintosh, and under Unix.
- Languages: ViSta is available in English, Français, and Español.
- Look and Feel:ViSta has a Structured Graphical Interface that includes
- WorkMaps that visually summarize your data analysis session
- SpreadPlots - a spreadsheet-like arrangement of linked, dynamic, interactive plots that show
you what your data or analysis seems to say
- GuideMaps that guide your analysis
- DataSheets for inputing and editing your data
- Visual GuideMap Authoring for teachers who wish to create guidemaps
- Reports present data statistics and analysis results in the classical way
- Context-Sensitive Help and Web-based Help and Notes
- Statistical Capabilities: ViSta performs Univariate and Multivariate Statistical Visualization
and Data Analysis.
- Applets, Plugins, Scripts and DataPrograms:
- Applets: Teachers can develop ViSta Web Applets for demonstrations, examples, test problems,
etc. Students can download and run these on their own machines (example of class notes using an
- Scripts: Teachers can write ViSta scripts using the same techniques as for Applets, but keep
them privately on their machines. A grading script is an example.
- DataPrograms:Teachers and students can write dataprograms to manipulate their data.
- Plugins:Programmers can develop plugins to add entirely new data analysis and visualization
- Developer's Tools: ViSta provides access to the underlying development languages, including:
- ViDAL, ViSta's Data Analysis Language for writing applets, scripts and data programs; and
- XLispStat, an object-oriented programming language which can be used to write ViSta plugins for
statistical computing and dynamic graphics.
- XLisp, a free and open Lisp system satisfying most of the Common Lisp standards.
- C++ and FORTRAN may be used for developing specialized new features.
In addition, ViSta has developers tools including a byte-code compiler, a stepper,
tracing and back tracing, object and bitmap editors, distribution maker, etc...
MacAnova developed by
Gary Oehlert and Christopher Bingham, University of Minnesota, is an interactive statistical
analysis program with focus on analysis of variance and related models, matrix algebra, time series
analysis (time and frequency domain), and (to a lesser extent) uni- and multi-variate exploratory
statistics. It is available for Macintosh, DOS/Windows and Unix/Linux.
MacAnova has a functional/command oriented interface. The Macintosh and Windows versions also have
several window/menu/mouse type features but retain the basic interface. Although the language and
syntax are S-like (similar to S, S-Plus or R), MacAnova is not S or R.
Extensive documentation for MacAnova is available.
Bayesian inference using Gibbs sampling, open source version of BUGS origianlly developed by
Alastair Stevens, Cambridge University (and many other contributors). It is a widely used software
package for the Bayesian analysis of complex statistical models using Markov chain Monte Carlo
The OpenBUGS project is opening out the BUGS project into versions that run on Windows and Linux,
as well as inside the R statistical package (BRugs). They have released an open-source version of
the core BUGS code for multiple versions of this program
Octave: a high-level mathematical programming
language (free MATLAB clone) for numerical computations -- solving common numerical linear algebra
problems, finding the roots of nonlinear equations, integrating ordinary functions, manipulating
polynomials, and integrating ordinary differential and differential-algebraic equations. It is
easily extensible and customizable via user-defined functions written in Matlab syntax, or using
dynamically loaded modules written in C++, C, Fortran, or other languages. Available for Linux/Unix
Free statistical Toolboxes for Matlab at the [ StatLib ] website.
Scilab: a scientific software package for numerical
computations in a user-friendly environment. Available for Windows, Mac and Unix computers, this is
a sophisticated programming language with a MatLab-like syntax, hundreds of built-in functions and
libraries, 3-d graphics, and symbolic capabilities through a Maple interface.
PSPP, a free SPSS clone. PSPP is a program for statistical analysis of sampled data. It interprets
commands in the SPSS language and produces tabular output in ASCII, HTML, or PostScript
format. The name PSPP is not intended as an acronym, but it is easy to invent expansions for it. In
previous releases, PSPP was known as Fiasco. PSPP
development is ongoing. It already supports a large subset of SPSS's transformation language. Its
statistical procedure support is limited but growing. A manual is available.
formerly known as OS3 and OS2, is a general stats package for Windows and Linux, developed by Bill
Miller of Iowa State University, with an SPSS-like user interface. A Manual in PDF-Format is
OpenStat was originally written as an aid to students and teachers for introductory and advanced
statistics courses, primarily in the social sciences. To insure that students and professionals,
particularly in developing countries with very limited funding resources, have access to a package
that completes nearly all of the analyses required in statistics courses, the package contains a
wide variety of procedures for data analysis:
- A spreadsheet-type of user interface in which variable labels are created (columns of a grid),
and observed values are inserted into cells of rows for each case.
- Descriptive statistics including central tendency and variability indexes, x-y plots, frequency
- Pearson product-moment correlations and partial correlations.
- Analyses of variance including 1, 2 and 3 way ANOVAs, mixed design ANOVAs, Latin Square ANOVAs,
Mulivariate ANOVA and Analysis of Covariance.
- Multivariate analyses including Hierarchical, Factor Analysis, Canonical Correlation, and a
least-squares General Linear Model procedure.
- Non-Parametric analyses.
- Measurement programs.
- Financial programs.
- Simulation programs.
- A Neural Network program.
- A matrix manipulation program.
JMulti, developed at the HU-Berlin by Alexander
Benkwitz, Markus Krätzig and many contributors, is a graphical Java software for advanced
econometric time series analysis.
Incomplete list of Features:
- Initial Analysis
- various tools for creating, transforming, editing time series
- Unit Root tests: ADF, HEGY (quarterly, monthly), Schmidt-Phillips, KPSS, Unit Root test with
- Cointegration tests: Johansen Cointegration test with response surfaces, Saikkonen &
- VAR (can be used for univariate modelling as well)
- VAR modelling (with arbitrary deterministic/exogenous variables)
- Impulse Responses with bootstrapped confidence intervals also for accumulated responses,
orthogonal and forecast error versions
- Forecast Error Variance Decomposition
- SVAR modelling: AB model, Blanchard-Qua Model with bootstrapped standard errors
- SVAR Forecast Error Variance Decomposition
- VECM modelling (with arbitrary deterministic/exogenous variables)
- restrictions on cointegration space, Wald test for beta restrictions
- automatic model selection (various strategies based on information criteria)
- residual analysis with tests for nonnormality, autocorrelation, ARCH, spectrum, kernel density,
autocorrelation plots, crosscorrelation
- Impulse Responses with bootstrapped confidence intervals also for accumulated responses,
orthogonal and forecast error versions
- SVEC modelling with bootstrapped standard errors
- GARCH Analysis
- univariate ARCH, GARCH, T-GARCH estimation with different error distributions
- multivariate GARCH(1,1) estimation, residual analysis, plotting of variance process together
with univariate estimates, kernel density for residuals
- Smooth Transition Regression
- STR model specification with exogenous/deterministic variables
- Nonparametric Analysis
- lag selection for univariate models based on linear and nonlinear selection criteria
- nonlinear estimation with configurable 3D plots
- estimation of volatility process
- ARIMA Analysis with fixed regressors (univariate)
JMulti Binaries for Windows and Linux as well as the
sourcecode and documentation are available for download under the GPL license.
fityk by Marcin Wojdyr is a free
GPL-licensed peak fitting program for Linux, Windows and MacOS X. For optimization it has builtin
algorithms for Levenberg-Marquard gradient-based method, Nelder-Mead downhill simplex method and
Genetic Algorithms. Every of these methods has a set of adjustable parameters, for greater
Dap: a statistics and graphics
package developed by Susan Bassein for Unix and Linux systems, with commonly-needed data
management, analysis, and graphics (univariate statistics, correlations and regression, ANOVA,
categorical data analysis, logistic regression, and nonparametric analyses). Provides some of the
core functionality of SAS, and is able to read and run many (but not all) SAS program files.
Dap is freely distributed under a GNU-style license.
[ source: http://StatPages.net ]
PINT is a program for
Power analysis IN Two-level designs (for determination of standard errors and
optimal sample sizes in multilevel designs with 2 levels). It was written by Tom Snijders, Roel
Bosker, and Henk Guldemond. The newest (Windows) version is 2.11 (April 2003). This program
calculates approximate standard errors for estimates of fixed effect parameters in hierarchical
linear models with two levels.
(2002) Software for Scientific Visualization, Statistical Analysis, and
Non-Linear Modeling. Freeware for Unix, VMS, Linux, Windows.
Collection of Free SPSS Syntax
Files by Raynald Levesque, provides hundereds of SPSS syntax files, macros,
scripts and tutorials.
Packages distributed free of charge, not open source (or license model not yet classified)
For the following software, full source code availability has not been verified, although it is
likely that some of the authors might provide the source upon request. Please contact the authors
if you need the source code.
The Statistical Lab, developed by Members
of Center for Digital Systems (CeDiS) at FU-Berlin is an easy to use Windows package designed
primarily for teaching statistics. It uses the powerful R engine as its backend and is an
explorative and interactive tool designed both to support education in statistics and provide a
tool for the simulation and solution of statistical problems.
The graphical user interface is designed to make complex statistical relations easy to understand.
It connects and displays data frames, frequency tables, random numbers or matrixes in a
user-friendly statistical worksheet allowing users to run calculations, conduct analyses and
perform multiple simulations and manipulations.
MYSTAT 12, developed by
SYSTAT is essentially a limited version of SYSTAT 12 for Windows, available free of charge for
academic users (students and teachers). While some of the more advanced analysis capabilities of
SYSTAT 12 are missing, it appears to be one of the best and most professionally designed free
statistics software packages currently available. It comes with the full set of SYSTAT 12 PDF
documentation. MYSTAT does at most accept 100 variables (Columns) with no limit on the cases
(Rows). If you import a larger data file, only the first 100 variables will be read. Teachers who
have Systat 12 installed can install a 'Mystat Theme' for Systat to make Systat look and behave
exactly like Mystat.
Bayesian inference using Gibbs sampling, developed by Alastair Stevens, Cambridge University (and
many other contributors), is a widely used software package for the Bayesian analysis of complex
statistical models using Markov chain Monte Carlo (MCMC) methods.
The stand-alone WinBUGS 1.4.1 package for Windows has a graphical user interface and on-line
monitoring and convergence diagnostics. Users are asked to register to receive a `key' which
provides full functionality (this is free).
The OpenBUGS project is opening out the
BUGS project into versions that run on Windows and Linux, as well as inside the R statistical
package (BRugs). They have released an open-source version of the core BUGS code for multiple
versions of this program
WinIDAMS is a software package for
the validation, manipulation and statistical analysis of data, developed by the UNESCO Secretariat
in co-operation with experts from various countries. It is distributed free-of-charge upon
To get an Impression of WinIDAMS, please see the
- modern graphical user interface and on-line Reference Manual,
- possibility to customize the environment for an application,
- facilities for editing/creating data files and data description files,
- interactive data import/export,
- editor for creating/updating files with instructions for program execution,
- viewer for displaying and quick navigation through results,
- advanced text editing facilities,
- facilities for sorting and merging files, data editing, checking of codes and consistencies,
correcting, listing, subsetting, aggregating, merging and transforming data, including construction
of new variables,
- wide range of data analysis techniques such as: table building, regression analysis, one-way
analysis of variance, discriminant analysis, cluster analysis, principal components factor analysis
and analysis of correspondences, partial order scoring, rank ordering of alternatives, segmentation
and iterative typology,
- interactive components for construction of multidimensional tables and their graphical
presentation, for graphical exploration of data and for times series analysis.
document and the
[ WinIDAMS Reference
CSPro: Census and Survey Processing
System (CSPro) is a questionnaire-oriented statistical package for Windows. CSPro is developed by
MEASURE partners, the U.S. Bureau of the Census, ORC Macro's MEASURE DHS+ project, and SerPro S.A..
and has been funded by the Center for Population, Health & Nutrition of the United States
Agency for International Development. The software combines and expands upon the capabilities of
both ISSA and IMPS. It takes advantage of the power and flexibility of both of these programs, but
adds the friendliness, ease of use, and intuitive nature of Windows. CSPro provides a more visual
approach to the creation and manipulation of data and reduces the programming needs. This easy to
use package facilitates defining data structures, developing applications, entering and checking
data, generating reports, and producing tabulations. More advanced users, including computer
programmers, can access the full CSPro language to perform complicated tasks.
CSPro consists of modules for Data Entry, Batch Editing and Processing, and Cross Tabulation, as
well as Designer modules for creating Data Dictionaries and Data Entry Forms, and for writing
applications. It also includes a set of tools, including Text Viewer, Table Viewer, Map Viewer,
Table Retrieval, Tabulate Frequencies, Sort Data, Export Data, Reformat Data, Compare Data, Convert
Dictionary, and Convert Shape to Map.
The package supports easy creation of questionnaire forms and spatial representation of
A detailed list of capabilities is included in the Getting Started Guide
Free registration is required for download access.
VisiCube: a graphical tool for manual
explorative data analysis. Freeware available for Windows.
- Exploration paradigm which facilitates the actual activity of exploration.
- High-quality graphic images.
- Powerful subsetting capabilities.
- Multi-plot visuals for direct side-by-side comparisons.
- Continuous and instant visual feedback.
- Simplified interface for ease of use.
- Appropriate visualization techniques only.
- Collaboration through completely portable and independent analysis projects.
- Multi-dimensional data model for accurate modeling of your phenomena.
- No mathematical expertise required.
- It's free.
IRRISTAT: developed by the International Rice Research
Institute (IRRI), is a computer program for data management and basic statistical analysis of
experimental data. It can be run in any 32-bit Windows operating system.
IRRISTAT has been developed primarily for the analysis of data from agricultural field trials,
but many of the features can be used for analysis of data from other sources.
The main modules and facilities are:
- Data management with a spreadsheet
- Text editor
- Summary Statistics and Scatterplot Graphics
- Analysis of Variance
- Regression and Correlation
- Single Site Analysis of Plant Breeding Variety Trials
- Cross site and AMMI anaysis
- Pattern analysis of GxE Interaction
- Quantitative trait loci analysis
- Randomization and layout of experimental designs
- Display of linear forms for general factorial EMS
- Generation of coefficients for orthogonal polynomials
IRRISTAT is available for free download or by purchasing a CD at minimal cost.
Mac Diff, developed by Rainer Petschick, is a free peak-fitting program for analysis and
display of X-ray powder diffractogrammes on Apple Macintosh platforms. It is designed as a
complement to the XRD applications distributed by Philips or Siemens and it is equipped with a GUI
to facilitate user-friendly and interactive processing of XRD profiles. It offers conventional XRD
profile-correction processes (e.g. outliers, smoothing, divergence, alpha2-stripping) and supports
qualtiative and quantitative phase analysis. Moreover, by its programmable peak analysis (recording
of intensities, peak areas, half widths, etc.) it offers an automated analysis of entire series of
diffractogrammes. By means of peak fits the contributions of coinciding lines can be calculated.
The user has at his disposal the capability of continuous measurement checks and the possibility to
perform corrections as well (e.g. by manual or computed background adaptations). For peak
indication the user has the option to employ variable peak data sets. High resolution hardcopies of
the diffractogrammes can be produced. The standard single-scan formats (Philips-ADP or some
Siemens-RAW, ASCII) are identified automatically. Data and plots can be exported to most of the
available table-processing or vector-graphics programmes. The applications of MacDiff range from
routine analysis in sedimentology, especially clay minerals, over analysis of X-ray
diffractogrammes of all types of rocks and minerals all the way to phase analysis of various
Instat+: a visual
general statistics package for Windows, developed by the Statistical Services Centre at the
University of Reading. The full version of Instat+ may be downloaded and used for non-commercial
purposes by any individual free of charge. Instat+ has a familiar interface, similar to other
statistics packages. It includes many special facilities for the processing of climatic data.
statistical add-in for Excel , developed by the Statistical Services Centre at the University of
Reading. The full version of SSC-Stat may be downloaded and used for non-commercial purposes by any
individual free of charge. SSC-Stat is designed to strengthen those areas where the spreadsheet
package is already strong, principally in the areas of data management, graphics and descriptive
AM: a Windows stats package developed by the
American Institutes for Research, Washington/DC.
AM is a statistical software package for analyzing data from complex samples, especially
largescale assessments such as the National Assessment of Educational Progress (NAEP) and the Third
International Mathematics and Science Studies (TIMSS).
From its origin as a specialized tool for analyzing large-scale assessment data, AM has evolved
into a more generalized and growing tool for analyzing data from complex samples in general.
Originally, AM was developed to estimate regression models through marginal maximum likelihood
(MML). Because large-scale assessments are often low-stakes assessments for students, students are
usually asked to respond to only a few items; each student sees only part of the whole test.
Otherwise, they would be unlikely to expend real effort on any items. As a result, individual test
scores are subject to substantial measurement error, which would bias many statistical estimates.
Rather than assign each student an error-filled score, MML procedures represent each students
proficiency as a probability distribution over all possible scores. MML procedures use these
probability distributions in the estimation process.
Another characteristic of large-scale assessments has led to a wider applicability of AM?they
almost always draw a sample from a complex design. AM automatically provides appropriate standard
errors for complex samples using a Taylor-series approximation. This happens automatically even
when new procedures are added to the software. Over time, the software has grown to offer a set of
non-MML statistics, including regression, probit, logit, cross-tabs, and other statistics that are
useful for survey data in general.
Info™ : a very large Windows package designed by CDC, Department of Health and
Human Service (Atlanta/Georgia/USA), primarily for epidemiological research. It offers basic
general statistics and spatial statistics/mapping capabilities.
Key Features of Epi Info™
With Epi Info™ and a personal computer, epidemiologists and other public health and medical
professionals can rapidly develop a questionnaire or form, customize the data entry process, and
enter and analyze data. Epidemiologic statistics, tables, graphs, and maps are produced with simple
commands such as READ, FREQ, LIST, TABLES, GRAPH, and MAP. Epi Map displays geographic maps with
data from Epi Info™.
A new version, Epi Info™ for Windows retains many features of the familiar Epi Info™
for DOS, while offering Windows ease of use strengths such as point-and-click commands, graphics,
fonts, and printing.
- Maximum compatibility with industry standards, including:
- Microsoft Access and other SQL and ODBC databases
- Visual Basic, Version 6
- World Wide Web browsers and HTML
- Extensibility, so that organizations outside CDC can produce additional modules
- Epi Report, a tool that allows the user to combine Analysis output, Enter data and any data
contained in Access or SQL Server and present it in a professional format. The generated reports
can be saved as HTML files for easy distribution or web publishing.
- Epi Map, an ArcView®-compatible GIS
- NutStat, a nutrition anthropometry program that calculates percentiles and z-scores using
either the 2000 CDC or the 1978 CDC/WHO growth reference
- Logistic regression and Kaplan-Meier survival analysis
- Data Compare does double data entry comparison
- Epi Lock password protects, encrypts, and compresses Epi Info™ data
- Teaching exercises
- Entirely new, not just a “port“of Epi Info™ for DOS
- Allows analysis and import of other file types
- Windows 98, ME, NT 4.0, 2000, or XP is required.
- 32 MB of Random Access Memory. More RAM: 64 MB for Windows 4.0 and 2000, 128 MB for Windows
- 200 megahertz processor is recommended - 300 for Windows XP.
- At least 260 megabytes of free hard disk space (Drive C) to install; 130 megabytes after
(2004) Statistical Software for Multivariate Analysis and Graphical Display.
Freeware download for Mac and Windows
written by Herman J. Bierens, Professor of econometrics at Penn State University, conducts various
econometric estimation and testing tasks on all 32bit Windows platforms (95/98/2000/ME/NT4/XP).
EasyReg is designed for use in empirical research, and for teaching econometrics. In the latter
case the user can choose his or her own econometrics level. EasyReg is almost self-explanatory:
there should be no need for a manual, provided that the user chooses the appropriate econometrics
level. It is free for non-commercial purposes. EasyReg does most of the econometrics tasks
available in competing commercial software, in particular:
and even more. Some of the listed software have additional features, though. EasyReg is a
point-and-click program, whereas some of the listed commercial software packages (in particular
RATS and TSP) are programmable.
- E-Views 3.1. This is the closest competitor of EasyReg.
Regress+: a free
nonlinear estimation / curve fitting / modeling program for Macintosh Computers, developed by
Michael P.McLaughlin. It has many built in deterministic and stochastic models and allows user
defined models with up to 10 parameters.
It has many advanced features such as
Extensive documentation is available in PDF format.
- choice of optimization criterion,
- robust goodness-of-fit testing
- bootstrap confidence intervals
Smith's Statistical Package
(SSP) Smith's Statistical Package (SSP) is a user-friendly statistics program, developed by
Prof. Gary N. Smith, Department of Economics, Pomona College Claremont/California. SSP is available
for Mac OSX/OS9 and Windows.
Ox : is an object-oriented matrix programming
language with a comprehensive mathematical and statistical function library, developed by Jurgen A.
Doornik. The Console (command line) versions may be used freely for academic research and teaching
purposes . Matrices can be used directly in expressions, for example to multiply two matrices, or
to invert a matrix. The major features of Ox are its speed, extensive library, and well-designed
syntax, which leads to programs which are easier to maintain. For a first impression of the matrix
and statistical function library see the Function summary. Versions of Ox are available for
Windows, Linux, several Unixes, and Mac OS-X.
Mx, developed by M. C. Neale (Department of
Psychiatry, MCV Virginia Commonwealth University), is a combination of a matrix algebra interpreter
and a numerical optimizer. It enables exploration of matrix algebra through a variety of operations
and functions. There are many built-in fit fuctions to enable structural equation
modelingand other types of statistical modeling of data. It offers the fitting fuctions found
in commercial software such as LISREL, LISCOMP, EQS and CALIS, along with facilities for maximum
likelihood estimation of parameters from missing data structures, under normal theory. Complex
'nonstandard' models are easy to specify. For further general applicability, it allows the user to
define their own fit functions, and optimization may be performed subject to linear and nonlinear
equality or boundary constraints. It is freely available with a graphical user interface for
Windows platforms or as a commandline/server version for Unix/Linux.
Mondrian, developed by Martin Theus, is a
statistical data-visualization system written in JAVA, running on Mac/Windows/Linux. The main
emphasis of Mondrian is on visualization techniques for Categorical Data , Geographical Data and
All plots in Mondrian are fully linked, and offer various interactions and queries. Any case
selected in a plot in Mondrian is highlighted in all other plots.
PAST, developed by Øyvind Hammer of
University of Oslo and Co-Workers David A.T. Harper and P.D. Ryan, is a free, easy-to-use data
analysis package aimed at paleontology. Inspired by PALSTAT, it includes common statistical,
plotting and modelling functions:
- A spreadsheet-type data entry form
- Graph, scatter, 3D scatter, histogram, box, ternary, survivorship, spindle, matrix and normal
- Curve fitting: Linear (Standard and Reduced Major Axis) with bootstrapping, lin-log
(exponential), log-log (allometric), logistic, von Bertalanffy, sum-of-sines, B-splines.
- F, t, permutation t, Chi-squared w. permutation test, Kolmogorov-Smirnov, Mann-Whitney,
Shapiro-Wilk, Spearman's Rho and Kendall's Tau tests, correlation, covariance, contingency tables,
one-way ANOVA, Kruskal-Wallis test, mixture analysis.
- Diversity indices with bootstrapping and permutation, rarefaction. Capture-recapture richness
estimators. Dice, Jaccard, Simpson and Raup-Crick similarity indices, Renyi diversity
- Abundance model fitting: Geometric, log-series and log-normal.
- Multivariate statistics: Principal Components (with Minimal Spanning Tree), Principal
Coordinates (14 distance measures), Non-metric Multidimensional Scaling (14 distance measures),
Detrended Correspondence Analysis, Cluster analysis (three algorithms, 13 distance measures,
two-way clustering), k-means clustering, seriation, discriminant analysis, one-way MANOVA, one-way
ANOSIM, one-way NPMANOVA, Hotelling's T2, paired Hotelling's T2, Mahalanobis-distance permutation,
Box's M, Canonical Variates Analysis, multivariate allometry with bootstrapping.
- Time series analysis: Spectral analysis, autocorrelation, cross-correlation, wavelet transform,
Walsh transform, runs test.
- Geometrical analysis: Directional statistics, rose plots, point distribution statistics,
Fourier shape analysis, elliptic Fourier shape analysis, eigenshapes, landmark analysis with
Procrustes fitting (2D and 3D), thin-plate spline transformation grids with expansions and
principal strains, partial warps and scores, relative warps and scores, centroid size from
landmarks, size removal by Burnaby's method.
- Parsimony analysis (cladistics): Exhaustive, branch-and-bound and heuristic algorithms, Wagner,
Fitch and Dollo characters. Bootstrap, strict and majority rule consensus trees. Consistency and
retention indices. Three stratigraphic congruency indices with permutation tests.
- Biostratigraphy with the methods of Unitary Associations, Ranking-Scaling (RASC) and
Constrained Optimization (CONOP). Confidence intervals on stratigraphic ranges.
- Gridding (spatial interpolation): Moving average, thin-plate spline and kriging with three
Included in the distribution are real data sets for educational use, together with extensive
documentation and case studies.
PAST has been tested under Windows 95, 98, 2000, NT 4 and XP.
The Calcugator is a calculator, a plotting engine, and a programming
environment. It's also a computer program you may download and use for free. An online applet
with a restricted user interface is available too.
[source: Calgugator Website ]
- As a calculator it excels in simplicity of use and rivals programs like MATLAB. It is
equally suited for engineers as well as for high school students. It offers close to 200
functions/operators to perform real, integer, rational, complex, boolean, statistical, vector,
array and matrix computations. Both the input and output of the program are displayed on standard
windows which can be further edited, saved, merged, print-previewed and printed.
- As a plotting engine it allows rapid creation of 2D plots of functions as well as
polar and parametric displays. It also allows creating 3D plots of functions, contour plots and 3D
parametric displays. The Calcugator has also functions to create bar, pie, pareto and xy charts.
All plots can be configured using the mouse (zooming, panning, selecting). Titles and labels are
supported. However, the most useful characteristic is that all figures created by the Calcugator
can be exported into popular file formats or pasted into an editable window.
- As a programming environment it offers a computer language with identical syntax as
that of Java/C/C++. However, the user does not need to compile his/her programs. The language is
simple, compact and the error control is intelligent when displaying messages. The creation of
user-defined functions is natural and mimics the syntax used by a teacher on the blackboard.
[ this site ]
MicrOsiris: is a comprehensive statistical and
data management package for Windows developed and marketed by Van Eck Computer Consulting. It is
derived from OSIRIS IV, a statistical and data management package developed and used at the
University of Michigan and includes special techniques for data mining and analysis of nominal- and
ordinal-scaled data (MNA, MCA). MicrOsiris can readily accept data from SPSS, SAS, and Excel as
well as from other sources..
Of special interest is SEARCH, a binary segmentation command used to develop predictive models for
dependent variables. It is an elaboration of the programs developed at the Institute for Social
Research. SEARCH divides a sample, through a series of binary splits, into mutually exclusive
series of subgroups. The groups are chosen so that at each step the two new groups account for more
of the variance or distribution than any other pair of subgroups.
NORM, written by Joe
Schafer and Maren Olsen of the Department of Statistics, The Pennsylvania State University, is a
Windows Software for multiple imputation of multivariate continuous data under a normal
Xtremes, written by
a Prof. R.D. Reiss (University of Siegen/Germany) and a large team of contributors, is an
interactive software for general statistics and risk analysis.
Xtremes possesses graphics facilities, a facility to generate and load data, an arsenal of
diagnostic tools and statistical procedures, and a numerical part for Monte-Carlo simulations.
Xtremes is first of all designed especially for use with a mouse, yet it can also perform
excellently with just the keyboard.
The experienced user can employ the integrated Pascal-like programming language StatPascal to
implement simulation or estimation procedures not covered by the menu system. Moreover, the
UserFormula facility may be utilized when a single formula is required to plot a curve or to
generate or transform a data set.
The free academic Windows edition, included in the Statistical Analysis book, only allows
a certain limited number of rows in a data set.
It consists of Windows (95, 98, 2000, NT) applications
Facilities to download the academic edition may be found in the download
- a rich menu system
- the integrated StatPascal programming
- a Windows help system
- a HTML help system
- additional statistical procedures - e.g., for sum-stable distributions - utilizing recent
Xtremes is documented in the Statistical Analysis book and the
pertaining online help. The StatPascal Manual can be read online and is also provided in the file
spdoc.ps in the sp subdirectory of the Xtremes installation.
Major enhancements of Xtremes 3.0:
The menu system
- Plotting facilites and estimators for multivariate
- extreme value
- and POT models,
- in conjunction with facilities to compare parametric and empirical curves such as contour
- New time series tools
- Bayes and L-Moment estimators for univariate POT models
- Statistics for sum-stable distributions (using STABLE by DLL)
- Statistics in further parametric models (e.g. Student)
- Estimators for ARMA series
- Moving estimators in conjunction with VaR considerations
- Animation facilities for generating discrete and continuous data
- Improved support for printing
- Direct generation of EPS files
Xtremes is driven by StatPascal, a statistical programming language based on the Pascal
language. StatPascal can be executed within the Xtremes package using the SP button in the menu
toolbar. StatPascal supports
- most operations of standard Pascal
- use of most Xtremes menu options as functions
- vector and matrix extensions
IVEware Imputation and Variance
Estimation Software by T. E. Raghunathan, Peter W. Solenberger and John Van Hoewyk, Institute
for Social Research at the University of Michigan, is a set of C and FORTRAN routines that can be
launched from SAS or run independently as a commandline program using data from many sources.
IVEware is currently available for personal computers using the Microsoft Windows and Linux
operating systems and for UNIX workstations using the Sun Solaris operating system. The SAS
implementation requires SAS 6.12 or higher.
- Perform single or multiple imputations of missing values using the Sequential Regression
Imputation Method described in the article "A multivariate technique for multiply imputing missing
values using a sequence of regression models" by Raghunathan, Lepkowski, Van Hoewyk and Solenberger
(Survey Methodology, June 2001).
- Perform a variety of descriptive and model based analyses accounting for such complex design
features as clustering, stratification and weighting.
- Perform multiple imputation analyses for both descriptive and model-based survey
- BrightStat, developed by Dr. Daniel Stricker,
University of Bern, is a comprehensive Java-based statistics package running on the net. The aim
was to provide a freely accessible statistic tool with many capabilities, yet easy to use.
BrightStat is expanded continually.
Tests running on BrightStat so far:
- Binomial Test,
- kxl Chi-Square (likelihood Ratio, Phi, Cramer's V, Symmetric Measures),
- McNemar (McNemar exact, likelihood Ratio, Phi, Cramer's V, Symmetric Measures),
- Fisher's Exact Test,
- Mann-Whitney U,
- Kruskal-Wallis (paired comparisons for post-hoc analysis),
- Friedman (Conover Statistic and paired comparison for post-hoc analysis),
- Kolmogorov-Smirnov one sample (Uniform, Normal and Exponential, Lilliefors correction for test
- One-Sample T-test,
- Two independent Samples T-test (Levene test for homogeneity of variances),
- Two dependent Samples T-test (paired correlations),
- One-Way ANOVA (Duncan, Tukey HSD, Scheffe, Games-Howell),
- Repeated One-Way ANOVA (estimation of Epsilon, LowerBound, Greenhouse-Geisser,
- Pearson Bivariate Correlations,
- Spearman Rankcorrelations,
- Multiple Linear Regression (Correlationmatrix, Model-Fit, ANOVA-Table and Parameter Estimates,
Part and Partial Correlations, Collinearity Diagnostic, Change Statistics, Methods: Enter, Forward,
Backward and Step),
- One-Sample Variance Test,
- k-Sample Variance Test (Levene's Test for homogeneity of Variances).
BrightStat already comes along with many useful features for data handling and manipulation,
statistics and data managing, all packed in an easy to use interface. XML based data and output
structures simplify the future development of external viewers.
More lists of free statistical software
Reviews of free statistical software
Xiaoping Zhu and Ognjen Kuljaca
2005: A Short Preview of Free Statistical Software Packages for Teaching Statistics to
Industrial Technology Majors. Journal of Industrial Technology 21, 2, April 2005.
Review of WinBugs 1.4 by Harvey Goldstein, University of London
Comparison chart of
Statistical Packages, comparing capabilities of several free and commercial packages.
R for SAS and SPSS Users, a Book by Robert
A. Muenchen, Statistical Consulting Center at the University of Tennessee, is a guide primarily for
experienced SAS and SPSS users who want to migrate to the free R package. It covers detailed syntax
comparison of the free R package with the SAS and SPSS packages. An outdated free PDF-version can be
downloaded from the site, where book example programs, articles and comparison charts are also
Quick-R for SAS/SPSS/Stata Users, a
Website by Rob Kabacoff, is a guide to R with focus on migration from SAS, SPSS, STATA and
SYSTAT to R syntax
List of free statistical handbooks & online tools