Freie wissenschaftliche Software
List of free statistical software
Open Source & Public Domain Packages with Source Code
For the following packages, source code availability has been checked:

RStudio, written by JJ Allaire, Joe Cheng, Josh Paulson and Paul DiCristina, integrates the comprehensive stateoftheart statistical package R with a superb user interface, available both as desktop application and as a browserbased server application.
We are impressed especially by the web version of RStudio, which seems to be a great opportunity for research advisors and ITdepartments to bring custom Rapplications to their intranets without much hand coding. We think that RStudio server has the potential of becoming a very popular linux research application. Anyone interested in supporting a demo server project (we are currently testing this, hardening the rstudio server behind an Apache proxy against attacks is the main point to be considered), please contact Tobias Gabele (tobi at mathsim dot com).

SciGraphica, developed by Adrian E.
Feiguin, is a scientific application for data analysis and technical graphics. It has similarities
with Sigmaplot and pretends to be a clone of the popular commercial (and expensive) application
"Microcal Origin". It fully supplies plotting features for 2D, 3D and polar charts. The aim is to
obtain a fullyfeatured, crossplattform, userfriendly, selfgrowing scientific application. It is
free and opensource, released under the GPL license.
Main features:
 You can plot functions and manipulate data in worksheets.
 You can open several worksheets and plots and work with them interactively and at the same
time.
 The plots are fully configurable using a control panel dialog.
 The look and feel is completely WYSIWYG.
 Production/Publication quality PostScript output.
 You can interact with the plots doubleclicking, dragging and moving objects with the
mouse.
 Native XML file format.
 You can insert Python expressions in the worksheets.
 Terminal with commandline Python interface for interacting with plots and worksheets
It is completely programmed in C from scratch, using the GTK+ and GtkExtra libraries, and
released under the GPL agreement.
Data manipulation and fitting features are in the roadmap.
Binaries are currently available for several Linux platforms.

R: a programming language and environment for
statistical computing and graphics. Similar to S or Splus (will run most S code unchanged).
Available for Windows, various Unix flavors
(including Linux), NextStep and Mac. Provides a wide variety of statistical (linear and nonlinear
modelling, classical statistical tests, timeseries analysis, classification, clustering, ...) and
graphical techniques, and is highly extensible. Welldesigned publicationquality plots can be
produced, including mathematical symbols and formulae where needed.
Don't miss the new RStudio integrated development environment for R!

TANAGRA, written
by Ricco Rakotomalala, University of Lyon, is an open source DATA MINING Windowssoftware for
academic and research purposes. It proposes several data mining methods from exploratory data
analysis, statistical learning, machine learning and databases area.

gretl: Gnu Regression, Econometrics and
Timeseries Library, is a crossplatform software package for econometric analysis, written in the
C programming language. It is is free software. You may redistribute it and/or modify it under the
terms of the GNU General Public License (GPL) as
published by the Free Software Foundation.
Features:
 Easy intuitive interface (now in French, Italian, Spanish and Polish as well as English)
 A wide variety of leastsquares based estimators, including twostage least squares and
nonlinear least squares
 Single commands to launch things like augmented DickeyFuller test, Chow test for structural
stability, Vector Autoregressions, ARMA estimation
 Output models as LaTeX files, in tabular or equation format
 Integrated scripting language: enter commands either via the gui or via script
 Command loop structure for Monte Carlo simulations and iterative estimation procedures
 GUI controller for finetuning Gnuplot graphs
 Link to GNU R for further data
analysis

ViSta: a Visual Statistics program for Windows,
Mac and Linux/Unix, developed by Prof. Dr. Forrest W. Young at the University of North
Carolina.
Features:
 Dynamic, HighInteraction, MultiView Graphics: ViSta constructs veryhighinteraction, dynamic
graphics that show you multiple views of your data simultaneously. The graphics are designed to
augment your visual intuition so that you can better understand your data.
 See What Your Data Have To Say: ViSta's visually intuitive and computationally intensive
approach to statistical data analysis is designed to clarify the meaning of data so that you can
see what your data have to say.
 Freeware/Open Software: ViSta is free and open. It can be downloaded from the web.
 Platforms: ViSta runs under Windows, on Macintosh, and under Unix.
 Languages: ViSta is available in English, Français, and Español.
 Look and Feel:ViSta has a Structured Graphical Interface that includes
 WorkMaps that visually summarize your data analysis session
 SpreadPlots  a spreadsheetlike arrangement of linked, dynamic, interactive plots that show
you what your data or analysis seems to say
 GuideMaps that guide your analysis
 DataSheets for inputing and editing your data
 Visual GuideMap Authoring for teachers who wish to create guidemaps
 Reports present data statistics and analysis results in the classical way
 ContextSensitive Help and Webbased Help and Notes
 Statistical Capabilities: ViSta performs Univariate and Multivariate Statistical Visualization
and Data Analysis.
 Applets, Plugins, Scripts and DataPrograms:
 Applets: Teachers can develop ViSta Web Applets for demonstrations, examples, test problems,
etc. Students can download and run these on their own machines (example of class notes using an
applet).
 Scripts: Teachers can write ViSta scripts using the same techniques as for Applets, but keep
them privately on their machines. A grading script is an example.
 DataPrograms:Teachers and students can write dataprograms to manipulate their data.
 Plugins:Programmers can develop plugins to add entirely new data analysis and visualization
capabilities.
 Developer's Tools: ViSta provides access to the underlying development languages, including:
 ViDAL, ViSta's Data Analysis Language for writing applets, scripts and data programs; and
 XLispStat, an objectoriented programming language which can be used to write ViSta plugins for
statistical computing and dynamic graphics.
 XLisp, a free and open Lisp system satisfying most of the Common Lisp standards.
 C++ and FORTRAN may be used for developing specialized new features.
In addition, ViSta has developers tools including a bytecode compiler, a stepper,
tracing and back tracing, object and bitmap editors, distribution maker, etc...

MacAnova developed by
Gary Oehlert and Christopher Bingham, University of Minnesota, is an interactive statistical
analysis program with focus on analysis of variance and related models, matrix algebra, time series
analysis (time and frequency domain), and (to a lesser extent) uni and multivariate exploratory
statistics. It is available for Macintosh, DOS/Windows and Unix/Linux.
MacAnova has a functional/command oriented interface. The Macintosh and Windows versions also have
several window/menu/mouse type features but retain the basic interface. Although the language and
syntax are Slike (similar to S, SPlus or R), MacAnova is not S or R.
Extensive documentation for MacAnova is available.

OpenBUGS.
Bayesian inference using Gibbs sampling, open source version of BUGS origianlly developed by
Alastair Stevens, Cambridge University (and many other contributors). It is a widely used software
package for the Bayesian analysis of complex statistical models using Markov chain Monte Carlo
(MCMC) methods.
The OpenBUGS project is opening out the BUGS project into versions that run on Windows and Linux,
as well as inside the R statistical package (BRugs). They have released an opensource version of
the core BUGS code for multiple versions of this program

Octave: a highlevel mathematical programming
language (free MATLAB clone) for numerical computations  solving common numerical linear algebra
problems, finding the roots of nonlinear equations, integrating ordinary functions, manipulating
polynomials, and integrating ordinary differential and differentialalgebraic equations. It is
easily extensible and customizable via userdefined functions written in Matlab syntax, or using
dynamically loaded modules written in C++, C, Fortran, or other languages. Available for Linux/Unix
and Windows.

Free statistical Toolboxes for Matlab at the [ StatLib ] website.

Scilab: a scientific software package for numerical
computations in a userfriendly environment. Available for Windows, Mac and Unix computers, this is
a sophisticated programming language with a MatLablike syntax, hundreds of builtin functions and
libraries, 3d graphics, and symbolic capabilities through a Maple interface.

PSPP, a free SPSS clone. PSPP is a program for statistical analysis of sampled data. It interprets
commands in the SPSS language and produces tabular output in ASCII, HTML, or PostScript
format. The name PSPP is not intended as an acronym, but it is easy to invent expansions for it. In
previous releases, PSPP was known as Fiasco. PSPP
development is ongoing. It already supports a large subset of SPSS's transformation language. Its
statistical procedure support is limited but growing. A manual is available.

OpenStat (OS4):,
formerly known as OS3 and OS2, is a general stats package for Windows and Linux, developed by Bill
Miller of Iowa State University, with an SPSSlike user interface. A Manual in PDFFormat is
available.
OpenStat was originally written as an aid to students and teachers for introductory and advanced
statistics courses, primarily in the social sciences. To insure that students and professionals,
particularly in developing countries with very limited funding resources, have access to a package
that completes nearly all of the analyses required in statistics courses, the package contains a
wide variety of procedures for data analysis:
 A spreadsheettype of user interface in which variable labels are created (columns of a grid),
and observed values are inserted into cells of rows for each case.
 Descriptive statistics including central tendency and variability indexes, xy plots, frequency
distributions, etc.
 Pearson productmoment correlations and partial correlations.
 Analyses of variance including 1, 2 and 3 way ANOVAs, mixed design ANOVAs, Latin Square ANOVAs,
Mulivariate ANOVA and Analysis of Covariance.
 Multivariate analyses including Hierarchical, Factor Analysis, Canonical Correlation, and a
leastsquares General Linear Model procedure.
 NonParametric analyses.
 Measurement programs.
 Financial programs.
 Simulation programs.
 A Neural Network program.
 A matrix manipulation program.

JMulti, developed at the HUBerlin by Alexander
Benkwitz, Markus Krätzig and many contributors, is a graphical Java software for advanced
econometric time series analysis.
Incomplete list of Features:
 Initial Analysis
 various tools for creating, transforming, editing time series
 Unit Root tests: ADF, HEGY (quarterly, monthly), SchmidtPhillips, KPSS, Unit Root test with
structural break
 Cointegration tests: Johansen Cointegration test with response surfaces, Saikkonen &
Lütkepohl test
 VAR (can be used for univariate modelling as well)
 VAR modelling (with arbitrary deterministic/exogenous variables)
 Impulse Responses with bootstrapped confidence intervals also for accumulated responses,
orthogonal and forecast error versions
 Forecast Error Variance Decomposition
 SVAR modelling: AB model, BlanchardQua Model with bootstrapped standard errors
 SVAR Forecast Error Variance Decomposition
 VECM
 VECM modelling (with arbitrary deterministic/exogenous variables)
 restrictions on cointegration space, Wald test for beta restrictions
 automatic model selection (various strategies based on information criteria)
 residual analysis with tests for nonnormality, autocorrelation, ARCH, spectrum, kernel density,
autocorrelation plots, crosscorrelation
 Impulse Responses with bootstrapped confidence intervals also for accumulated responses,
orthogonal and forecast error versions
 SVEC modelling with bootstrapped standard errors
 GARCH Analysis
 univariate ARCH, GARCH, TGARCH estimation with different error distributions
 multivariate GARCH(1,1) estimation, residual analysis, plotting of variance process together
with univariate estimates, kernel density for residuals
 Smooth Transition Regression
 STR model specification with exogenous/deterministic variables
 Nonparametric Analysis
 lag selection for univariate models based on linear and nonlinear selection criteria
 nonlinear estimation with configurable 3D plots
 estimation of volatility process
 ARIMA Analysis with fixed regressors (univariate)
JMulti Binaries for Windows and Linux as well as the
sourcecode and documentation are available for download under the GPL license.

fityk by Marcin Wojdyr is a free
GPLlicensed peak fitting program for Linux, Windows and MacOS X. For optimization it has builtin
algorithms for LevenbergMarquard gradientbased method, NelderMead downhill simplex method and
Genetic Algorithms. Every of these methods has a set of adjustable parameters, for greater
flexibility.

Dap: a statistics and graphics
package developed by Susan Bassein for Unix and Linux systems, with commonlyneeded data
management, analysis, and graphics (univariate statistics, correlations and regression, ANOVA,
categorical data analysis, logistic regression, and nonparametric analyses). Provides some of the
core functionality of SAS, and is able to read and run many (but not all) SAS program files.
Dap is freely distributed under a GNUstyle license.
[ source: http://StatPages.net ]

PINT is a program for
Power analysis IN Twolevel designs (for determination of standard errors and
optimal sample sizes in multilevel designs with 2 levels). It was written by Tom Snijders, Roel
Bosker, and Henk Guldemond. The newest (Windows) version is 2.11 (April 2003). This program
calculates approximate standard errors for estimates of fixed effect parameters in hierarchical
linear models with two levels.

DATAPLOT
(2002) Software for Scientific Visualization, Statistical Analysis, and
NonLinear Modeling. Freeware for Unix, VMS, Linux, Windows.

Collection of Free SPSS Syntax
Files by Raynald Levesque, provides hundereds of SPSS syntax files, macros,
scripts and tutorials.
Packages distributed free of charge, not open source (or license model not yet classified)
For the following software, full source code availability has not been verified, although it is
likely that some of the authors might provide the source upon request. Please contact the authors
if you need the source code.

The Statistical Lab, developed by Members
of Center for Digital Systems (CeDiS) at FUBerlin is an easy to use Windows package designed
primarily for teaching statistics. It uses the powerful R engine as its backend and is an
explorative and interactive tool designed both to support education in statistics and provide a
tool for the simulation and solution of statistical problems.
The graphical user interface is designed to make complex statistical relations easy to understand.
It connects and displays data frames, frequency tables, random numbers or matrixes in a
userfriendly statistical worksheet allowing users to run calculations, conduct analyses and
perform multiple simulations and manipulations.

MYSTAT 12, developed by
SYSTAT is essentially a limited version of SYSTAT 12 for Windows, available free of charge for
academic users (students and teachers). While some of the more advanced analysis capabilities of
SYSTAT 12 are missing, it appears to be one of the best and most professionally designed free
statistics software packages currently available. It comes with the full set of SYSTAT 12 PDF
documentation. MYSTAT does at most accept 100 variables (Columns) with no limit on the cases
(Rows). If you import a larger data file, only the first 100 variables will be read. Teachers who
have Systat 12 installed can install a 'Mystat Theme' for Systat to make Systat look and behave
exactly like Mystat.

BUGS/WinBUGS.
Bayesian inference using Gibbs sampling, developed by Alastair Stevens, Cambridge University (and
many other contributors), is a widely used software package for the Bayesian analysis of complex
statistical models using Markov chain Monte Carlo (MCMC) methods.
The standalone WinBUGS 1.4.1 package for Windows has a graphical user interface and online
monitoring and convergence diagnostics. Users are asked to register to receive a `key' which
provides full functionality (this is free).
The OpenBUGS project is opening out the
BUGS project into versions that run on Windows and Linux, as well as inside the R statistical
package (BRugs). They have released an opensource version of the core BUGS code for multiple
versions of this program

WinIDAMS is a software package for
the validation, manipulation and statistical analysis of data, developed by the UNESCO Secretariat
in cooperation with experts from various countries. It is distributed freeofcharge upon
request.
It offers:
 modern graphical user interface and online Reference Manual,
 possibility to customize the environment for an application,
 facilities for editing/creating data files and data description files,
 interactive data import/export,
 editor for creating/updating files with instructions for program execution,
 viewer for displaying and quick navigation through results,
 advanced text editing facilities,
 facilities for sorting and merging files, data editing, checking of codes and consistencies,
correcting, listing, subsetting, aggregating, merging and transforming data, including construction
of new variables,
 wide range of data analysis techniques such as: table building, regression analysis, oneway
analysis of variance, discriminant analysis, cluster analysis, principal components factor analysis
and analysis of correspondences, partial order scoring, rank ordering of alternatives, segmentation
and iterative typology,
 interactive components for construction of multidimensional tables and their graphical
presentation, for graphical exploration of data and for times series analysis.
To get an Impression of WinIDAMS, please see the
[ Getting
Started ]
document and the
[ WinIDAMS Reference
Manual ]

CSPro: Census and Survey Processing
System (CSPro) is a questionnaireoriented statistical package for Windows. CSPro is developed by
MEASURE partners, the U.S. Bureau of the Census, ORC Macro's MEASURE DHS+ project, and SerPro S.A..
and has been funded by the Center for Population, Health & Nutrition of the United States
Agency for International Development. The software combines and expands upon the capabilities of
both ISSA and IMPS. It takes advantage of the power and flexibility of both of these programs, but
adds the friendliness, ease of use, and intuitive nature of Windows. CSPro provides a more visual
approach to the creation and manipulation of data and reduces the programming needs. This easy to
use package facilitates defining data structures, developing applications, entering and checking
data, generating reports, and producing tabulations. More advanced users, including computer
programmers, can access the full CSPro language to perform complicated tasks.
CSPro consists of modules for Data Entry, Batch Editing and Processing, and Cross Tabulation, as
well as Designer modules for creating Data Dictionaries and Data Entry Forms, and for writing
applications. It also includes a set of tools, including Text Viewer, Table Viewer, Map Viewer,
Table Retrieval, Tabulate Frequencies, Sort Data, Export Data, Reformat Data, Compare Data, Convert
Dictionary, and Convert Shape to Map.
The package supports easy creation of questionnaire forms and spatial representation of
Data.
A detailed list of capabilities is included in the Getting Started Guide
Free registration is required for download access.

VisiCube: a graphical tool for manual
explorative data analysis. Freeware available for Windows.
Features:
 Exploration paradigm which facilitates the actual activity of exploration.
 Highquality graphic images.
 Powerful subsetting capabilities.
 Multiplot visuals for direct sidebyside comparisons.
 Continuous and instant visual feedback.
 Simplified interface for ease of use.
 Appropriate visualization techniques only.
 Collaboration through completely portable and independent analysis projects.
 Multidimensional data model for accurate modeling of your phenomena.
 No mathematical expertise required.
 It's free.

IRRISTAT: developed by the International Rice Research
Institute (IRRI), is a computer program for data management and basic statistical analysis of
experimental data. It can be run in any 32bit Windows operating system.
IRRISTAT has been developed primarily for the analysis of data from agricultural field trials,
but many of the features can be used for analysis of data from other sources.
The main modules and facilities are:
 Data management with a spreadsheet
 Text editor
 Summary Statistics and Scatterplot Graphics
 Analysis of Variance
 Regression and Correlation
 Single Site Analysis of Plant Breeding Variety Trials
 Cross site and AMMI anaysis
 Pattern analysis of GxE Interaction
 Quantitative trait loci analysis
 Randomization and layout of experimental designs
 Display of linear forms for general factorial EMS
 Generation of coefficients for orthogonal polynomials
IRRISTAT is available for free download or by purchasing a CD at minimal cost.

Mac Diff, developed by Rainer Petschick, is a free peakfitting program for analysis and
display of Xray powder diffractogrammes on Apple Macintosh platforms. It is designed as a
complement to the XRD applications distributed by Philips or Siemens and it is equipped with a GUI
to facilitate userfriendly and interactive processing of XRD profiles. It offers conventional XRD
profilecorrection processes (e.g. outliers, smoothing, divergence, alpha2stripping) and supports
qualtiative and quantitative phase analysis. Moreover, by its programmable peak analysis (recording
of intensities, peak areas, half widths, etc.) it offers an automated analysis of entire series of
diffractogrammes. By means of peak fits the contributions of coinciding lines can be calculated.
The user has at his disposal the capability of continuous measurement checks and the possibility to
perform corrections as well (e.g. by manual or computed background adaptations). For peak
indication the user has the option to employ variable peak data sets. High resolution hardcopies of
the diffractogrammes can be produced. The standard singlescan formats (PhilipsADP or some
SiemensRAW, ASCII) are identified automatically. Data and plots can be exported to most of the
available tableprocessing or vectorgraphics programmes. The applications of MacDiff range from
routine analysis in sedimentology, especially clay minerals, over analysis of Xray
diffractogrammes of all types of rocks and minerals all the way to phase analysis of various
crystalline substances.

Instat+: a visual
general statistics package for Windows, developed by the Statistical Services Centre at the
University of Reading. The full version of Instat+ may be downloaded and used for noncommercial
purposes by any individual free of charge. Instat+ has a familiar interface, similar to other
statistics packages. It includes many special facilities for the processing of climatic data.

SSCStat: a
statistical addin for Excel , developed by the Statistical Services Centre at the University of
Reading. The full version of SSCStat may be downloaded and used for noncommercial purposes by any
individual free of charge. SSCStat is designed to strengthen those areas where the spreadsheet
package is already strong, principally in the areas of data management, graphics and descriptive
statistics.

AM: a Windows stats package developed by the
American Institutes for Research, Washington/DC.
AM is a statistical software package for analyzing data from complex samples, especially
largescale assessments such as the National Assessment of Educational Progress (NAEP) and the Third
International Mathematics and Science Studies (TIMSS).
From its origin as a specialized tool for analyzing largescale assessment data, AM has evolved
into a more generalized and growing tool for analyzing data from complex samples in general.
Originally, AM was developed to estimate regression models through marginal maximum likelihood
(MML). Because largescale assessments are often lowstakes assessments for students, students are
usually asked to respond to only a few items; each student sees only part of the whole test.
Otherwise, they would be unlikely to expend real effort on any items. As a result, individual test
scores are subject to substantial measurement error, which would bias many statistical estimates.
Rather than assign each student an errorfilled score, MML procedures represent each students
proficiency as a probability distribution over all possible scores. MML procedures use these
probability distributions in the estimation process.
Another characteristic of largescale assessments has led to a wider applicability of AM?they
almost always draw a sample from a complex design. AM automatically provides appropriate standard
errors for complex samples using a Taylorseries approximation. This happens automatically even
when new procedures are added to the software. Over time, the software has grown to offer a set of
nonMML statistics, including regression, probit, logit, crosstabs, and other statistics that are
useful for survey data in general.

Epi
Info™ : a very large Windows package designed by CDC, Department of Health and
Human Service (Atlanta/Georgia/USA), primarily for epidemiological research. It offers basic
general statistics and spatial statistics/mapping capabilities.
With Epi Info™ and a personal computer, epidemiologists and other public health and medical
professionals can rapidly develop a questionnaire or form, customize the data entry process, and
enter and analyze data. Epidemiologic statistics, tables, graphs, and maps are produced with simple
commands such as READ, FREQ, LIST, TABLES, GRAPH, and MAP. Epi Map displays geographic maps with
data from Epi Info™.
A new version, Epi Info™ for Windows retains many features of the familiar Epi Info™
for DOS, while offering Windows ease of use strengths such as pointandclick commands, graphics,
fonts, and printing.
Key Features of Epi Info™
 Maximum compatibility with industry standards, including:
 Microsoft Access and other SQL and ODBC databases
 Visual Basic, Version 6
 World Wide Web browsers and HTML
 Extensibility, so that organizations outside CDC can produce additional modules
 Epi Report, a tool that allows the user to combine Analysis output, Enter data and any data
contained in Access or SQL Server and present it in a professional format. The generated reports
can be saved as HTML files for easy distribution or web publishing.
 Epi Map, an ArcView®compatible GIS
 NutStat, a nutrition anthropometry program that calculates percentiles and zscores using
either the 2000 CDC or the 1978 CDC/WHO growth reference
 Logistic regression and KaplanMeier survival analysis
 Data Compare does double data entry comparison
 Epi Lock password protects, encrypts, and compresses Epi Info™ data
 Teaching exercises
 Entirely new, not just a “port“of Epi Info™ for DOS
 Allows analysis and import of other file types
System Requirements
 Windows 98, ME, NT 4.0, 2000, or XP is required.
 32 MB of Random Access Memory. More RAM: 64 MB for Windows 4.0 and 2000, 128 MB for Windows
XP.
 200 megahertz processor is recommended  300 for Windows XP.
 At least 260 megabytes of free hard disk space (Drive C) to install; 130 megabytes after
installation.
 ADE4
(2004) Statistical Software for Multivariate Analysis and Graphical Display.
Freeware download for Mac and Windows

EasyReg,
written by Herman J. Bierens, Professor of econometrics at Penn State University, conducts various
econometric estimation and testing tasks on all 32bit Windows platforms (95/98/2000/ME/NT4/XP).
EasyReg is designed for use in empirical research, and for teaching econometrics. In the latter
case the user can choose his or her own econometrics level. EasyReg is almost selfexplanatory:
there should be no need for a manual, provided that the user chooses the appropriate econometrics
level. It is free for noncommercial purposes. EasyReg does most of the econometrics tasks
available in competing commercial software, in particular:
 EViews 3.1. This is the closest competitor of EasyReg.
 GiveWinPcGivePcFiml
 Microfit
 Minitab
 RATS
 Shazam
 Stata
 TSP
and even more. Some of the listed software have additional features, though. EasyReg is a
pointandclick program, whereas some of the listed commercial software packages (in particular
RATS and TSP) are programmable.

Regress+: a free
nonlinear estimation / curve fitting / modeling program for Macintosh Computers, developed by
Michael P.McLaughlin. It has many built in deterministic and stochastic models and allows user
defined models with up to 10 parameters.
It has many advanced features such as
 choice of optimization criterion,
 robust goodnessoffit testing
 bootstrap confidence intervals
Extensive documentation is available in PDF format.

Smith's Statistical Package
(SSP) Smith's Statistical Package (SSP) is a userfriendly statistics program, developed by
Prof. Gary N. Smith, Department of Economics, Pomona College Claremont/California. SSP is available
for Mac OSX/OS9 and Windows.

Ox : is an objectoriented matrix programming
language with a comprehensive mathematical and statistical function library, developed by Jurgen A.
Doornik. The Console (command line) versions may be used freely for academic research and teaching
purposes . Matrices can be used directly in expressions, for example to multiply two matrices, or
to invert a matrix. The major features of Ox are its speed, extensive library, and welldesigned
syntax, which leads to programs which are easier to maintain. For a first impression of the matrix
and statistical function library see the Function summary. Versions of Ox are available for
Windows, Linux, several Unixes, and Mac OSX.

Mx, developed by M. C. Neale (Department of
Psychiatry, MCV Virginia Commonwealth University), is a combination of a matrix algebra interpreter
and a numerical optimizer. It enables exploration of matrix algebra through a variety of operations
and functions. There are many builtin fit fuctions to enable structural equation
modelingand other types of statistical modeling of data. It offers the fitting fuctions found
in commercial software such as LISREL, LISCOMP, EQS and CALIS, along with facilities for maximum
likelihood estimation of parameters from missing data structures, under normal theory. Complex
'nonstandard' models are easy to specify. For further general applicability, it allows the user to
define their own fit functions, and optimization may be performed subject to linear and nonlinear
equality or boundary constraints. It is freely available with a graphical user interface for
Windows platforms or as a commandline/server version for Unix/Linux.

Mondrian, developed by Martin Theus, is a
statistical datavisualization system written in JAVA, running on Mac/Windows/Linux. The main
emphasis of Mondrian is on visualization techniques for Categorical Data , Geographical Data and
LARGE Data.
All plots in Mondrian are fully linked, and offer various interactions and queries. Any case
selected in a plot in Mondrian is highlighted in all other plots.

PAST, developed by Øyvind Hammer of
University of Oslo and CoWorkers David A.T. Harper and P.D. Ryan, is a free, easytouse data
analysis package aimed at paleontology. Inspired by PALSTAT, it includes common statistical,
plotting and modelling functions:
 A spreadsheettype data entry form
 Graph, scatter, 3D scatter, histogram, box, ternary, survivorship, spindle, matrix and normal
probability plots
 Curve fitting: Linear (Standard and Reduced Major Axis) with bootstrapping, linlog
(exponential), loglog (allometric), logistic, von Bertalanffy, sumofsines, Bsplines.
 F, t, permutation t, Chisquared w. permutation test, KolmogorovSmirnov, MannWhitney,
ShapiroWilk, Spearman's Rho and Kendall's Tau tests, correlation, covariance, contingency tables,
oneway ANOVA, KruskalWallis test, mixture analysis.
 Diversity indices with bootstrapping and permutation, rarefaction. Capturerecapture richness
estimators. Dice, Jaccard, Simpson and RaupCrick similarity indices, Renyi diversity
profiles.
 Abundance model fitting: Geometric, logseries and lognormal.
 Multivariate statistics: Principal Components (with Minimal Spanning Tree), Principal
Coordinates (14 distance measures), Nonmetric Multidimensional Scaling (14 distance measures),
Detrended Correspondence Analysis, Cluster analysis (three algorithms, 13 distance measures,
twoway clustering), kmeans clustering, seriation, discriminant analysis, oneway MANOVA, oneway
ANOSIM, oneway NPMANOVA, Hotelling's T2, paired Hotelling's T2, Mahalanobisdistance permutation,
Box's M, Canonical Variates Analysis, multivariate allometry with bootstrapping.
 Time series analysis: Spectral analysis, autocorrelation, crosscorrelation, wavelet transform,
Walsh transform, runs test.
 Geometrical analysis: Directional statistics, rose plots, point distribution statistics,
Fourier shape analysis, elliptic Fourier shape analysis, eigenshapes, landmark analysis with
Procrustes fitting (2D and 3D), thinplate spline transformation grids with expansions and
principal strains, partial warps and scores, relative warps and scores, centroid size from
landmarks, size removal by Burnaby's method.
 Parsimony analysis (cladistics): Exhaustive, branchandbound and heuristic algorithms, Wagner,
Fitch and Dollo characters. Bootstrap, strict and majority rule consensus trees. Consistency and
retention indices. Three stratigraphic congruency indices with permutation tests.
 Biostratigraphy with the methods of Unitary Associations, RankingScaling (RASC) and
Constrained Optimization (CONOP). Confidence intervals on stratigraphic ranges.
 Gridding (spatial interpolation): Moving average, thinplate spline and kriging with three
semivariogram models.
Included in the distribution are real data sets for educational use, together with extensive
documentation and case studies.
PAST has been tested under Windows 95, 98, 2000, NT 4 and XP.

Calcugator:
The Calcugator is a calculator, a plotting engine, and a programming
environment. It's also a computer program you may download and use for free. An online applet
with a restricted user interface is available too.
 As a calculator it excels in simplicity of use and rivals programs like MATLAB. It is
equally suited for engineers as well as for high school students. It offers close to 200
functions/operators to perform real, integer, rational, complex, boolean, statistical, vector,
array and matrix computations. Both the input and output of the program are displayed on standard
windows which can be further edited, saved, merged, printpreviewed and printed.
 As a plotting engine it allows rapid creation of 2D plots of functions as well as
polar and parametric displays. It also allows creating 3D plots of functions, contour plots and 3D
parametric displays. The Calcugator has also functions to create bar, pie, pareto and xy charts.
All plots can be configured using the mouse (zooming, panning, selecting). Titles and labels are
supported. However, the most useful characteristic is that all figures created by the Calcugator
can be exported into popular file formats or pasted into an editable window.
 As a programming environment it offers a computer language with identical syntax as
that of Java/C/C++. However, the user does not need to compile his/her programs. The language is
simple, compact and the error control is intelligent when displaying messages. The creation of
userdefined functions is natural and mimics the syntax used by a teacher on the blackboard.
[source: Calgugator Website ]
Download from
[ this site ]

MicrOsiris: is a comprehensive statistical and
data management package for Windows developed and marketed by Van Eck Computer Consulting. It is
derived from OSIRIS IV, a statistical and data management package developed and used at the
University of Michigan and includes special techniques for data mining and analysis of nominal and
ordinalscaled data (MNA, MCA). MicrOsiris can readily accept data from SPSS, SAS, and Excel as
well as from other sources..
Of special interest is SEARCH, a binary segmentation command used to develop predictive models for
dependent variables. It is an elaboration of the programs developed at the Institute for Social
Research. SEARCH divides a sample, through a series of binary splits, into mutually exclusive
series of subgroups. The groups are chosen so that at each step the two new groups account for more
of the variance or distribution than any other pair of subgroups.

NORM, written by Joe
Schafer and Maren Olsen of the Department of Statistics, The Pennsylvania State University, is a
Windows Software for multiple imputation of multivariate continuous data under a normal
model.

Xtremes, written by
a Prof. R.D. Reiss (University of Siegen/Germany) and a large team of contributors, is an
interactive software for general statistics and risk analysis.
Xtremes possesses graphics facilities, a facility to generate and load data, an arsenal of
diagnostic tools and statistical procedures, and a numerical part for MonteCarlo simulations.
Xtremes is first of all designed especially for use with a mouse, yet it can also perform
excellently with just the keyboard.
The experienced user can employ the integrated Pascallike programming language StatPascal to
implement simulation or estimation procedures not covered by the menu system. Moreover, the
UserFormula facility may be utilized when a single formula is required to plot a curve or to
generate or transform a data set.
The free academic Windows edition, included in the Statistical Analysis book, only allows
a certain limited number of rows in a data set.
It consists of Windows (95, 98, 2000, NT) applications
 a rich menu system
 the integrated StatPascal programming
language
 a Windows help system
 a HTML help system
 additional statistical procedures  e.g., for sumstable distributions  utilizing recent
system DLLs.
Facilities to download the academic edition may be found in the download
area.
Xtremes is documented in the Statistical Analysis book and the
pertaining online help. The StatPascal Manual can be read online and is also provided in the file
spdoc.ps in the sp subdirectory of the Xtremes installation.
Major enhancements of Xtremes 3.0:
The menu system
 Plotting facilites and estimators for multivariate
 extreme value
 and POT models,
 in conjunction with facilities to compare parametric and empirical curves such as contour
plots
 New time series tools
 Bayes and LMoment estimators for univariate POT models
 Statistics for sumstable distributions (using STABLE by DLL)
 Statistics in further parametric models (e.g. Student)
 Estimators for ARMA series
 Moving estimators in conjunction with VaR considerations
 Animation facilities for generating discrete and continuous data
 Improved support for printing
 Direct generation of EPS files
Xtremes is driven by StatPascal, a statistical programming language based on the Pascal
language. StatPascal can be executed within the Xtremes package using the SP button in the menu
toolbar. StatPascal supports
 most operations of standard Pascal
 use of most Xtremes menu options as functions
 graphics
 vector and matrix extensions

IVEware Imputation and Variance
Estimation Software by T. E. Raghunathan, Peter W. Solenberger and John Van Hoewyk, Institute
for Social Research at the University of Michigan, is a set of C and FORTRAN routines that can be
launched from SAS or run independently as a commandline program using data from many sources.
IVEware is currently available for personal computers using the Microsoft Windows and Linux
operating systems and for UNIX workstations using the Sun Solaris operating system. The SAS
implementation requires SAS 6.12 or higher.
IVEware can:
 Perform single or multiple imputations of missing values using the Sequential Regression
Imputation Method described in the article "A multivariate technique for multiply imputing missing
values using a sequence of regression models" by Raghunathan, Lepkowski, Van Hoewyk and Solenberger
(Survey Methodology, June 2001).
 Perform a variety of descriptive and model based analyses accounting for such complex design
features as clustering, stratification and weighting.
 Perform multiple imputation analyses for both descriptive and modelbased survey
statistics.
 BrightStat, developed by Dr. Daniel Stricker,
University of Bern, is a comprehensive Javabased statistics package running on the net. The aim
was to provide a freely accessible statistic tool with many capabilities, yet easy to use.
BrightStat is expanded continually.
Tests running on BrightStat so far:
Nonparametric Tests:
 Binomial Test,
 kxl ChiSquare (likelihood Ratio, Phi, Cramer's V, Symmetric Measures),
 McNemar (McNemar exact, likelihood Ratio, Phi, Cramer's V, Symmetric Measures),
 Fisher's Exact Test,
 Cochran,
 MannWhitney U,
 KruskalWallis (paired comparisons for posthoc analysis),
 Wilcoxon,
 Sign,
 Friedman (Conover Statistic and paired comparison for posthoc analysis),
 KolmogorovSmirnov one sample (Uniform, Normal and Exponential, Lilliefors correction for test
against normality).
Parametric Tests:
 OneSample Ttest,
 Two independent Samples Ttest (Levene test for homogeneity of variances),
 Two dependent Samples Ttest (paired correlations),
 OneWay ANOVA (Duncan, Tukey HSD, Scheffe, GamesHowell),
 Repeated OneWay ANOVA (estimation of Epsilon, LowerBound, GreenhouseGeisser,
HuynhFeldt),
 Pearson Bivariate Correlations,
 Spearman Rankcorrelations,
 Multiple Linear Regression (Correlationmatrix, ModelFit, ANOVATable and Parameter Estimates,
Part and Partial Correlations, Collinearity Diagnostic, Change Statistics, Methods: Enter, Forward,
Backward and Step),
 OneSample Variance Test,
 kSample Variance Test (Levene's Test for homogeneity of Variances).
BrightStat already comes along with many useful features for data handling and manipulation,
statistics and data managing, all packed in an easy to use interface. XML based data and output
structures simplify the future development of external viewers.
More lists of free statistical software
Reviews of free statistical software

Xiaoping Zhu and Ognjen Kuljaca
2005: A Short Preview of Free Statistical Software Packages for Teaching Statistics to
Industrial Technology Majors. Journal of Industrial Technology 21, 2, April 2005.

Detailed
Review of WinBugs 1.4 by Harvey Goldstein, University of London

Comparison chart of
Statistical Packages, comparing capabilities of several free and commercial packages.

R for SAS and SPSS Users, a Book by Robert
A. Muenchen, Statistical Consulting Center at the University of Tennessee, is a guide primarily for
experienced SAS and SPSS users who want to migrate to the free R package. It covers detailed syntax
comparison of the free R package with the SAS and SPSS packages. An outdated free PDFversion can be
downloaded from the site, where book example programs, articles and comparison charts are also
provided.

QuickR for SAS/SPSS/Stata Users, a
Website by Rob Kabacoff, is a guide to R with focus on migration from SAS, SPSS, STATA and
SYSTAT to R syntax
List of free statistical handbooks & online tools

