The Wolfram Solution forData ScienceBring in your data and combine it with the ever-increasing store of knowledge in the Wolfram Knowledgebase. Apply sophisticated symbolic and numeric analysis, and automatically generate rich, interactive reports that can be deployed in the cloud and through APIs—all in one system, with one integrated workflow. Underlying the Wolfram data science solution are world-class machine learning and classification algorithms, semantic data representation, and the knowledge-based Wolfram Language. |
![]() |
The Wolfram data science solution offers thousands of built-in functions and curated data on many topics that let you:
- Automatically preprocess data, including missing-values imputation, normalization, and feature selection with built-in machine learning capabilities
- Semantically import and structure your data
- Create, schedule, and share interactive reports in the cloud automatically
- Build customized analytical tools
- Fit data to common models or to newly developed theoretical models
- Study voting patterns or other social statistics
- Merge built-in economic data with your customer data to understand how changes in sales are linked to broader events
- Make statistical visualizations with a high level of algorithm automation and computational aesthetics
- Easily create interactive tools for analyzing your data
- Develop original, sophisticated algorithms for data science
- Import, analyze, visualize, and publish in a single end-to-end data science environment
Develop computationally intensive business intelligence applications with superior user experience and deployment flexibility
Create histograms with log scales for easier interpretation or show histograms alongside other visualizations
Does your current tool set have these advantages?
-
Combine your data with the ever-increasing store of knowledge in the Wolfram Knowledgebase on a wide range of topics, accessible programmatically and ready for further analysis
No other system has built-in access to such a broad selection of curated data -
Instantly create interactive tools for curve-fitting or data analysis
Other programs do not allow creation of interactive tools -
Import, analyze, and deliver results in one interactive document instead of across several applications
MATLAB does not offer this feature -
Create highly customized, presentation-quality visualizations, including charts, bar charts, scatter plots, and many more
SPSS requires explicit programming for customized charts - Work with sparse arrays with algorithm support that improves performance of very-large-scale linear algebra operations
- Work with 35 properties of over 100 statistical distributions with specialized coverage for finance, medicine, and engineering—more distributions than any other system, including dedicated statistics software
- Extend built-in algorithms with your own models
-
Use readable, easily recognizable function names
R's function names are nonstandard abbreviations -
Use built-in parallel and GPU computing capabilities to speed your computations
Other programs do not offer built-in parallel or GPU computing
Work with nonparametric data models in any number of dimensions
Create dashboards that analyze and display live data in real time
Data science specific capabilities:
- More statistical distributions than any other system, with specialized coverage for finance, medicine, engineering, and science »
- Built-in machine learning functions for working with many types of data, including numerical, categorical, textual, and image »
- Systemwide support for random processes, including parametric processes, hidden Markov processes, queueing processes, time series, and stochastic differential equation processes »
- Define and use new distributions from data, formulas, or other distributions, including copulas, mixtures, order statistics, censoring, truncation, and transforms »
- Estimate distribution parameters from data and test goodness of fit of data to distributions
- Model, analyze, synthesize, and visualize graphs and networks, including mixed graphs and multigraphs, social networks, and 3D graph visualizations »
- Broad support for censored data, optimized parametric and nonparametric survival modeling frameworks, and a range of generalized hypothesis-testing functions »
- Connectivity to the Hadoop framework, including data import/export and job execution via the HadoopLink package »
- Systemwide support for working with time series, including fully automated time series model fitting and diagnostics »
- Cluster analysis for numerical, Boolean, and string data, in arbitrary dimensions and with arbitrary distance functions »
- State-of-the-art data classification algorithms »
- Text analysis with built-in string manipulation and pattern-matching functions »
- Linear, nonlinear, logit, probit, and generalized linear regression models »
- General moving-average and other smoothing functions »
- Convolutions and correlations of data »
- Powerful and efficient nearest-neighbor algorithms in any number of dimensions »
- Efficient pattern-matching functions »
- Functions to count, sort, and bin data »
- Integrate R code into your data science workflow, combining Wolfram's broad range of capabilities with the statistical computing language »
- Wide variety of built-in standard charts and graphs, including pie and bar charts, paired histograms, box-and-whisker charts, radar plots, and quantile plots, all using Mathematica's automation and flexibility »
- Access terabytes of curated data from Wolfram|Alpha, immediately ready for analysis, interactively or programmatically »
- Use natural-language commands to guide automated analyses or request particular kinds of analysis
- Hundreds of formats for importing and exporting data »
- Interactive analysis tools that are quick to build and deploy »
- Create interactive content in the CDF format, and immediately deploy it to the free Wolfram CDF Player or to the cloud as slide shows, reports, books, applications, and web objects »











