Instance-Based Methods
Probabilistic Classifiers
Decision Trees
Rule Learners
Regression
Learning Algorithms
Clustering
Model Validation
Cluster Validity
Model Performance
Meta Learning
Data Visualization
Relational
Databases
NoSQL
Data Science
Paradigms
Platforms
Resource Managers
Production
Architectures/Frameworks
Agnostic Specifications
Distributed ML Libraries
Cloud
Programming Languages
Data Cleaning
Data Preparation
Dimensionality Reduction
Statistics
Web Frameworks
Development
Business Acumen
http://www.datascienceontology.com/
Visualization
Version Control
Connectivity Models
Centroid Models
Distribution Models
Density Models
Subspace Models
Confusion Matrix
Performant
Kappa Statistic
Sensitivity and Specificity
Precision and Recall
ROC Curves
Internal Evaluation
Cross-Validation
Bootstrap Sampling
External Evaluation
Automated Parameter Tuning
Customized Tuning
Regularization
Stacked Generalization
Gradient Boosting Machines (GBM)
Line Chart
Bar Chart
Histogram
Scatterplot
Boxplot
Pareto Chart
Pie Chart
Area Chart
Control Chart
Run Chart
Stem-and-Leaf Display
Cartogram
Sparkline
Table
Microsoft SQL Server
MySQL
SQLite
PostgreSQL
Netezza
SQL Azure
EnterpriseDB
DB2
Oracle
Key-Value Store
Document Store
Column Family Stores
Graph
Batch
Real-Time/Streaming
Mixed
R
Python
Scala
Julia
Java
Clojure
Type Conversion
Character Manipulation
Character Encoding
Missing Values
Special Values
Outliers
Inconsistencies
Error Localization
Transformation
Deductive Correction
Imputation
Minimal Value Adjustment
Feature Selection
Feature Engineering
Hypothesis Testing
P-Value
Effect Size
Confidence Interval
Meta-Analysis
Heteroskedasticity
Benford's Law
Multiple Hypothesis Testing
Familywise Error Rate
False Discovery Rate
Covariance
Correlation
Frequentist Approaches
Bayesian Approaches
R
Java
PHP
Ruby
Python
JavaScript
CSS
MapReduce
Dataflow
Pig
Hive
YARN
PMML
Mahout
MLlib
GraphLab
AWS/EC2
GCE
D3
GitHub
Strong Communication
Simplification of Complex Concepts
Alignment of Algorithms to Pain Points
Augmenting Organizational Values
Boosting Existing Employee Skills
Helping Build Data Culture
DynamoDB
Riak
Redis
Berkeley DB
Voldemort
MemcacheDB
ArangoDB
MongoDB
CouchDB
RavenDB
RaptorDB
HBase
Cassandra
Hypertable
Accumulo
Neo4j
InfiniteGraph
HyperGraphDB
OrientDB
Hadoop
Spark
Storm
Apache Kafka
Lambda
Apache Samza
Naiad
Shiny
Foundation Framework
Twitter Bootstrap
Yahoo Pure
Sean McClure
Data Scientist, ThoughtWorks