分类
确定对象属于哪个类别。
应用: 垃圾邮件检测,图像识别。 算法Algorithms: Gradient boosting, nearest neighbors, random forest, logistic regression, and more...
回归
预测与对象关联的连续值属性。
应用: 药物反应,股票价格。 算法Algorithms: Gradient boosting, nearest neighbors, random forest, ridge, and more...
降维Dimensionality reduction
减少要考虑的随机变量的数量。
应用Applications: 可视化,提高效率。 算法Algorithms: PCA, feature selection, non-negative matrix factorization, and more...
模型选择Model selection
比较、验证和选择参数和模型。
应用Applications: 通过参数调整提高精度。 算法Algorithms: Grid search, cross validation, metrics, and more...
预处理Preprocessing
特征提取和归一化。
应用Applications: 转换输入数据(如文本)以用于机器学习算法。 算法Algorithms: Preprocessing, feature extraction, and more...
"We use scikit-learn to support leading-edge basic research [...]"
"I think it's the most well-designed ML package I've seen so far."
"scikit-learn's ease-of-use, performance and overall variety of algorithms implemented has proved invaluable [...]"
"The great benefit of scikit-learn is its fast learning curve [...]"
"It allows us to do AWesome stuff we would not otherwise accomplish."
"scikit-learn makes doing advanced analysis in Python accessible to anyone."