R, Python Duel As Top Analytics, Data Science software – KDnuggets 2016 Software Poll Results
R remains the leading tool, with 49% share, but Python grows faster and almost catches up to R. RapidMiner remains the most popular general Data Science platform. Big Data tools used by almost 40%, and Deep Learning usage doubles.
Full Results and 3-year trends
The following table shows the poll results in detail, excluding Deep Learning tools for which 3 year results are not available.% alone is the percent of tool voters used only that tool alone, shown only for tools that have 5% or such votes. For example, 11.4% of RapidMiner users have used only Rapidminer.
|
What Analytics, Big Data, Data mining, Data Science software you used in the past 12 months for a real project? [2895 voters] | |
|
Legend: red: Free/Open Source tools
green: Commercial tools Fuchsia: Hadoop/Big Data tools |
|
| R (1419) | |
| Python (1325) | |
| SQL (1029) | |
| Excel (972) | |
| RapidMiner (944), 11.7 % alone | |
| Hadoop (641) | |
| Spark (624) | |
| Tableau (536) | |
| KNIME (521) | |
| scikit-learn (497) | na |
| Java (487) | na |
| Anaconda (462) | na na |
| Hive (359) | na |
| MLlib (337) | |
| Weka (315) | |
| Microsoft SQL Server (314) | |
| Unix shell/awk/gawk (301) | |
| MATLAB (263) | |
| IBM SPSS Statistics (242) | |
| Dataiku (227), 18.1 % alone | na |
| SAS base (225) | |
| IBM SPSS Modeler (222) | |
| SQL on Hadoop tools (211) | na |
| C/C++ (210) | na |
| Other free analytics/data mining tools (198) | |
| Other programming and data languages (197) | |
| H2O (193) | |
| Scala (180) | na |
| SAS Enterprise Miner (162) | |
| Microsoft Power BI (161) | na |
| HBase (158) | na |
| QlikView (153) | |
| Microsoft Azure Machine Learning (147) | na |
| Other Hadoop/HDFS-based tools (141) | |
| Apache Pig (132) | |
| IBM Watson (121) | na |
| Rattle (103) | |
| Salford SPM/CART/Random Forests/MARS/TreeNet (100), 63.0 % alone | |
| Gnu Octave (89) | |
| Orange (89) | |
| Alteryx (87) | |
| RapidInsight/Veera (87), 51.7 % alone | |
| TIBCO Spotfire (80) | |
| Apache Mahout (74) | |
| Other paid analytics/data mining/data science software (71) | |
| Dato (69) | |
| Pentaho (68) | |
| Perl (67) | |
| IBM Cognos (64) | |
| Splunk/ Hunk (63) | |
| JMP (58) | |
| C4.5/C5.0/See5 (58) | |
| Amazon Machine Learning (55) | na |
| Mathematica (53) | |
| Microsoft other ML/Data Science tools (46) | na na |
| Vowpal Wabbit (45) | na |
| Microstrategy (45) | na |
| SAP Analytics (42) | |
| Stata (39) | |
| Dell/StatSoft (36), 8.3 % alone | |
| XLMiner (35) | na na |
| SAP HANA (35) | na na |
| Julia (32) | |
| Oracle Adv. Analytics (31) | |
| BigML (25), 16.0 % alone | |
| Zementis (25) | |
| BayesiaLab (18) | |
| Alpine Data Labs (16), 12.5 % alone | |
| DataRobot (15), 6.7 % alone | na na |
| Datameer (13), 7.7 % alone | |
| Lavastorm (12) | |
| F# (11) | |
| Clojure (11) | |
| Actian (10) | |
| WordStat (10) | |
| Ayasdi (9) | na |
| Skytree (8) | na |
| Lisp (7) | |
| Ontotext GraphDB (6) | na |
| SiSense (5) | |
| Birst (5) | na |
| FICO Model Builder (5) | |
| WPS World Programming System (4) | |
| Angoss (3) | |
| Predixion Software (2) | |
Additional tools not included but mentioned in the comments include
- XLSTAT
- BeyondCore
- Timi and Anatella
- SAS/STAT
- Domino Data Lab
- MapR
- Neural Designer
- Javascript
- R leads RapidMiner, Python catches up, Big Data tools grow, Spark ignites, 2015
- KDnuggets 15th Annual Analytics, Data Mining, Data Science Software Poll: RapidMiner Continues To Lead, 2014
- KDnuggets 2013 Software Poll: RapidMiner and R vie for first place.
- KDnuggets 2012 Poll: Analytics, Data mining, Big Data software used
- KDnuggets 2011 Poll: Data Mining/Analytic Tools Used
- KDnuggets 2010 Poll: Data Mining / Analytic Tools Used
- KDnuggets 2009 Poll: Data Mining Tools Used
- KDnuggets 2008 Poll: Data Mining Software Used
- KDnuggets 2007 Poll: Data Mining/Analytics Software Tools