
18 Free Exploratory Data Analysis Tools For People who don’t code so well

Tags: data analysis
Posted by admin on 2017-04-06 11:47:09
Translators: sysu天下无病, 胡萝卜Carrot, 廿九_


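Each of us has our own talents; discovering them and starting to believe in ourselves is only a matter of time. We all have limitations, but should we stop there? The answer is no.

When I started coding in R, I struggled, sometimes more than I could have imagined, because I had never written code in my life. I was like someone who had never learned to swim and was kicked into deep water, fighting with all my strength to stay afloat while swallowing mouthfuls of salt water.

Now, when I look back, I laugh. Do you know why? Because I could have chosen data analysis tools that require no programming and spared myself that pain.

Data exploration is an indispensable part of predictive modeling. You can’t make predictions unless you know what happened in the past. The most important skill for mastering data exploration is curiosity, which is free of cost yet not owned by everyone.

I wrote this article to help you get to know the various free tools available for exploratory data analysis. These days, plenty of free and interesting tools are available to help with this work. They don’t require you to write code explicitly; a few mouse clicks get the job done.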


Tools / Software for Data Analysis Without Programming

1. Excel / Spreadsheet

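Whether you are about to step into data science or have already made a name in the field, you will know that Excel has been an indispensable part of data analysis for many years and remains one of its most widely used tools. Even today, a large share of projects that need data analysis rely on Excel. With more and more help available from the community, tutorials, and free resources, learning Excel has become easier and easier.

Excel supports nearly all of the most commonly used data analysis functions: summarizing data, visualizing data, and transforming data (removing noisy data) to obtain a new data set for analysis. These features are powerful enough to let us inspect data from many angles. No matter how many other data analysis tools you know, you must learn Excel. Although Microsoft Excel is paid software, you can use alternatives such as OpenOffice or Google Docs!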


Free Download: Click Here


2. Trifacta

Trifacta’s Wrangler tool is challenging the traditional methods of data cleaning and manipulation. Whereas Excel has limits on data size, this tool has no such boundaries, and you can securely work on big data sets. It has impressive features such as chart recommendations, built-in algorithms, and analysis insights, with which you can generate reports in no time. It’s an intelligent tool focused on solving business problems faster, thereby making us more productive at data-related work.

The availability of such free tools makes us feel more confident and supported, knowing that there are good people around the world working extremely hard to make our lives better.

Free Download: Click Here


3. RapidMiner

This tool emerged as a leader in the 2016 Gartner Magic Quadrant for Advanced Analytics. Yes, it’s more than a data cleaning tool: it extends its expertise to building machine learning models, and it includes all the ML algorithms we use frequently. It’s not just a GUI, either; it also supports people who use Python and R for model building.

It continues to fascinate people around the world with its remarkable capabilities. Above all, it claims to provide a lightning-fast analytics experience. The product line has several products built for big data, visualization, and model deployment, some of which (enterprise) carry a subscription fee. In short, it’s a complete tool for any business that needs to perform every task from data loading to model deployment.

Free Download: Click Here



4. Rattle GUI 

If you tried using R but couldn’t get the hang of what’s going on, Rattle should be your first choice. This GUI is built on R. You install it by typing install.packages("rattle") in R, then launch it with library(rattle) followed by rattle(). Therefore, to use Rattle you must install R. It’s also more than just a data mining tool: Rattle supports various ML algorithms such as Tree, SVM, Boosting, Neural Net, Survival, and Linear models.

It’s widely used these days. According to CRAN, rattle is installed 10,000 times every month. It provides enough options to explore, transform, and model data in just a few clicks. However, it has fewer options than SPSS for statistical analysis. Then again, SPSS is a paid tool.

Free Download: Click Here



5. Qlikview

Qlikview is one of the most popular tools in the business intelligence industry around the world. Deriving business insights and presenting them in an awesome manner is what this tool does. With its state-of-the-art visualization capabilities, you’d be amazed by the amount of control you get while working on data. It has a built-in recommendation engine that updates you from time to time about the best visualization methods while you work on data sets.

However, it is not statistical software. Qlikview is incredible at exploring data, trends, and insights, but it can’t prove anything statistically. For that, you might want to look at other software.

Free Download: Click Here



6. Weka 

An advantage of using Weka is that it is easy to learn. For a machine learning tool, its interface is intuitive enough for you to get the job done quickly. It provides options for data pre-processing, classification, regression, clustering, association rules, and visualization. Most of the steps you think of while building a model can be done in Weka. It’s built on Java.

It was originally designed for research purposes at the University of Waikato, but it was later adopted by more and more people around the world. However, over time I haven’t seen a Weka community as enthusiastic as those of R and Python. The tutorial listed below should help you further.

Free Tutorial: Click Here



7. KNIME 

Similar to RapidMiner, KNIME offers an open source analytics platform for analyzing data, which can later be deployed and scaled using other supporting KNIME products. This tool has an abundance of features for data blending, visualization, and advanced machine learning algorithms. Yes, you can build models with this tool too. There hasn’t been enough talk about this tool, but considering its state-of-the-art design, I think it will soon get the limelight it deserves.

Moreover, quick training lessons are available on their website to get you started with this tool right now.

Free Download: Click Here



8. Orange 

As cool as it sounds, this tool is designed for interactive data visualization and data mining tasks. There are plenty of YouTube tutorials for learning it. It has an extensive library of data mining tasks that includes all the classification, regression, and clustering methods. In addition, the versatile visualizations formed during data analysis let us understand the data more closely.

To build any model, you’ll be required to create a flowchart. This is interesting, as it helps us further understand the exact procedure of data mining tasks.

Free Download: Click Here



9. Tableau Public

Tableau is data visualization software. We can say Tableau and Qlikview are the most powerful sharks in the business intelligence ocean, and the debate over which is superior is never-ending. It’s fast visualization software that lets you explore data, down to every observation, using various possible charts. Its intelligent algorithms work out on their own the type of data, the best method available, and so on.

If you want to understand data in real time, Tableau can get the job done. In a way, Tableau gives data a colorful life and lets us share our work with others.

Free Download: Click Here



10. Data Wrapper 

It’s lightning-fast visualization software. The next time someone on your team is assigned BI work and has no clue what to do, this software is worth considering. Its visualization bucket comprises line charts, bar charts, column charts, pie charts, stacked bar charts, and maps. So it’s basic software and can’t be compared with giants like Tableau and Qlikview. This tool is browser-based and doesn’t require any software installation.



11. Data Science Studio (DSS)

It is a powerful tool designed to connect technology, business, and data. It is available in two segments: Coding and Non-Coding. It’s a complete package for any organization that aims to develop, build, deploy, and scale models on a network. DSS is also powerful enough to create smart data applications that solve real-world problems. It includes features that facilitate team integration on projects. The most interesting feature of all is that you can reproduce your work in DSS, as every action in the system is versioned through an integrated Git repository.

Free Download: Click Here



12. OpenRefine

It started as Google Refine, but it looks like Google dropped the project for unclear reasons. However, the tool is still available, renamed OpenRefine. Among the generous list of open source tools, OpenRefine specializes in messy data: cleaning, transforming, and shaping it for predictive modeling. As an interesting fact, during model building an analyst spends 80% of the time on data cleaning. Not so pleasant, but it’s a fact. Using OpenRefine, analysts can not only save time but also put it to use for productive work.
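
For readers curious about what "cleaning messy data" can look like under the hood, here is a minimal Python sketch of one common cleanup step: grouping inconsistent spellings of the same value by a normalized key. The function names and the sample company names are made up for illustration, and this is an assumption about the general idea, not OpenRefine’s actual implementation.

```python
import string
from collections import defaultdict


def fingerprint(value):
    """Build a normalized key: lowercase, drop punctuation, sort unique tokens."""
    table = str.maketrans("", "", string.punctuation)
    cleaned = value.strip().lower().translate(table)
    tokens = sorted(set(cleaned.split()))
    return " ".join(tokens)


def cluster(values):
    """Group values that share a key; keep only groups with more than one member."""
    groups = defaultdict(list)
    for value in values:
        groups[fingerprint(value)].append(value)
    return [group for group in groups.values() if len(group) > 1]


# Hypothetical messy column: four spellings of one company plus an unrelated name.
names = ["Acme Inc.", "acme inc", "Inc. Acme", "Globex", "ACME  INC"]
clusters = cluster(names)
print(clusters)
```

A point-and-click tool would then let the analyst pick one spelling per group and apply it to the whole column.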

Free Download: Click Here



13. Talend

Decision making these days is largely driven by data. Managers and professionals no longer make gut-based decisions. They need a tool that can help them quickly. Talend can help them explore data and support their decision making. More precisely, it’s a data collaboration tool capable of cleaning, transforming, and visualizing data.

Moreover, it offers an interesting automation feature where you can save a previous task and redo it on a new data set. This feature is unusual and isn’t found in many tools. It also performs auto-discovery and provides smart suggestions to the user for enhanced data analysis.

Free Download: Click Here



14. Data Preparator 

This tool is built on Java to assist us in data exploration, cleaning, and analysis. It includes various built-in packages for discretization, numeration, scaling, attribute selection, missing values, outliers, statistics, visualization, balancing, sampling, row selection, and several other tasks. Its GUI is intuitive and simple to understand. Once you start working with it, I’m sure it won’t take you long to figure out how it works.
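
To make a few of those terms concrete, here is a small Python sketch of three of the listed operations: filling missing values, scaling, and discretization. The specific choices (mean imputation, min-max scaling, equal-width bins) and the sample numbers are assumptions for illustration; the tool itself may offer different or additional variants.

```python
def impute_mean(values):
    """Replace missing entries (None) with the mean of the present ones."""
    present = [v for v in values if v is not None]
    mean = sum(present) / len(present)
    return [mean if v is None else v for v in values]


def min_max_scale(values):
    """Rescale values linearly so the smallest maps to 0 and the largest to 1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def discretize(values, bins):
    """Assign each value to one of `bins` equal-width intervals (0-based index)."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins
    if width == 0:
        return [0 for _ in values]
    # The maximum value would land in bin `bins`, so clamp it into the last bin.
    return [min(int((v - lo) / width), bins - 1) for v in values]


# Hypothetical column of ages with one missing entry.
ages = [20, None, 40, 60]
filled = impute_mean(ages)
scaled = min_max_scale(filled)
binned = discretize(filled, 4)
print(filled, scaled, binned)
```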

A unique advantage of this tool is that the data set used for analysis isn’t stored in computer memory. This means you can work on large data sets without running into speed or memory troubles.
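
The general idea behind working without loading everything into memory can be sketched in Python as follows: read one row at a time and keep only running totals. This is an illustration of the technique under stated assumptions (a CSV source, a column named "amount", Welford’s running-variance update), not a description of how this particular tool is implemented.

```python
import csv
import io
import math


def streaming_summary(rows, column):
    """Compute count, mean, and sample standard deviation in a single pass."""
    count = 0
    mean = 0.0
    m2 = 0.0  # running sum of squared deviations from the current mean
    for row in rows:
        value = row.get(column)
        if value is None or value.strip() == "":
            continue  # skip missing values
        x = float(value)
        count += 1
        delta = x - mean
        mean += delta / count
        m2 += delta * (x - mean)
    std = math.sqrt(m2 / (count - 1)) if count > 1 else 0.0
    return count, mean, std


# Hypothetical in-memory sample; a real file would be read via open(path, newline="").
sample = io.StringIO("id,amount\n1,10\n2,20\n3,\n4,30\n")
count, mean, std = streaming_summary(csv.DictReader(sample), "amount")
print(count, mean, std)
```

Because only three numbers are kept between rows, memory use stays constant no matter how many rows the file has.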

Free Download: Click Here



15. DataCracker  

It’s data analysis software that specializes in survey data. Many companies run surveys but struggle to analyze them statistically. Survey data are never clean; they contain lots of missing and inappropriate values. This tool reduces our agony and improves the experience of working with messy data. It is designed to load data from all major internet survey programs, such as SurveyMonkey and SurveyGizmo. It has several interactive features that help you understand data better.

Free Download: Click Here



16. Data Applied 

This powerful interactive tool is designed to build, design, and share data analysis reports. Creating visualizations on large data sets can sometimes be troublesome, but this tool is robust at visualizing large amounts of data using tree maps. Like all the other tools above, it has features for data transformation, statistical analysis, anomaly detection, and so on. All in all, it’s a multi-purpose data mining tool capable of automatically extracting valuable knowledge (signal) from raw data. You’d be amazed to see that such non-programming tools are no less capable than R or Python for data analysis.

Free Download: Click Here



17.  Tanagra Project 

You might not like it because of its old-fashioned UI, but this free data mining software is designed to build machine learning models. The Tanagra project started as free software for academic and research purposes. Being an open source project, it gives you enough room to devise your own algorithms and contribute.

Along with supervised learning algorithms, it supports paradigms such as clustering, factorial analysis, parametric and nonparametric statistics, association rules, and feature selection and construction algorithms. Its limitations include the lack of support for a wide set of data sources, direct access to data warehouses and databases, data cleansing, and interactive use.

Free Download: Click Here



18. H2o

H2o is one of the most popular software packages in the analytics industry today. In a few years, this organization has succeeded in evangelizing the analytics community around the world. With this open source software, they bring a lightning-fast analytics experience, which is further extended through APIs for programming languages. Beyond data analysis, you can build advanced machine learning models in no time. The community support is great, so learning this tool isn’t a worry. If you live in the US, chances are they are organizing a meetup near you. Do drop by!

Free Download: Click Here



Bonus Additions:

In addition to the awesome tools above, I also found some more tools that you might be interested in. These tools aren’t free, but you can still try them on a trial basis:

  1. Data Kleenr
  2. Data Ladder
  3. Data Cleaner
  4. WinPure

 

End Notes

Once you start working on these tools (whichever you choose), you’ll understand that knowing programming isn’t much of an advantage for predictive modeling. You can accomplish the same thing with these open source tools. So if, until now, you were disappointed by your lack of coding skills, now is the time to channel your enthusiasm into these tools. You may also be interested in 19 Data Science Tools for Non Coders.

The only limitation I see with some of these tools is a lack of community support. Apart from a few, several of them don’t have a community where you can seek help and suggestions. Still, they’re worth a try!

Did you like reading this article? Have you worked on any of the tools listed above? Which one do you think is the most versatile? Drop your suggestions / opinions in the comments below.

