0% found this document useful (0 votes)
71 views

Machine Learning Tools and Toolkits in The Explora

Uploaded by

Jonathan Perez
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
71 views

Machine Learning Tools and Toolkits in The Explora

Uploaded by

Jonathan Perez
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 7

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/332113377

Machine Learning Tools and Toolkits in the Exploration of Big Data

Article  in  INTERNATIONAL JOURNAL OF COMPUTER SCIENCES AND ENGINEERING · December 2018


DOI: 10.26438/ijcse/v6i12.570575

CITATIONS READS
6 501

2 authors, including:

Afreen Khan
Aligarh Muslim University
15 PUBLICATIONS   18 CITATIONS   

SEE PROFILE

Some of the authors of this publication are also working on these related projects:

Artificial Intelligence and Big Data Analytics Towards the Advancement of Alzheimer's Research View project

All content following this page was uploaded by Afreen Khan on 04 September 2019.

The user has requested enhancement of the downloaded file.


International Journal of Computer Sciences and Engineering Open Access
Research Paper Vol.-6, Issue-12, Dec 2018 E-ISSN: 2347-2693

Machine Learning Tools and Toolkits in the Exploration of Big Data


Afreen Khan1, SwalehaZubair2*
1,2
Department of Computer Science, Aligarh Muslim University, Aligarh, India
*
Corresponding Author: swalehazubair@yahoo.com, Tel.:+91-9410059635

Available online at: www.ijcseonline.org

Accepted: 28/Dec/2018, Published: 31/Dec/2018


Abstract-Machine learning (ML) is the best way to make progress towards human level artificial intelligence, which allows
software applications to become more accurate in predicting results. It is the most promising technique that has profound
realization in reorganizing practices pertaining to various fields viz. healthcare, education world industry, retail and
manufacturing sectors, traffic and urban planning etc. The compilation and storage followed by specific training of the stored
data are some of the salient features of the machine learning process that has tremendous scope in discovering novel output in
various relevant fields. There are plenty of tools in ML that may help in the training of data without being explicitly
programmed. Tools are categorized into- framework, platform, library, and interface. For the successful development and
effective execution of ML, one can categorically manipulate various related tools. Working through such tools advances the
process as applied to the various applications. In the present study, we intend to exploit recommendation engines for the
development of tools that can handle the huge quantity of data. The usage of the overwhelming quantity of multimodal data
and streamlining the same for its personalized usage are some of the unique features of the study. We also focus on the
evaluation of a toolkit with loads of data and furthering several ML tools along with their features and use for the desired
application in the relevant field.

Keywords-Big data, Application Programming Interface (API), Command Line Interface (CLI), Graphic User Interface (GUI),
Machine learning, Tool, Toolkits, Platform, Library, Interface

I. INTRODUCTION used in data mining, character recognition, search engine,


spam detection, to name a few. These examples echo the
McKinsey, in one of its report, stated, “As ever more of essential role and center stage that the ML has grabbed
the analog world gets digitized, our ability to learn and is all set to hit the technology in today’s data -rich
from data by developing and testing algorithms will world [1].
only become more important for what are now seen as
traditional businesses” [1]. In the past several years, there With the continuous growth and subsequent
has been a technological advancement in the data and development in the key functionalities, methods and
digitization; and analytics have been restructuring the entire practices, uses, and the global significance of ML, it is
world, amplifying performance and facilitating the advent of high time and chief concern to have the knowledge of
recent innovations in the various fields like health, education, the right ML tools and techniques in order to solve the
agriculture, finance, etc. In order to build a dynamic model, demanding tasks [3]. As data deluge is on the rise,
the realm of the technological world is now extended to there has been an increased sophistication of ML
include autonomous several other domains so as to solve whichaids in analyzing the huge amount of Big Data.
them with various competent computer tools and techniques The present traditional practicesprovide little for applying
[2]. This has now become achievable and essential through statistical ML algorithms in dealing with Big Data [4].In the
innovations in the field of Artificial Intelligence (AI). With present times, ML have replaced the conventional
the advancement in the sphere of AI, there comes Machine statistical methods and has also transformed the
Learning (ML) which has brought new and innovative waves approach of data extraction, its processing and
in the present technological globe. interpretation by encompassing mechanized groups of
generic methods [1].
When building programs and algorithms become a complex
task to achieve, ML aids in unravelling the challenging Around the globe, researchers are laying emphasis on
issues. Along with solving these, understanding the building effective ML tools and techniques to examine the
algorithms and their intricacy have increased as well. ML is various problem domains. Thus, this paper will include the

© 2018, IJCSE All Rights Reserved 570


International Journal of Computer Sciences and Engineering Vol.6(12), Dec 2018, E-ISSN: 2347-2693

budding realm of ML in the age of AI and Big Data learn the structure of the data where one does not have a
Analytics. The paper is organized as follows: The subsequent knowledge behind the theory of what the structure appears to
section involves the state of Machine Learning as an be and completely understood theoretical distributions are
indispensable technique. Section 3 discusses how to evaluate then adjusted to the data whereas in statistical model, every
an ML toolkit. Section 4 presents the various ML tools. model has a theory behind it which is scientifically proven
Section 5 introduces the comparison of ML tools followed by but the rule is that the data should meet some robust
a conclusion in Section 6. assumptions [10]. Furthermore, the test for an ML model is a
validation error that is performed on a new data,however, a
II. MACHINE LEARNING- AN INDISPENSABLE TECHNIQUE theoretical test is never carried out which is used to prove a
null hypothesis [10]. ML uses an iterative methodology to
Due to the pervasiveness of data and the huge scalability of learn and discover from data until a strong pattern is obtained
cloud computing power, there has been a massive and this acquisition of knowledge is automated easily.
advancement in the use of AI and ML, which has
developed in significance with its competence to filter The strategic rule of an ML model is the ability to
through large datasets, explore and analyze them, interpret independently learn and progress as new data is loaded into
them with the aim of discovering effective patterns and the system and to convert this data into actionable
lastly, constructing useful predictions based on the result knowledge. The chief notion of the entire ML concept is the
attained. capability of a model to automatically apply sophisticated
mathematical computations to Big Data repeatedly until a
As Intel CEO Brian Krzanich in one of his interview most probable solution is achieved [10].
stated that “Data is the new Oil” then accordingly, it can be
further stated that Machine Learning, a subset of AI, is III. HOW TO EVALUATE AN ML TOOLKIT
fuelled by data. It is built on a modelling scheme of not only
analyzing data but also, it has the ability to learn- get trained The first thing to consider when ML need to be adopted is
and improve- get better by using different algorithms that the tool on which the work will be performed. Each step in
provide new and innovative insights [5]. the ML process can be automated if the right tools are used.
The key importance of selecting the correct tool is reflected
When the talk is about Big Data (BD), ML is the best choice later when right predictions and improved results are
for unravelling the robust issues. In a report, Gartner achieved, thereby it is as essential as working with the finest
defined Big Data as, “high volume, high velocity, algorithms [12]. The toolkit provided differs considerably,
and/or high-variety information assets that require new therefore it is necessary to maintain an equilibrium between
forms of processing to enable enhanced decision keeping up with the latest developments and rigid reliability
making, insight discovery and process optimization and stability of a project [11]. These tools not only provide
[6].” ML and BD are related in such a way where ML is the facility of implementation of ML algorithms but also
centered on diverse algorithms that know how to learn support at every step while the tasks are being executed and
from data with no dependency on rules-based can be used throughout the ML challenge.
programming while BD is the kind of data that is
loaded into such a system where analytical practices are ML advances well only when critical decisions need to be
carried consequently leading to the improvement in the taken which thereby is built on assumptions that are
precision of the predictions that ML model is trained generated from the analysis of data [13]. Thus, there isno one
for [7].Big Data analytics aids in acquiring insights and particular criteria for deciding the best toolkit for ML. Each
hence therefore, better decisions are taken while and every toolkit is developed to focus on the needs as
modifications in modelling are applied [8]. observed by the developer. Presently, there are numerous ML
toolkits available and how to evaluate a specific tool is an
In a report, published in 2018, Gartner mentioned that essential issue in the current times so as to deal with the
“almost half of CIOs are preparing to implement AI in their problem statement in a most practicable way. Following are a
respective organizations [9].” The main purpose of this is to range of different criteria that is usually used to assess any
obtain insights from the collected data so as to understand tool:
and acquire knowledge of the respective model used and
thereby analysing through their behaviour in order to build 1. Language: In regard to developing the ML models and
improved decisions [5]. On the other hand, the level of writing the ML algorithms, the toolkit’s programming
success, how enhanced results, how effective predictions is language in which it is written in, influence the entire
only achieved when the data is used righteously in a way as modelling. The choice of language depends on the comfort
how better understanding one gain from it [5]. ML and level i.e. ability to use it efficiently, type of problem to be
statistical model differ in a way where the aim of ML is to solved, the quantity of data to be processed, and

© 2018, IJCSE All Rights Reserved 571


International Journal of Computer Sciences and Engineering Vol.6(12), Dec 2018, E-ISSN: 2347-2693

developer’s expertise and past experience [14]. There are neural networks, and more to the list. As more and more
certain factors to consider before selecting a language, computation power is required, a huge interest is rising in
likewise, speed, concurrency, performance, cost exploiting ML in addition to the growing access to heaps of
effectiveness, learning curve i.e. functional or procedural, data so as to gain action-driven advantages. To the same
application development, and community support [15]. degree, there exist plenty of tools in ML, when machines
need to be trained to work without being explicitly
2. Type: This includes the different categories in which the programmed [13]. The nucleus of the ML system consists of
toolkits are divided. The categorization done is as follows: ML storage cluster and its computation power, which usually
platform, library, interface and local or remote tool. differs based on the learning method used, its application and
the need to automate it [5].
3. Documentation: The documentation of a specific toolkit
plays an important role in deciding which one to choose ML tool can be a platform, library, an interface or any local
and which one to avoid. If a toolkit is documented well in or remote tool. These toolkits provide the developers to
terms of quality, coverage of a huge number of examples create ML models more quickly and easily without stepping
that look similar to problems one work on, then it is easier into the details of the core algorithms thereby providing a
to build a solution to the particular problem. well-defined and brief approach for classifying ML models
by applying a set of pre-built and improved modules [16].
4. Integrated Development Environment (IDE):The IDE The tools can be divided into four branches, as depicted in
used for ML is as important as the ML techniques that are Figure 1 below.
used to solve the predictive modeling challenges. Certain
toolkits have graphical IDE, and others include command
line and editor IDE.

5. Execution Speed: The execution speed of ML toolkit is a


speed with which the algorithms and models execute the
instructions and classify the tasks via a trained model. It is
faster than training but it is not as important as the training
speed that trains the data in order to predict through the
test data.

6. Training Speed: The training speed is of great importance


as compared to the execution speed. It depends on the Figure 1. Classification of ML Toolkits
efficiency of different math libraries that the toolkit
consists of, and exactly how these libraries use the existing Significant attributes of a good ML toolkit are developer
computational resources. Also, it heavily depends on the friendly, optimized for performance, language and coding
kind of the problem to be solved, the trained data and the choice, ease of understanding, and computation process
images. parallelization among the various processes [2]. In general,
an effective ML toolkit lessens the complex nature of ML,
7. GPU (Graphical Processing Unit) Support:GPUs are causing it to be user-friendly and understandable to new
chiefly used to accomplish and increase the performance developers. The 5 tools defined above are further described
of video and graphics, and boost the network’s learning below, and in particular, specific examples belonging to
speed. They speed up the ML algorithms by latency those are illustrated well in the table-format.
reduction, bandwidth increment, and lessening the
communication cost. 1. ML Platform: ML platform provides an
environment where an ML project can be completed from
IV. ML TOOLS the start till the end. It offers capabilities like data pre-
processing that includes preparing the data so as to model it,
Even though a good set of ML algorithms exist, the recent data analysis, data modeling and evaluation and selection of
development illustrates the capability of ML to apply an algorithm [12]. There are many factors that become
sophisticated mathematical operations to data and refine relevant when the ML platform toolkit needs to be
them rapidly as soon as the processing begins [5]. Mostly implemented. They are: type of data, its characteristics,
used ML processes and algorithms aredecision trees, features of automation, characteristics pertaining to ease of
Bayesian networks, support vector machines, random use and integration, algorithm and modeling techniques,
forests, gradient boosting and bagging, self-organizing supported open source resources and management features
maps, Gaussian mixture models, k-means clustering, [17]. Some of the characteristics of ML platform are:

© 2018, IJCSE All Rights Reserved 572


International Journal of Computer Sciences and Engineering Vol.6(12), Dec 2018, E-ISSN: 2347-2693

 Provide complete facility needed at every step of an ML  It has the ability to reproduce the results by storing
project development. the commands and command line arguments.
 The ML platform interface may include API, CLI or  It supports several small programs and many
GUI, or a combination of these while programming. program genres for certain subtasks of ML project.
 They are used for general purpose modelling, instead of
focusing on accuracy, scalability and speed. iii. ML GUI (Graphical User Interface):Machine
 Features are loosely coupled, therefore it is the task of a learning tools support a GUI which mainly focuses
user to assemble all the components collectively for the on the graphical representation of data i.e.
particular project. visualization and also consists of windows, point
and click [18]. The characteristics of ML GUI are as
2. ML Library: ML library contains capabilities for only follows:
finishing a fragment i.e. one or more steps of an ML  The users that are not an expert in programming, for
project [12]. It is used to unravel the predictive use-cases. those, ML GUI provides an environment where they
It includes facilities like documentation, configuration and can complete their tasks easily through ML.
help data, pre-written subroutines and code, and message  The chief emphasis of ML GUI is on the process
templates [18]. An ML library also provides modeling and by what means maximum information can be
algorithms that are suitable for particular use-cases, each extracted from the ML tools and techniques.
having their own pros and cons. When determining which
ML library to apply, several features need to be 4. A. ML Local Tool:Machine learning Local tool is a
considered. They are programming language, performance tool that can be downloaded, installed and can be used
and hardware features, and ML algorithm [19]. Certain and run in the local environment. The characteristics of
characteristics of the ML library are: ML Local tool are as follows:
 ML library interface is usually an API which involves  It is built for main memory data and its algorithms.
programming.  This ML tool can be incorporated into our own ML
 They are designed for a particular use-case or machines so as to model it according to our needs
environment.  It provides control over the parameters so as to
devise predictions on newer data thereby supporting
3. ML Interfaces: ML interface is another ML tool and is the run configuration of the system.
further forked into three parts, namely, ML API, ML
CLI and ML GUI. 4. B. ML Remote Tool:Machine learning Remote tool
i. ML API (Application Programming is a tool that runs on the server of a third-party. It is a
Interface):Machine learning tools support an API which tool that is established on a server and the operations are
provides the ability to decide what components to work carried out on the local environment by calling it
with and how to apply them in the ML programs [18]. remotely. Thus, ML Remote tools are called Machine
The characteristics of ML API are as follows: Learning as a Service (MLaaS) [12]. The characteristics
 It provides the capability of developing our own ML of ML Remote tool are as follows:
tools.  These ML tools can handle large datasets even
 ML API tool can be used to build our own processes, though the data scales up rapidly.
and thereby, it can be further implemented on ML  It provides a set up where the processes can run
projects so as to automate them in an improved way. amongst the multiple machines, numerous cores
 It gives the flexibility to develop our own methods, while sharing the memory.
merge them with existing the libraries and methods.  Because these tools run remotely at scale, it
supports less number of ML algorithms since
ii. ML CLI (Command Line Interface):Machine learning complex modifications are needed.
tools give an environment of CLI that focuses on input  It has the ability to get incorporated within our local
and output i.e. it structures ML tasks in terms of the environments though RPCs (Remote Procedure
required input and output to be produced [12]. In Calls).
addition, it also comprises of command line
parameterization and command line programs. The V. ILLUSTRATION OF TOOLKITS
characteristics of ML CLI are as follows:
 It provides such an environment where non- The tools described above are illustrated in the following
programmers can perform their tasks through ML Table 1.
projects.

© 2018, IJCSE All Rights Reserved 573


International Journal of Computer Sciences and Engineering Vol.6(12), Dec 2018, E-ISSN: 2347-2693

Table 1. Illustration of ML Toolkits


ML Toolkit Language Features and Use
ML PLATFORM
WEKA ML Java Consists of huge number of ML algorithms, data preparation, data pre-processing, feature
Workbench selection and visualization methods. It provides all the three interfaces- Java API, CLI and
GUI [20].
R Platform R Built for data mining majorly. Chief uses are- data manipulation, matrix arithmetic, statistical
computing, graphical display, efficient data management and storage capabilities [21].
Subset of the Python Python It includes modules for various tasks such as, optimization, image processing, and linear
SciPy (Pandas and algebra and include tools like Pandas, Matplotlib, SymPy, and a developing set of various
Scikit-learn) scientific computing libraries [22].
ML LIBRARY
Scikit Python Supports numerous ML algorithms like, classification, clustering, regression, support vector
machines, k-means, to name a few [23].
JSAT Java Consists of many MLalgorithms that are research and need specific, such as data
(Java Statistical transformation, tree based, predictive, meta algorithms, and algorithms based on vector
Analysis Tool) quantization [24].
Accord (For .NET C# Provides a comprehensive framework for developing applications such as computer audition,
Platform) computer vision, statistical operations, signal processing [25].
ML INTERFACE
API
Pylearn2 Python It has a capability of wrapping other libraries like,Scikit-learn. It supports a dataset interface
for images, vector, video, etc. Also, it provides cross-platform serialization of trained models
[26].
LIBSVM C
It supports a simple interface in which the users can link their individual programs with it
(A Library for Support
easily. It includes: various SVM formulations, probability estimates, automated model
Vector Machines)
selection. It comprises of certain data mining set up too such as LIONsolver,RapidMiner [27].
SysWEKA Java It extends the features of WEKA. Furthermore, it supports the software interface which is
used by advanced applications for the management of resources on cloud setup [28].
CLI
Waffle C++ It provides cross-platform command-line techniques. It supports data transformation,
clustering, classification, data evaluation in reduced dimensionality, model training and
visualization. Moreover, in order to accomplish a specific task, it includes a ‘Wizard’ tool
which directs the user across a sequence of methods to build a command for this task [29].
WEKA Java WEKA as a CLI, supports an environment according to a particular domain. It provides tools
and techniques for data handling, data visualization, cross-validation, database linkage,
comparison of rule sets [30].
GUI
KNIME (Konstanz Java It provides an integrated data pipelining model that combines different modules for ML and
Information Miner) data mining. It consists ofin-built modular workflow method that allowsscalability by means
of complex data handling techniques, allows user to construct data flows visually,
parallelization on multi-core machines, blending of data and tools according to the problem
statement [31].
RapidMiner Java It is an extension for OpenML, which is an open platform for ML. It provides ML and data
mining techniques, such as ETL (Extract, Transform, Load), tools for data pre-processing,
data visualization, statistical modeling and predictive analytics [32].
Orange Python, C++ It is a ML and data mining collection of algorithms for data analysis through visual
programming. It makes easier the construction of data analysis workflow and the
development of different data mining methods from existing modules [33].
ML LOCAL AND REMOTE TOOLS
ML LOCAL
GoLearn Go It employs KNN (k-nearest neighbours) classification and regression, along with certain
essential CSV parsing [34].
Shogun Library C++ It consists of a stand-alone CLI.It is developed for large scale training of models for a wide
variety of learning and feature settings [35].
ML REMOTE
Apache Mahout for Java It provides clustering, collaborative filtering, classification, linear algebra operations,
Hadoop repeated item set timing, and utilities to speed up the vector and sparse matrix calculations
[36].
AWS Machine Java, C++ It trains and tests automatically a bunch of sophisticated models that are set with various
Learning parameters. It supports splitting of dataset, input normalization, and evaluation of model [37].
Microsoft Distributed C++ It supports numerous programming interfaces and is more competent in Big Data research. It
ML Toolkit includes two distributed ML algorithms that are used to make the machines learn the

© 2018, IJCSE All Rights Reserved 574


International Journal of Computer Sciences and Engineering Vol.6(12), Dec 2018, E-ISSN: 2347-2693

(DMTK) largest and fastest topic model and the biggest word-embedding model around the globe
[38].
Microsoft Azure ML Java It is a strong cloud-based tool which is used in analytics that allows predictive management
[39].
MLib for Spark Usable in Java, It consists of good set of ML algorithms that influence iteration and produces improved
Scala, Python, and results. It supports feature transformation, development of ML pipeline, hyper-parameter
R. tuning and model evaluation [40].
[13] https://knowm.org/machine-learning-tools-an-overview/
[14] https://blogs.opentext.com/choosing-the-right-programming-
VI. CONCLUSION language-for-machine-learning-algorithms-with-apache-
spark/amp/
Machine learning is a complicated field and the graph is [15] https://medium.com/@UdacityINDIA/machine-learning-
being rising at an elevated speed as we are heading programming-languages-why-is-the-best-and-why-
forward and becoming stronger technologically.As ML 56f9f370cb99
algorithm is quite difficult to write from scratch, Machine [16] https://www.analyticsindiamag.com/machine-learning-
learning toolkit provides a tremendous amount of framework-10-need-know/
[17] https://searchenterpriseai.techtarget.com/feature/How-to-make-
resources that can be used according to the problem a-wise-machine-learning-platforms-comparison
statement to solve any challenge. In this paper, we have [18] V. Vinothina, “MACHINE LEARNING TOOLS-AN
illustrated many tools that can be used for applying ML OVERVIEW,” in International Conference on Recent Trends in
techniques. The best toolkit is selected on the basis of Engineering Science, Humanities and Management, 2017, pp.
skills, background and use-case of a researcher. Also, the 629–637.
[19] https://www.oreilly.com/ideas/square-off-machine-learning-
type of project and available resources play an important
libraries
role in the selection of a tool. Therefore, when a project is [20] https://machinelearningmastery.com/tour-weka-machine-
started, it is required to spend a certain amount of time to learning-workbench/
assess existing toolkits so as to be confident enough that [21] https://bookdown.org/rdpeng/rprogdatascience/history-and-
the chosen toolkit is best for the situation. overview-of-r.html
[22] https://en.wikipedia.org/wiki/SciPy
REFERENCES [23] https://github.com/scikit-learn/scikit-learn
[24] https://github.com/EdwardRaff/JSAT
[1] https://www.simplilearn.com/what-is-machine-learning- [25] http://accord-framework.net/intro.html
and-why-it-matters-article [26] Pylearn2 Documentation Release dev, LISA lab, University of
[2] https://dzone.com/articles/5-open-source-machine-learning- Montreal, 2015.
frameworks-and-tool [27] https://www.csie.ntu.edu.tw/~cjlin/libsvm/
[3] https://www.forbes.com/sites/ciocentral/2018/02/28/gartner- [28] Thomas A. Henzinger, Anmol V. Singh, Vasu Singh, Thomas
magic-quadrant-whos-winning-in-the-data-machine-learning- Wies,DamienZufferey, “Static Scheduling in clouds”
space/ [29] Mike Gashler, “Waffles: A Machine Learning Toolkit”, Journal
[4] J. V. N. Lakshmi and A. Sheshasaayee, “A Big Data Analytical of Machine Learning Research, 12 (2011), 2383-2387.
Approach for Analyzing Temperature Dataset using Machine [30] G.Holmes, A.Donkin, I.H Witten, “WEKA: a machine learning
Learning Techniques,” Int. J. Sci. Res. Comput. Sci. Eng., vol. workbench”, Proceedings of Second Australian and New
5, no. 3, pp. 92–97, 2017. Zealand conferences on Intelligent Information System, 1994.
[5] C. E. Sapp, “Preparing and Architecting for Machine Learning,” [31] https://www.predictiveanalyticstoday.com/knime/
2017. [32] https://rapidminer.com/products/studio/feature-list/
[6] https://www.gartner.com/it-glossary/big-data/ [33] https://orange.biolab.si/#Orange-Features
[7] https://www.quora.com/How-are-big-data-and-machine- [34] http://126kr.com/article/yucgkiovd
learning-related [35] S¨orenSonnenburg et.al, “The SHOGUN Machine Learning
[8] Rakesh. S.Shirsath, VaibhavA.Desale, Amol. D.Potgantwar, Toolbox”, Journal of Machine Learning Research 11 (2010) ,
"Big Data Analytical Architecture for Real-Time Applications", 1799-1802.
International Journal of Scientific Research in Network Security [36] https://mahout.apache.org/docs/latest/index.html
and Communication, Vol.5, Issue.4, pp.1-8, 2017 [37] http://cloudacademy.com/blog/aws-machine-learning/
[9]https://www.forbes.com/sites/ciocentral/2018/02/28/gartner [38] https://www.microsoft.com/en-us/research/blog/microsoft-open-
-magic-quadrant-whos-winning-in-the-data-machine- sources-distributed-machine-learning-toolkit-for-more-efficient-
learning-space/#3995d9407dab big-data-research/
[10] https://www.sas.com/en_us/insights/analytics/machine- [39] https://www.predictiveanalyticstoday.com/microsoft-azure-
learning.html machine-learning/
[11] https://towardsdatascience.com/gui-fying-the-machine-learning- [40] https://spark.apache.org/mllib/
workflow-towards-rapid-discovery-of-viable-pipelines-
cab2552c909f
[12] https://machinelearningmastery.com/machine-learning-tools/

© 2018, IJCSE All Rights Reserved 575

View publication stats

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy