DSTK - Data Science ToolKits


DSTK - Data Science Toolkit 3 is a set of data and text mining softwares, following the CRISP DM model. DSTK offers data understanding using statistical and text analysis, data preparation using normalization and text processing, modeling and evaluation for machine learning and statistical learning algorithms. It is based on the old version DSTK at https://sourceforge.net/projects/dstk2/ ChartPlotter is a New Addition to the DSTK softwares, and it allows you to build interactive Plotly JS charts and dashboards in minutes, using only mouse clicks. DSTK Studio allows you to build recommendation and prediction data products.

DSTK - Data Science Toolkit 3 is

DSTK 3 consists of DSTK Engine, DSTK ScriptWriter, DSTK Studio, DSTK Text Explorer, and DSTK ChartPlotter. DSTK Engine is R simplified, focusing on Data Mining. DSTK ScriptWriter offers GUI to write script for DSTK Engine. DSTK Studio offers SPSS Statistics like GUI for data mining, DSTK Text Explorer offers GUI for Text Mining, and DSTK Chart Plotter offers GUI for data visualizations.

DSTK Engine and DSTK ScriptWriter are free of charge and have been uploaded to Sourceforge.net They are under GNU GPL License. For commercial license, please Contact Us.



DSTK Studio, Text Explorer, and Chart Plotter, however, requires a small fee of $59 usd to help support us. A demo version of DSTK Studio and DSTK Text Explorer is included in DSTK 3 package, but you can only use them 10 times.



DSTK 3 is written in C# and Java. You need Microsoft .Net framework and Java runtime to run the softwares. DSTK descriptives statistics, inferential statistics, and regressions has been checked and validated with R. Machine learning will be tested in future and DSTK is in beta version, hence, may have some bugs and do not have warranty.



License: DSTK Engine uses WordNet, MIT JWI, GATE's Gazetteers, Stanford NLP POS Tagger, Stanford NLP Classifier, Harvard University Inquirer Sentiments Data, Porter Stemmer C# library, RPortable, Math.Net Numerics, and etc. Each have their own licenses and are included in DSTK Engine distribution. DSTK ChartPlotter is an interface to Plotly JS. DSTK Studio, DSTK Text Explorer, DSTK ScriptWriter are Standalone softwares providing easy to use GUI to write script for DSTK Engine. DSTK 3 has no warranty, but we will take feedbacks.





Download DSTK Data Science ToolKit at:

Download Data Science TooKit 3

Buy Me a Coffee




Download DSTK ChartPlotter at:

Download




Purchase DSTK Studio, DSTK Text Explorer, DSTK ChartPlotter:

USD $59

Save $30.99 Was $99


FEATURES



Data Understanding using Statistical Analysis

1. Descriptives (mean, median, variance, standard deviations, ...)
2. Inferential (T-Test, Chi Square ...)
3. Regression (Simple Linear)
... And interface with R and Python, ...

Data Understanding using Data Visualizations

1. Histogram
2. Scatter Plots
3. Box Plots
4. Plotly Interactive Charts and Dashboards with ChartPlotter
... And more...

Data Preparation

1. Log Transform
2. Feature Scaling
3. Standard Score
4. Remove Missing Values
... And more...

Modeling and Evaluation

1. Neural Network (in future, Deep Neural Network)
2. Naive Bayes
3. KNN
4. Linear Regression
5. Bags of Words

Text Mining and Analysis

1. Text Preprocessing (stopwords, porter stemmer, regular expressions, ...)
2. POS Tagging, Name Entity, Word Net
3. Sentiment Analysis
4. Text Link Analysis
5. Text Classification (Naive Bayes, NN, ...)
... And more with gazetteers from GATE...

Plugins

1. Expand features with R Scripts...
2. Included plugins for Big Data Analysis using Microsoft Azure...





Download DSTK Data Science ToolKit at:

Download Data Science TooKit 3

Buy Me a Coffee




Download DSTK ChartPlotter at:

Download




Purchase DSTK Studio, DSTK Text Explorer, DSTK ChartPlotter:

USD $59

Save $30.99 Was $99


Screenshots






Free DSTK Books and Courses with Certifications


DSTK 3 Book


We have develop our own Data and Text Mining software at DSTK.Tech. This technical book aim to equip the reader with Data and Text Mining fundamentals in a fast and practical way using our DSTK - Data Science ToolKit 3 software. There will be many examples and explanations that are straight to the point.

Contents
1. Introduction
2. Getting Started
3. DSTK ScripWriter Essentials
4. DSTK Studio Essentials
5. DSTK Text Explorer Essentials
6. Conclusion

Now Free.

Get Now for Free »



Introduction to Data and Text Mining with DSTK 3 Course


Have you ever wanted to learn data and text mining? Data Science is a very hot trend now. This FREE course will equip you with the fundamentals of data and text mining knowledge, with the use of our own DSTK - Data Science Toolkit 3.

View Course »

About Us


DSTK Tech is part of SVBook. Our main goal is to create useful data science technology for practitioners in both academia and business to reach fast conclusions for data science and analysis before going into deeper tools like SPSS Statistics. DSTK was designed with the user in mind, using SPSS and Excel like interface to reduce the learning curves. DSTK Engine and DSTK ScriptWriter are free of charge and have been uploaded to Sourceforge.net. DSTK Studio and DSTK Text Explorer require a small fee of 59 usd to support us.

Sponsors Us


$5 a month

Support me with a monthly donation and let me continue my activities. It means a lot to me.


GitHub »

$10 a month

Become a Diamond Sponsor with a monthly donation of $10 and get your name on readme.md on GitHub.


GitHub »

$20 a month

Become a Ruby Sponsor with a monthly donation of $20 and get your name on readme.md on GitHub and http://DSTK.Tech.


GitHub »

$50 a month

Become a Sapphire Sponsor with a monthly donation of $20 and get your name on readme.md on GitHub and logo on http://DSTK.Tech.


GitHub »

$100 a month

Become an Emerald Sponsor with a monthly donation of $100 and get your name on readme.md on GitHub and logo on http://DSTK.Tech and 28 courses FREE on http://EMHAcademy.com


GitHub »

Sponsors


Use In


Download at these places


About The Founder...






Mr. Eric M. H. Goh
Eric Goh is a data scientist, software engineer, adjunct faculty and entrepreneur with years of experiences in multiple industries. His varied career includes data science, data and text mining, natural language processing, machine learning, intelligent system development, and engineering product design. He founded SVBook sole proprietorship in 2016 and reregister to SVBook Pte. Ltd. in 2018, because of contract requirements, and extended it with DSTK.Tech and EMHAcademy.com. DSTK.Tech is where Eric develops his own DSTK data science softwares (public version) and uploaded at sourceforge and github. Eric also published “Learn R for Applied Statistics” at Apress, and published some books at LeanPub and SVBook Pte. Ltd. He teaches the content at Udemy and EMHAcademy.com, and developed 28 courses, 1 E-Diploma, 7 advanced certificates. Eric is also an adjunct faculty at Universities and Institutions, which is a consultancy from EMHAcademy.com.

Eric Goh has been leading his teams for various industrial projects, including the advanced product code classification system project which automates Singapore Custom’s trade facilitation process, and Nanyang Technological University's data science and ranking projects where he develop his own DSTK data science software (NTU version) for QS ranking project, JATI for QS mobile apps screenshots to text project and JAVT for Convocation videos to text project. Eric wrote some guides to use these softwares for NTU projects - DSTK, JAVT, JATI . While in NTU, from 2015 to 2018, NTU ranking did not fall even when NUS ranking fall - First Year, Second Year, Third Year. In second year, NTU data increases, hence, research and development of DSTK (NTU Version) software, and management reduced the data for third year. DSTK (public version) is at DSTK.Tech, DSTK is developed in 2017 and is similar to the QuillEdit software developed in 2006. He has years of experience in C#, Java, C/C++, SPSS Statistics and Modeller, SAS Enterprise Miner, R, Python, Excel, Excel VBA and etc. He won Tan Kah Kee Young Inventors' Merit Award and Shortlisted Entry for TelR Data Mining Challenge.

Eric holds a Masters of Technology degree in Knowledge Engineering (Machine Learning) from the National University of Singapore (NUS) (download opencert file and put here. What is OpenCert.... ) in 2013. Eric also possessed Executive Master of Business Administration (MBA) degree (click here) from IGNOU (http://ignou.ac.in) and Executive Certificate in Global IT Management from U21Global (currently GlobalNxt) in 2012, where he's delayed and left exams while studying in NUS. He has a Graduate Diploma in Mechatronics from A*STAR SIMTech (a national research institute located in Nanyang Technological University) in 2011, which he completed while working in SUTD. He has Coursera Specialization Certificate in Business Statistics and Analysis (Excel) from Rice University in 2017, IBM Data Science Professional Certificate (Python, SQL) in 2018, and Coursera Verified Certificate in R Programming from Johns Hopkins University in 2017, More Data Mining with Weka Certificate from University of Waikato, Coursera Verified Certificate in Data Visualization and Communication with Tableau from Duke University in 2016, Coursera Verified Certificate in Internet of Things and Augmented Reality Emerging Technologies from Yonsei University in 2017. Eric continuous upgrade himself using Coursera courses after 2013, which are University courses. He possessed a Bachelor of Science degree in Computing from the University of Portsmouth in 2010, which he completed while in National Service (2008 to 2010). He holds a Diploma with Merit in Electronics and Computer Engineering from Ngee Ann Polytechnic in 2008. He is also an AIIM Certified Business Process Management Master (BPMM) in 2011, GSTF certified Big Data Science Analyst (CBDSA), IES Certified Lecturer in 2016, and holds the Coursera Verified Certificate in University Teaching from The University of Hong Kong in 2019, and University of Wisconsin-Madison Fundamentals of Online Teaching certificate in 2016.

During free time, Eric holds a Career Diploma in Graphic Design with Honors from Ashworth College in 2017, and has used Solidworks, Inventor in Singapore University of Technology and Design. He knows Blender. He has a National Survival Swimming Bronze Award Certificate in 1997 and is PADI Advanced SCUBA Diver in 2016. He learnt cooking since young and has Udemy Certificate in "China Table: Learn to Cook Traditional Chinese Cuisine" in 2016. He has the "Stanford Introduction to Food and Health" Certificate in 2021 and the Alison "Growing Organic Food Sustainably" certificate in 2021. Eric start AsianEasternRecipes.Club in 2019. Asian Eastern Recipes Club advertise hawkers, restaurants. When you first reached Singapore Airport, you can find Singapore food at Changi Airport. You can go to Bedok and find Singapore food at Bedok Hawker Center. You can then go to the city and tour and find Singapore food. When you go back to your country, you can cook Singapore food using Asian Eastern Recipes Club recipes. Evidence 1 Evidence 2

World Education Services, WES Evaluation converts educational credentials from any country in the world into US equivalent. WES stated official transcript from University is required for evaluation. Required Documents for MBA . WES Official Badge (See Evidence). How significance is WES report? WES report is used for migration to Canada (evidence). From Polytechnic to University, Eric also do not have all As, straight As, or perfect grades. This is normal.

UK NARIC Evaluation converts educational credentials from any country in the world into UK equivalent. UK NARIC Report

More...


Request Information






Contact

Question?

Singapore