Data Preprocessing And Visualization In C++

A working code example of how to implement basic machine learning functionality in C++

Data preprocessing is the process of converting raw data into a format a computer can work with; it is the first step in any machine learning operation. Data collection is usually loosely controlled and may result in out-of-range values. Data preparation and filtering steps can take a considerable amount of processing time.

Data preprocessing includes:

Reading Data from files.

Data cleaning.

Instance selection.

Data standardization.

Data transformation.

Feature extraction and selection.

The product of data preprocessing is the final training set. In this article, I will address some of the data preprocessing steps in C++, as well as data visualization using the Matplotlib-Cpp library.

This article is part of a series that addresses the implementation of machine learning algorithms in C++. Throughout this series, we will be using the Iris data set available here.

When Should You Learn Machine Learning using C++?

The 8 Books Each C++ Developer Must Read.

Data Preprocessing And Visualization In C++.

Machine Learning Data Manipulation Using C++.

Naive Bayes From Scratch using C++.

Linear Regression Implementation In C++.

Note that there are already libraries that can do this job easily, but the purpose of this series is to learn how to develop these algorithms from scratch. If you are interested in learning more about the ML libraries for C++, you can read this article:

In this article, I will use the Iris data set as an example of data on which we can perform each operation. Note also that I will be using C++11 in this tutorial.

Reading Data from Files:

After downloading the iris.data file from here, let's read the data with simple file-reading instructions and parse each type of data into a separate vector.

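#include <algorithm>
#include <fstream>
#include <iostream>
#include <sstream>
#include <string>
#include <vector>

// Numeric class labels. These constants are not defined in this listing,
// so the enum below is an assumption about how they are declared.
enum Iris_Class { Iris_setosa, Iris_versicolor, Iris_virginica, Iris_unkown };

// Reads iris.data and returns five columns, in this order:
// sepal length, sepal width, petal length, petal width, class label.
std::vector<std::vector<float>> Read_Iris_Dataset(void)
{
	std::ifstream myfile("iris.data");
	std::string line;
	std::vector<std::vector<float>> Iris_Dataset;
	std::vector<float> temp_sepal_len;
	std::vector<float> temp_sepal_wid;
	std::vector<float> temp_petal_len;
	std::vector<float> temp_petal_wid;
	std::vector<float> temp_iris_class;

	float sepal_len_f, sepal_wid_f, petal_len_f, petal_wid_f;
	float iris_class_f;
	std::string temp_string;

	if (myfile.is_open())
	{
		std::cout << "file opened successfully" << std::endl;
		while (std::getline(myfile, line)) {
			// "Iris-setosa" becomes "Iris_setosa"; commas become spaces
			// so that operator>> can split the fields.
			std::replace(line.begin(), line.end(), '-', '_');
			std::replace(line.begin(), line.end(), ',', ' ');

			std::istringstream iss(line);
			// Skip blank or malformed lines (for example a trailing empty
			// line) instead of storing stale values from the previous row.
			if (!(iss >> sepal_len_f >> sepal_wid_f >> petal_len_f >> petal_wid_f >> temp_string))
			{
				continue;
			}
			temp_sepal_len.push_back(sepal_len_f);
			temp_sepal_wid.push_back(sepal_wid_f);
			temp_petal_len.push_back(petal_len_f);
			temp_petal_wid.push_back(petal_wid_f);

			// Map the class name to its numeric label.
			if (temp_string.compare("Iris_setosa") == 0)
			{
				iris_class_f = Iris_setosa;
			}
			else if (temp_string.compare("Iris_versicolor") == 0)
			{
				iris_class_f = Iris_versicolor;
			}
			else if (temp_string.compare("Iris_virginica") == 0)
			{
				iris_class_f = Iris_virginica;
			}
			else
			{
				iris_class_f = Iris_unkown;
			}
			temp_iris_class.push_back(iris_class_f);
		}
		Iris_Dataset.push_back(temp_sepal_len);
		Iris_Dataset.push_back(temp_sepal_wid);
		Iris_Dataset.push_back(temp_petal_len);
		Iris_Dataset.push_back(temp_petal_wid);
		Iris_Dataset.push_back(temp_iris_class);
	}
	else
	{
		// An empty dataset is returned when the file cannot be opened.
		std::cout << "Unable to open file" << std::endl;
	}
	return Iris_Dataset;
}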

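In this code, we used std::ifstream to create a simple input stream from a file and std::istringstream to split each line into its fields. The returned dataset is organized by column: one vector per feature, followed by one vector of class labels.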
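To make the per-line parsing step easier to see in isolation, here is a minimal sketch of it as a standalone helper. The names Iris_Row and Parse_Iris_Line are hypothetical and are not part of the code above; the sketch assumes each line has four numeric fields followed by a class name, separated by commas.

```cpp
#include <algorithm>
#include <sstream>
#include <string>

// Hypothetical record type for one parsed row (not part of the original code).
struct Iris_Row {
    float sepal_len;
    float sepal_wid;
    float petal_len;
    float petal_wid;
    std::string label;
};

// Parses one comma-separated line into row.
// Returns false for blank or malformed lines.
bool Parse_Iris_Line(std::string line, Iris_Row& row)
{
    // Replace commas with spaces so operator>> can split the fields.
    std::replace(line.begin(), line.end(), ',', ' ');
    std::istringstream iss(line);
    // The stream evaluates to false if any of the five extractions fails.
    return static_cast<bool>(iss >> row.sepal_len >> row.sepal_wid
                                 >> row.petal_len >> row.petal_wid >> row.label);
}
```

Checking the return value lets the caller skip lines that do not parse, rather than storing values left over from the previous row.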

Data Visualization:

Data Cleaning:
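One simple form of cleaning is to drop rows that contain out-of-range values. The following is a minimal sketch under stated assumptions, not a definitive implementation: the function name Remove_Out_Of_Range_Rows and the range bounds are hypothetical, the dataset is assumed to be column-oriented with the class label in the last column, and all columns are assumed to have the same length.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Keeps only the rows whose feature values are finite and lie within
// [min_value, max_value]. dataset[c][r] is column c, row r.
// Assumption: the last column holds the class label and is not range-checked.
std::vector<std::vector<float>> Remove_Out_Of_Range_Rows(
    const std::vector<std::vector<float>>& dataset,
    float min_value, float max_value)
{
    std::vector<std::vector<float>> cleaned(dataset.size());
    if (dataset.empty()) return cleaned;

    const std::size_t rows = dataset[0].size();
    const std::size_t feature_cols = dataset.size() - 1;

    for (std::size_t r = 0; r < rows; ++r) {
        bool keep = true;
        for (std::size_t c = 0; c < feature_cols; ++c) {
            const float v = dataset[c][r];
            if (!std::isfinite(v) || v < min_value || v > max_value) {
                keep = false;
                break;
            }
        }
        if (!keep) continue;
        // Copy the whole row, label included, into the cleaned dataset.
        for (std::size_t c = 0; c < dataset.size(); ++c)
            cleaned[c].push_back(dataset[c][r]);
    }
    return cleaned;
}
```

A single range for all features keeps the sketch short; per-feature bounds would be a natural refinement.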

Data Standardization:
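A common way to standardize a feature is z-score scaling: subtract the column mean from each value and divide by the column standard deviation. The sketch below is one possible implementation and not necessarily the method used in this series; the function name Standardize is hypothetical, and it uses the population standard deviation (dividing by N).

```cpp
#include <cmath>
#include <vector>

// Returns a copy of column rescaled to zero mean and unit standard deviation.
// A constant column (standard deviation of zero) is mapped to all zeros.
std::vector<float> Standardize(const std::vector<float>& column)
{
    std::vector<float> result(column);
    if (column.empty()) return result;

    float mean = 0.0f;
    for (float v : column) mean += v;
    mean /= column.size();

    float variance = 0.0f;
    for (float v : column) variance += (v - mean) * (v - mean);
    variance /= column.size();  // population variance

    const float std_dev = std::sqrt(variance);
    for (float& v : result)
        v = (std_dev > 0.0f) ? (v - mean) / std_dev : 0.0f;
    return result;
}
```

With a column-oriented dataset, each feature column can be passed to this function on its own, while the class label column is left unchanged.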
