2018-225

Using Data Visualization to Inform Machine Learning Approaches

ERIC N. ZIELONKA and ALEKSANDR W. FRITZ

Machine learning on big data is a complicated task. Data visualization can reveal trends, anomalies, and patterns that help in selecting an appropriate machine learning approach to a problem. Using 2D visualizations, we displayed flight data on interactive maps that show density and property changes within an area. We also used frequency histograms of the quantitative properties of each point to look for trends, and scatterplots revealed anomalies in data collection. Other plots confirmed previously found trends and our initial thoughts about the data. These visualizations informed the machine learning approach to our problem and helped us avoid major pitfalls further down the road.
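
As a rough illustration of the frequency-histogram step, the following minimal sketch bins one quantitative property and prints a text bar chart. It is not the authors' implementation: the property (altitude in feet), the sample readings, the bin width, and the function names `frequency_histogram` and `render` are all assumptions made up for this example.

```python
from collections import Counter


def frequency_histogram(values, bin_width):
    """Count how many values fall into each bin of width `bin_width`.

    Each bin is keyed by its lower edge.
    """
    counts = Counter((v // bin_width) * bin_width for v in values)
    return dict(sorted(counts.items()))


def render(hist, bin_width):
    """Return a text bar chart with one line per non-empty bin."""
    return "\n".join(
        f"{start:>7.0f} to {start + bin_width:<7.0f} {'#' * n}"
        for start, n in hist.items()
    )


# Hypothetical altitude readings in feet, invented for illustration only.
# The negative value stands in for a possible data-collection anomaly.
altitudes_ft = [31000, 32500, 33000, 34800, 35000,
                35200, 36000, 37900, 38000, -1000]

hist = frequency_histogram(altitudes_ft, bin_width=5000)
print(render(hist, 5000))
```

In a chart like this, a sparsely populated bin far from the bulk of the data (here, the one holding the negative reading) is the kind of feature that might prompt a closer look before a learning approach is chosen.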