Member-only story

Top 5 Structured Data Formats for Data Science

Mori
9 min readJul 25, 2023

--

Photo by Alexander Sinn on Unsplash

Table of contents

Introduction

Welcome, data aficionados and fellow Pythonistas, to another blog post! Today, we embark on a thrilling quest to unravel the world of file formats in the realm of data science. We will explore the top 5 file formats that are essential for every data sorcerer out there!

Before we dive headlong into the world of file formats, let’s take a moment to appreciate the importance of choosing the right format. Imagine you’re venturing through a dark forest (not the internet kind, mind you) in search of hidden treasures. The path you choose can either lead you to a pot of gold or a barrel of pickle juice. Likewise, selecting the right file format can be the difference between smooth data sailing and a world of frustration.

So, without further ado, let’s unveil the magnificent five file formats that will undoubtedly make your data science adventures extraordinary:

  • CSV (Comma-Separated Values)
  • JSON (JavaScript Object Notation)
  • Parquet
  • Feather

--

--

Mori
Mori

Written by Mori

Date Scientist/Machine Learning Engineer | Passionate about solving real-world problems | PhD in Computer Science

No responses yet