What is serialization and how does it work?


Serialization is the process of converting a data object (a combination of code and data represented within a region of data storage) into a series of bytes that saves the state of the object in an easily transmittable form. In this serialized form, the data can be delivered to another data store (such as an in-memory computing platform), an application, or some other destination.

Serialization Diagram — Data serialization is the process of converting an object into a stream of bytes to more easily save or transmit it.

The reverse process, constructing a data structure or object from a series of bytes, is deserialization. Deserialization recreates the object, making the data easier to read and modify as a native structure in a programming language.

Serialization-Deserialization Diagram — Serialization and deserialization work together to transform/recreate data objects to/from a portable format.
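
As a minimal sketch of this round trip, the Python example below serializes an object to bytes and rebuilds it on the other side. The `Order` class, its fields, and the choice of JSON encoded as UTF-8 are assumptions made for illustration; they are not part of any particular product's API.

```python
import json
from dataclasses import dataclass, asdict


@dataclass
class Order:
    """Hypothetical object whose state we want to save or transmit."""
    order_id: int
    customer: str
    items: list


def serialize(order: Order) -> bytes:
    # Object -> dict -> JSON text -> bytes that can be stored or sent.
    return json.dumps(asdict(order)).encode("utf-8")


def deserialize(payload: bytes) -> Order:
    # Bytes -> JSON text -> dict -> a new, equivalent object.
    return Order(**json.loads(payload.decode("utf-8")))


original = Order(order_id=42, customer="Ada", items=["pizza", "soda"])
payload = serialize(original)    # portable series of bytes
restored = deserialize(payload)  # native object again
```

The restored object is a separate instance that carries the same state as the original, which is what allows it to be recreated in a new location.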

Serialization enables us to save the state of an object and recreate the object in a new location. Serialization encompasses both the storage of the object and the exchange of data. Because an object is composed of several components, saving or delivering each part by hand typically requires significant coding effort; serialization provides a standard way to capture the whole object in a shareable format.

With serialization, we can transfer objects:

Over the wire for messaging use cases
From application to application via web services such as REST APIs
Through firewalls (as JSON or XML strings)
Across domains
To other data stores
To identify changes in data over time
While honoring security and user-specific details across applications

A number of popular programming languages and platforms either support serialization natively or have libraries that add serialization to their feature set. Java, .NET, C++, Node.js, Python, and Go, for example, all either have native serialization support or integrate with serializer libraries.
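
As one illustration of native support, Python's standard library includes the `pickle` module. The sketch below uses it to round-trip a structure containing a set and a timestamp, which plain JSON cannot represent directly. The `session` data is made up for the example.

```python
import pickle
from datetime import datetime

# Hypothetical application state containing types JSON has no direct form for.
session = {
    "user": "ada",
    "roles": {"admin", "editor"},          # a set
    "created": datetime(2024, 1, 1, 12),   # a timestamp
}

blob = pickle.dumps(session)   # native Python serialization to bytes
clone = pickle.loads(blob)     # rebuild the structure from those bytes

# Note: pickle is specific to Python and must only be used to load
# data from trusted sources, since loading can execute arbitrary code.
```

A language-specific format like this is convenient within one ecosystem; exchanging data between different languages usually calls for a neutral format instead.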

Data formats such as JSON and XML are often used as the format for storing serialized data. Custom binary formats are also used, which tend to be more space-efficient because the serialized output contains less markup and tagging.
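
To make the size difference concrete, here is a small sketch that encodes the same record as JSON and as a fixed binary layout using Python's `struct` module. The `reading` record and the layout (a 32-bit unsigned integer, a 64-bit float, and a boolean) are assumptions chosen for the example, not a standard format.

```python
import json
import struct

# Hypothetical record to serialize.
reading = {"sensor_id": 7, "temperature": 21.5, "ok": True}

# Text format: field names and punctuation travel with every record.
json_bytes = json.dumps(reading).encode("utf-8")

# Custom binary format: little-endian, no padding.
# I = unsigned 32-bit int, d = 64-bit float, ? = bool
LAYOUT = struct.Struct("<Id?")
binary_bytes = LAYOUT.pack(
    reading["sensor_id"], reading["temperature"], reading["ok"]
)

# Reading the binary form back requires knowing the layout in advance.
sensor_id, temperature, ok = LAYOUT.unpack(binary_bytes)

print(len(json_bytes), "bytes as JSON vs", len(binary_bytes), "bytes as binary")
```

The binary form is smaller because it omits the field names, but the reader must already know the layout, whereas the JSON form describes itself.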

Big data systems often include technologies/data that are described as “schemaless.” This means that the managed data in these systems are not structured in a strict format, as defined by a schema. Serialization provides several benefits in this type of environment:

Structure. By inserting some schema or criteria for a data structure through serialization on read, we can avoid reading data that misses mandatory fields, is incorrectly classified, or lacks some other quality control requirement.
Portability. Big data comes from a variety of systems and may be written in a variety of languages. Serialization can provide the necessary uniformity to transfer such data to other enterprise systems or applications.
Versioning. Big data is constantly changing. Serialization allows us to apply version numbers to objects for lifecycle management.
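
The structure and versioning points above can be sketched as follows. This is one possible approach under stated assumptions: the envelope with a `version` number, the required fields `id` and `name`, and the rename from `full_name` in version 1 are all hypothetical and exist only for illustration.

```python
import json

# Hypothetical schema applied on read: field name -> expected type.
REQUIRED_FIELDS = {"id": int, "name": str}
CURRENT_VERSION = 2


def serialize(record: dict) -> bytes:
    # Wrap the record in an envelope that carries a version number.
    envelope = {"version": CURRENT_VERSION, "data": record}
    return json.dumps(envelope).encode("utf-8")


def deserialize(payload: bytes) -> dict:
    envelope = json.loads(payload.decode("utf-8"))
    version = envelope.get("version")
    if version not in (1, CURRENT_VERSION):
        raise ValueError(f"unsupported version: {version!r}")

    data = dict(envelope["data"])
    if version == 1:
        # Hypothetical migration: version 1 called the field "full_name".
        data["name"] = data.pop("full_name", None)

    # Reject records that miss mandatory fields or have the wrong type.
    for field, expected_type in REQUIRED_FIELDS.items():
        if not isinstance(data.get(field), expected_type):
            raise ValueError(f"missing or invalid field: {field}")
    return data


current = deserialize(serialize({"id": 1, "name": "Ada"}))
legacy_payload = json.dumps(
    {"version": 1, "data": {"id": 2, "full_name": "Grace"}}
).encode("utf-8")
legacy = deserialize(legacy_payload)
```

Records written under the older version are upgraded as they are read, and records that fail the checks are rejected before they reach the rest of the application.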
