JSON Lines

On The Web


Airbyte is an open-source data integration tool that uses JSON Lines to communicate between containerized source applications that pull data from files/APIs/databses and containerized destination applications that write data to warehouses.

Apache Spark uses JSONL for reading and writing JSON data.

ArangoDB is an open source multi-model database. The JSON lines format allows to import huge amounts of documents sequentially (via arangoimport).

BigQuery uses JSON Lines as one of the supported formats to load data into the database.

BKL is a layered templating configuration tool that supports JSON Lines input and output.

Bubbles supports JSON Lines datastores

ClickHouse is an open source column-oriented DBMS. It supports JSON lines as JSONEachRow format for input and output.

CSS HTML Validator for Windows v22.0211+ now supports JSON Lines syntax checking.

Dart uses JSON Lines as one of the possible reporters when running tests.

Dataflow kit is a web scraping open source framework written in Go. JSON Lines is one of the supported formats for storing results.

Go Standard library's json.Encoder will produce JSON lines by default. The decoder parses Concatenated JSON, which is compatible with, though less strict than, JSON lines

Golang JSONL library

Graylog GELF is format for log messages, their stream is de-facto JSON lines.

Kubernetes (k8s) is an open-source container orchestration system. It uses JSON Lines as format for auditing.

Logstash supports JSON Lines via the json_lines codec

Mattermost is an open-source, self-hostable online chat service. It uses JSON Lines as the format for bulk data migration on self-hosted instances.

Miller supports JSON Lines format as input.

NDJSON is a similar format that also allows blank lines

Neo4j the open-source graph database supports JSONL export and import via its standard library procedures apoc.export/import.json to allow stream processing of nodes and relationships.

petl is a general purpose Python package for extracting, transforming and loading tables of data. It allows importing and exporting documents/records between many databases and file formats, including JSON lines, in local and remote filesystems and clouds.

php-jsonl is a PHP library for reading & writing JSON Lines documents, taking advantage of the streaming benefits.

plot.ly uses JSON Lines for its streaming data API

pytest-reportlog is a pytest plugin which writes testing report data in JSON lines format

Rumble is a JSONiq engine that runs on top of Spark. It can process datasets in the JSON lines format that have billions of objects and more.

Scrapy is a framework for web scraping & crawling, it supports and recommends JSON lines since long -- it might've even coined the term.

serde-jsonlines is a Rust library for reading & writing JSON Lines documents.

Shopify GraphQL Bulk Operations API, designed for very large data exports from Shopify stores, returns results in the form of a JSONL file.