Is there any sample code to split a json file into smaller chunks?
text processing - Split a JSON array into multiple files - Unix & Linux Stack Exchange
python - Split a large json file into multiple smaller files - Stack Overflow
posix - Split JSON array into separate files/objects - Stack Overflow
From this SO thread:
jq -cr 'keys[] as $k | "\($k)\n\(.[$k])"' input.json | while read -r key; do
  fname=$(jq --raw-output ".[$key].billingAccountNumber" input.json)
  read -r item
  printf '%s\n' "$item" > "./$fname"
done
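For comparison, the same object-to-files split can be sketched in plain Python. This is a sketch only: the inline sample data and the ".json" suffix on the output names are assumptions (the jq pipeline above writes bare billingAccountNumber file names).

```python
import json

# Inline stand-in for input.json (assumed shape: a top-level object
# whose values each carry a billingAccountNumber field)
sample = {
    "first": {"billingAccountNumber": "12345", "amount": 10},
    "second": {"billingAccountNumber": "67890", "amount": 20},
}
with open("input.json", "w") as f:
    json.dump(sample, f)

with open("input.json") as f:
    data = json.load(f)

# Write each value to a file named after its billingAccountNumber
for item in data.values():
    with open(str(item["billingAccountNumber"]) + ".json", "w") as out:
        json.dump(item, out)
```

Unlike the shell version, this reads the input only once instead of re-running jq per key.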
Give this code a try; it saves each array element as 0.json, 1.json, 2.json, and so on, and works for any number of items in the JSON array.
for i in $(seq 1 $(jq '. | length' sample.json)); do
  j=$(expr $i - 1)
  jq ".[$j]" sample.json > "$j.json"
done
Explanation:
The line below finds the length of the array, which drives the numbering of the output files:
$(jq '. | length' sample.json)
Since array indices start at 0, let's adjust the output file name:
j=$(expr $i - 1)
The line below fetches one element from the JSON document and saves it to a file:
jq ".[$j]" sample.json > "$j.json"
To save the elements literally as x.json, y.json and z.json instead:
count=0
for i in x y z; do
  jq ".[$count]" sample.json > "$i.json"
  count=$(expr $count + 1)
done
Use this at the Linux command prompt:
split -b 53750k <your-file>
cat xa* > <your-file>
Note that split -b cuts at arbitrary byte boundaries, so the individual pieces are generally not valid JSON on their own; the cat line reassembles the original file.
Refer to this link: https://askubuntu.com/questions/28847/text-editor-to-edit-large-4-3-gb-plain-text-file
Whether Python or Node is better for this task is a matter of opinion, and we are not allowed to voice opinions on Stack Overflow. You have to decide for yourself which one you have more experience with and would rather work in.
If you go with Node, there are modules that do streaming JSON parsing and can help with this task, e.g.:
- https://www.npmjs.com/package/JSONStream
- https://www.npmjs.com/package/stream-json
- https://www.npmjs.com/package/json-stream
If you go with Python, there are streaming JSON parsers here as well:
- https://github.com/kashifrazzaqui/json-streamer
- https://github.com/danielyule/naya
- http://www.enricozini.org/blog/2011/tips/python-stream-json/
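Each of the libraries above has its own API; as a library-free illustration of the streaming idea, here is a sketch using only the stdlib json.JSONDecoder. It assumes the input is a stream of concatenated or newline-delimited JSON values (not one giant wrapping array), and the function name and demo file are invented for this example:

```python
import json

def iter_json_values(path, bufsize=65536):
    """Yield top-level JSON values from a file one at a time,
    decoding from a rolling buffer instead of loading everything
    into a single Python object first."""
    decoder = json.JSONDecoder()
    buf = ""
    with open(path) as f:
        while True:
            chunk = f.read(bufsize)
            buf += chunk
            while True:
                buf = buf.lstrip()
                if not buf:
                    break
                try:
                    obj, end = decoder.raw_decode(buf)
                except ValueError:
                    break  # incomplete value; wait for more data
                yield obj
                buf = buf[end:]
            if not chunk:
                break

# Demo: two newline-delimited records
with open("records.json", "w") as f:
    f.write('{"a": 1}\n{"a": 2}\n')

print([v["a"] for v in iter_json_values("records.json")])  # -> [1, 2]
```

Memory use is bounded by the largest single value plus the buffer, which is the property that matters for multi-gigabyte inputs.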
To split a JSON file with many records into chunks of a desired size, I simply use:
jq -c '.[0:1000]' mybig.json
which works like Python slicing.
See the docs here: https://stedolan.github.io/jq/manual/
Array/String Slice: .[10:15]
The .[10:15] syntax can be used to return a subarray of an array or substring of a string. The array returned by .[10:15] will be of length 5, containing the elements from index 10 (inclusive) to index 15 (exclusive). Either index may be negative (in which case it counts backwards from the end of the array), or omitted (in which case it refers to the start or end of the array).
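The same slicing idea turns into a complete chunker in a few lines of Python. A sketch: the 2500-record inline sample, the 1000-record chunk size, and the mybig_N.json naming are all illustrative choices.

```python
import json

# Inline stand-in for mybig.json: a top-level array of 2500 records
with open("mybig.json", "w") as f:
    json.dump([{"n": i} for i in range(2500)], f)

with open("mybig.json") as f:
    data = json.load(f)

chunk_size = 1000
# Like jq's .[0:1000], .[1000:2000], ...: one output file per slice
for start in range(0, len(data), chunk_size):
    with open(f"mybig_{start // chunk_size}.json", "w") as out:
        json.dump(data[start:start + chunk_size], out)
```

This produces mybig_0.json and mybig_1.json with 1000 records each and mybig_2.json with the remaining 500, since Python slices, like jq's, are clamped at the end of the array.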
Using jq, one can split an array into its components using the filter:
.[]
The question then becomes what is to be done with each component. If you want to direct each component to a separate file, you could (for example) use jq with the -c option, and filter the result into awk, which can then allocate the components to different files. See e.g. Split JSON File Objects Into Multiple Files
Performance considerations
One might think that the overhead of calling jq+awk would be high compared to calling python, but both jq and awk are lightweight compared to python+json, as suggested by these timings (using Python 2.7.10):
time (jq -c '.[]' input.json | awk '{print > "doc00" NR ".json";}')
user 0m0.005s
sys 0m0.008s
time python split.py
user 0m0.016s
sys 0m0.046s
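The split.py used in the timing above isn't shown; a minimal stand-in that matches the jq+awk pipeline's doc00N.json naming might look like the following (the inline sample data is an assumption added so the snippet runs on its own):

```python
import json

# Inline stand-in for input.json: a small top-level array
with open("input.json", "w") as f:
    json.dump([{"k": "a"}, {"k": "b"}], f)

with open("input.json") as f:
    docs = json.load(f)

# Mirror awk's NR-based naming: doc001.json, doc002.json, ...
for nr, doc in enumerate(docs, start=1):
    with open(f"doc00{nr}.json", "w") as out:
        json.dump(doc, out)
```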
I haven't had much to do with json files so far, but now I need them for datahoarding. Now I have a 150GB json file here and can't open it because there's not enough free HDD & RAM on my laptop and computers.
So I have to split the file into several pieces (preferably 1 GB each) and open and view them one after the other. How can I do this on Windows?
Google spits out ancient results (and mostly for Linux) which, as usual, have contradictory information.
Maybe it is possible with a Python script. I don't care if the first and last lines of each new file look slightly different from the original JSON file.
Here is a Python solution to your problem.
Don't forget to change in_file_path to the location of your big JSON file.
import json

in_file_path = 'path/to/file.json'  # Change me!

with open(in_file_path, 'r') as in_json_file:
    # Read the file and parse it into a list of objects
    json_obj_list = json.load(in_json_file)

for json_obj in json_obj_list:
    filename = json_obj['_id'] + '.json'
    with open(filename, 'w') as out_json_file:
        # Save each object to its own file,
        # pretty-printed thanks to `indent=4`
        json.dump(json_obj, out_json_file, indent=4)
Side note: I ran this in Python 3; it should work in Python 2 as well.
I ran into this problem today as well, and did some research. Just want to share the resulting Python snippet that lets you also customise the length of split files (thanks to this slicing method).
import os
import json
from itertools import islice
def split_json(
    data_path,
    file_name,
    size_split=1000,
):
    """Split a big JSON file (a top-level object) into chunks.

    data_path : str, e.g. "data_folder"
    file_name : str, e.g. "data_file" (exclude ".json")
    """
    with open(os.path.join(data_path, file_name + ".json"), "r") as f:
        whole_file = json.load(f)
    split = len(whole_file) // size_split
    for i in range(split + 1):
        with open(os.path.join(data_path, file_name + "_" + str(split + 1) + "_" + str(i + 1) + ".json"), "w") as f:
            json.dump(dict(islice(whole_file.items(), i * size_split, (i + 1) * size_split)), f)
    return
Update: Then, when you need to combine them together again, use the following code:
json_all = dict()
split = 4  # the actual number of split files (1-based)
for i in range(1, split + 1):
    with open(os.path.join("data_folder", "data_file_" + str(split) + "_" + str(i) + ".json"), 'r') as f:
        json_i = json.load(f)
    json_all.update(json_i)
Hi!
I have a huge JSON file containing company data that I want to split into several smaller files based on their companyId. The JSON file looks like this:
[
    {
        "companyId": "123456789",
        "name": "Foobar Ltd.",
        // more company data
    },
    // etc.
]
Ideally, I want to split this based on the X first characters of companyId, so that I end up with companies that share the first part of their companyId in separate smaller files;
companyId 123456789 => 1234.json
companyId 234567890 => 2345.json
// etc.
I could write a Perl script to do this for me, but I was wondering if it's at all possible to do it with a one-liner without too much "outside of bash", if that makes sense, at least without having to rely on Perl, Python etc. The only progress I have made so far is this:
cat huge.json | jq '.[]' | jq '.companyId'
...which outputs the companyId, and I could probably get the X first characters from that, but where is the rest of the JSON record?
Thanks in advance!
EDIT: Specified that I don't want to use Perl (or similar tools), because I want to do this as "minimal" as possible.
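For what it's worth, if dropping down to a scripting language is acceptable after all, the prefix-based grouping described above is a short Python script. A sketch, assuming the array shape shown in the question and a 4-character prefix; the inline sample data is invented so the snippet is self-contained:

```python
import json
from collections import defaultdict

# Inline stand-in for huge.json: a top-level array of company objects
companies = [
    {"companyId": "123456789", "name": "Foobar Ltd."},
    {"companyId": "123499999", "name": "Foobaz Ltd."},
    {"companyId": "234567890", "name": "Quux Inc."},
]
with open("huge.json", "w") as f:
    json.dump(companies, f)

with open("huge.json") as f:
    data = json.load(f)

# Group records by the first 4 characters of companyId
groups = defaultdict(list)
for company in data:
    groups[company["companyId"][:4]].append(company)

# One file per prefix: 1234.json, 2345.json, ...
for prefix, records in groups.items():
    with open(prefix + ".json", "w") as out:
        json.dump(records, out)
```

Grouping all records for a prefix into one array per file keeps each output valid JSON, which a purely line-oriented shell split would not guarantee.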