Update: I wrote a solution that does not require reading the entire file in one go. It is too big for a stackoverflow answer, but can be found here jsonstream.
You can use json.JSONDecoder.raw_decode to decode arbitrarily big strings of "stacked" JSON (so long as they can fit in memory). raw_decode stops once it has a valid object and returns the position just past the end of the parsed object. It is poorly documented [1] (see footer), but you can pass this position back to raw_decode and it starts parsing again from that position. Unfortunately, the Python json module does not accept strings that have leading whitespace. So we need to search to find the first non-whitespace part of your document.
from json import JSONDecoder, JSONDecodeError
import re
NOT_WHITESPACE = re.compile(r'\S')
def decode_stacked(document, idx=0, decoder=JSONDecoder()):
    while True:
        match = NOT_WHITESPACE.search(document, idx)
        if not match:
            return
        idx = match.start()
        try:
            obj, idx = decoder.raw_decode(document, idx)
        except JSONDecodeError:
            # do something sensible if there's some error
            raise
        yield obj
s = """
{"a": 1}
[
1
,
2
]
"""
for obj in decode_stacked(s):
    print(obj)
prints:
{'a': 1}
[1, 2]
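The update above links to jsonstream for the file-based case. As a rough, hedged illustration of that chunked approach (this is my own minimal sketch, not the actual jsonstream implementation; decode_stacked_file is a hypothetical name), the same raw_decode trick works on a buffer that is refilled from the file a chunk at a time:

```python
from json import JSONDecoder, JSONDecodeError
import io
import re

NOT_WHITESPACE = re.compile(r'\S')

def decode_stacked_file(fp, chunk_size=1024, decoder=JSONDecoder()):
    """Yield stacked JSON objects from a text file without loading it all at once."""
    buf = ''
    for chunk in iter(lambda: fp.read(chunk_size), ''):
        buf += chunk
        while True:
            match = NOT_WHITESPACE.search(buf)
            if not match:
                buf = ''
                break
            buf = buf[match.start():]
            try:
                obj, end = decoder.raw_decode(buf)
            except JSONDecodeError:
                # Possibly an object truncated at the chunk boundary;
                # keep the remainder and wait for more data.
                break
            buf = buf[end:]
            yield obj

for obj in decode_stacked_file(io.StringIO('{"a": 1}\n[1, 2]\n'), chunk_size=4):
    print(obj)
```

Note the sketch assumes the input is well formed: a genuinely malformed document would make it buffer forever rather than raise.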
Note About Missing Documentation
The current signature of raw_decode() dates from 2009, when simplejson was ported into the standard library. The documentation for raw_decode() in simplejson mentions an optional idx argument that can be used to start parsing at an offset. Given that the signature of raw_decode() has not changed since 2009, I think it is fair to assume the API is fairly stable, especially as decode() itself uses the idx argument of raw_decode() to ignore leading whitespace when parsing a string, which is exactly what this answer uses it for too. (Answer from Dunes on Stack Overflow.) The documentation of raw_decode() in simplejson is:
raw_decode(s[, idx=0])
Decode a JSON document from s (a str or unicode beginning with a JSON document) starting from the index idx and return a 2-tuple of the Python representation and the index in s where the document ended. This can be used to decode a JSON document from a string that may have extraneous data at the end, or to decode a string that has a series of JSON objects.
JSONDecodeError will be raised if the given JSON document is not valid.
Use a json array, in the format:
[
{"ID":"12345","Timestamp":"20140101", "Usefulness":"Yes",
"Code":[{"event1":"A","result":"1"},…]},
{"ID":"1A35B","Timestamp":"20140102", "Usefulness":"No",
"Code":[{"event1":"B","result":"1"},…]},
{"ID":"AA356","Timestamp":"20140103", "Usefulness":"No",
"Code":[{"event1":"B","result":"0"},…]},
...
]
Then load it in your Python code:

import json

with open('file.json') as json_file:
    data = json.load(json_file)
Now the content of data is a list of dictionaries, one for each of the elements.
You can access it easily, e.g.:

data[0]["ID"]
I believe your JSON file is syntactically invalid. See www.json.org. Your file should contain a single object or array, e.g. in your case it should look like this:
[{"A":"something1","B":"something2","C":"something3","D":"something4"},
{"A":"something5","B":"something6","C":"something7","D":"something8"},
{"A":"something9","B":"something10","C":"something11","D":"something12"}]
Then you can access each object of the array in your loop:
for (Json::Value::ArrayIndex i = 0; i != root.size(); i++)
{
    std::string A = root[i].get("A", "ASCII").asString();
    // etc.
}
Here is a solution to the question, pretending there are newlines between each object (and no line is blank or malformed):

// Very simple jsoncpp test
#include <json/json.h>
#include <iostream>
#include <fstream>
#include <string>

using namespace std;

int main(int argc, char *argv[])
{
    Json::Value root;
    Json::Reader reader;
    ifstream test("sample.json", ifstream::binary);
    string cur_line;
    bool success = true;
    while (success && getline(test, cur_line)) {
        cout << "Parse line: " << cur_line << endl;
        success = reader.parse(cur_line, root, false);
        if (success)
            cout << root << endl;
    }
    cout << "Done" << endl;
}
Hey, I am new to programming and I am trying to decode thousands of JSON files.
Usually there is one object in each JSON file, but for some reason a lot of my files have multiple JSON objects. Some have up to 5 objects.
{
"testNumber": "test200",
"device": {
"deviceID": 4000008
},
"user": {
"userID": "4121412"
}
}
{
"testNumber": "test201",
"device": {
"deviceID": 4000009
},
"user": {
"userID": "4121232"
}
}

My code gives me the error: json.decoder.JSONDecodeError: Extra data: line 2 column 1
Because of that I am using except ValueError but I would like to get the data out of these JSON files.
import json
import os

test_dir = r'C:\Users\path\path'

for file in os.listdir(test_dir):
    if 'testNumber' in file:
        try:
            data = json.load(open(test_dir + '\\' + file, 'r'))
            print("valid")
        except ValueError:
            print("Decoding JSON has failed")

Since json.loads and json.load don't work: is there any other way to open the JSON file so that I can try to split the content into 2 objects?
I think the problem is that you are overwriting the file with fs.writeFileSync().
You should use fs.appendFileSync() to add new data to the end of the file. See the node docs.
https://nodejs.org/api/fs.html#fs_fs_appendfilesync_file_data_options
If you are writing all data at once, then you need to create an array, push all the objects to the array and write the array to the file:
function insertDatasJson (res) {
    let fs = require('fs');
    let base = require('../public/json/template.json');
    let result = [];
    for (/* your loop statement */) {
        let obj = JSON.parse(JSON.stringify(base)); // or your preferred way of deep copying
        obj.Subject = 'f';
        obj.Body.Content = 'e';
        obj.Start.DateTime = '2016-11-13T08:30:00';
        obj.End.DateTime = '2016-11-13T17:30:00';
        result.push(obj);
    }
    fs.writeFileSync('./public/json/output/jsonOutput.json', JSON.stringify(result, null, 4));
}
Or if you want to write data in multiple runs, then
function insertDatasJson (res) {
    let fs = require('fs');
    let base = require('../public/json/template.json');
    let data = require('./public/json/output/jsonOutput.json');
    base.Subject = 'f';
    base.Body.Content = 'e';
    base.Start.DateTime = '2016-11-13T08:30:00';
    base.End.DateTime = '2016-11-13T17:30:00';
    data.push(base);
    fs.writeFileSync('./public/json/output/jsonOutput.json', JSON.stringify(data, null, 4));
}
However, in the second case, you need to add some code to handle the first run, when there is no existing data in the output file or the file doesn't exist. Another way to handle that condition is to initialize the output file with an empty JSON array:
[]
EDIT: In both cases, appending to the existing file will not work as it will generate invalid JSON.
No one has mentioned arrays:
[
{"one": 1},
{"two": 2}
]
Is valid JSON and might do what the OP wants.
Neither example in your question is a valid JSON document; a JSON document may only have one root. You have to split the file into two objects, then parse them.
You can use http://jsonlint.com to see if a given string is valid JSON or not.
So I recommend either changing what ever is dumping multiple JSON objects into a single file to do it in separate files, or to put each object as a value in one JSON root object.
If you don't have control over whatever is creating these, then you're stuck parsing the file yourself to pick out the different root objects.
Here's a valid way of encoding those data in a JSON object:
{
"one": 1,
"two": 2
}
If you really need separate objects, you can do it like this:
{
"one":
{
"number": 1
},
"two":
{
"number": 2
}
}
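To illustrate the point, both encodings above parse with a single json.loads call in Python (a minimal sketch added here, not part of the original answer):

```python
import json

# The flat form: one root object with two keys
merged = json.loads('{"one": 1, "two": 2}')
# The nested form: separate objects as values in one root object
nested = json.loads('{"one": {"number": 1}, "two": {"number": 2}}')

print(merged["two"])            # -> 2
print(nested["two"]["number"])  # -> 2
```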
Hey all, I’ve got an annoying situation. We have a system, that we don’t control, that outputs JSON to a single file where each row of the file is a JSON object. None of these objects are wrapped in a larger JSON array; that piece is important. Each row has all the same keys, just different values per key.
We need to import all of these objects into SQL server mapping the keys to columns. We got it working for the most part by following: https://www.sqlshack.com/import-json-data-into-sql-server/
DECLARE @JSON varchar(max)
SELECT @JSON = BulkColumn
FROM OPENROWSET (BULK 'C:\sqlshack\Results.JSON', SINGLE_CLOB) import

SELECT *
FROM OPENJSON (@JSON)
WITH (
    [FirstName] varchar(20),
    [MiddleName] varchar(20),
    [LastName] varchar(20),
    [JobTitle] varchar(20),
    [PhoneNumber] nvarchar(20),
    [PhoneNumberType] varchar(10),
    [EmailAddress] nvarchar(100),
    [EmailPromotion] bit
)
That works, but it only reads the first object it finds. Is there any way to tell SQL Server “loop over all the lines of this file and import them”?
Ideally the other system would wrap all the lines in a valid JSON array but they don’t and we can’t make them.
Warning: I’m a SQL Server noob, so this may be very simple, but I can’t find anything about this online.
Edit: I haven’t tried it yet, but this might be the answer, just in case someone else comes across this post in the far-off future.
https://learn.microsoft.com/en-us/archive/blogs/sqlserverstorageengine/loading-line-delimited-json-files-in-sql-server-2016
Basically you have to hand SQL Server a format file.
That doesn't look much like json, but yes, you can totally have an array of objects in a json file. Something like this in your case:
[{"firstName": "John", "lastName": "Smith"},
{"firstName": "Jane", "lastName": "Doe"}]
A json file may either contain a single object (which can be complex, with many nested keys) or an array of such objects. It's either curly braces or square brackets on the outside.
A json file needs to have a top - this can either be a json object enclosed in {} or a json array enclosed in []
A json file can have as many objects as you like as long as they are enclosed in a top (although the word "top" is not explicitly used)
{"firstName":"John", "lastName":"Doe"},
{"firstName":"Anna", "lastName":"Smith"}
You can enclose the above using a top object {} like this -
{
{"firstName":"John", "lastName":"Doe"},
{"firstName":"Anna", "lastName":"Smith"}
}
EDIT - The above is incorrect. Let me revise this answer.
1. The JSON file can have multiple objects as an array of objects.
2. You can't list multiple objects inside an object as shown above in the first example, as each object must have entries that are key/value pairs. In the above case the top object doesn't have key/value pairs but just a list of objects which is syntactically incorrect.
This means that the best way to have multiple objects is to create an array of multiple objects like this :
[
{"firstName":"John", "lastName":"Doe"},
{"firstName":"Anna", "lastName":"Smith"}
]
Here is a link to the ECMA-404 standard that defines json.