🚀 I built Jambo, a tool that converts JSON Schema definitions into Pydantic models — dynamically, with zero config!
✅ What my project does:
- Takes JSON Schema definitions and automatically converts them into Pydantic models
- Supports validation for strings, integers, arrays, nested objects, and more
- Enforces constraints like minLength, maximum, pattern, etc.
- Built with AI frameworks like LangChain and CrewAI in mind, making it a good fit for structured data workflows
🧪 Quick Example:
from jambo.schema_converter import SchemaConverter

schema = {
    "title": "Person",
    "type": "object",
    "properties": {
        "name": {"type": "string"},
        "age": {"type": "integer"},
    },
    "required": ["name"],
}

Person = SchemaConverter.build(schema)
print(Person(name="Alice", age=30))

🎯 Target Audience:
- Developers building AI agent workflows with structured data
- Anyone needing to convert schemas into validated models quickly
- Pydantic users who want to skip writing models manually
- Those working with JSON APIs or dynamic schema generation
🙌 Why I built it:
My name is Vitor Hideyoshi. I needed a tool to dynamically generate models while working on AI agent frameworks — so I decided to build it and share it with others.
Check it out here:
GitHub: https://github.com/HideyoshiNakazone/jambo
PyPI: https://pypi.org/project/jambo/
Would love to hear what you think! Bug reports, feedback, and PRs all welcome! 😄
#ai #crewai #langchain #jsonschema #pydantic
This has been discussed some time ago and Samuel Colvin said he didn't want to pursue this as a feature for Pydantic.
If you are fine with code generation instead of actual runtime creation of models, you can use the datamodel-code-generator.
To be honest, I struggle to see the use case for generating complex models at runtime, since their main purpose is validation, which implies you think about the correct schema before running your program. But that is just my view.
For simple models I guess you can throw together your own logic for this fairly quickly.
If you do need something more sophisticated, the aforementioned library does offer some extensibility. You should be able to import and inherit from some of its classes, like the JsonSchemaParser. Maybe that will get you somewhere.
Ultimately I think this becomes non-trivial very quickly, which is why Pydantic's maintainer didn't want to deal with it and why there is a whole separate project for this.
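To illustrate the "throw together your own logic" route for simple models, here is a minimal sketch using Pydantic's create_model. The function name and the small constraint map are my own choices, and it deliberately handles only flat objects with a handful of constraint keywords (no $ref, enums, or nesting):

```python
from pydantic import Field, create_model

# Minimal sketch: flat objects only, a few constraint keywords mapped to Field kwargs
TYPES = {"string": str, "integer": int, "number": float, "boolean": bool}
CONSTRAINTS = {
    "minLength": "min_length",
    "maxLength": "max_length",
    "minimum": "ge",
    "maximum": "le",
    "pattern": "pattern",
}


def simple_schema_to_model(schema: dict):
    required = set(schema.get("required", []))
    fields = {}
    for name, props in schema.get("properties", {}).items():
        # Translate JSON Schema constraint keywords into Field() kwargs
        kwargs = {CONSTRAINTS[k]: v for k, v in props.items() if k in CONSTRAINTS}
        # Required fields get Pydantic's "required" sentinel, others a default
        default = ... if name in required else props.get("default")
        fields[name] = (TYPES.get(props.get("type"), object), Field(default, **kwargs))
    return create_model(schema.get("title", "DynamicModel"), **fields)


Person = simple_schema_to_model({
    "title": "Person",
    "type": "object",
    "properties": {
        "name": {"type": "string", "minLength": 1},
        "age": {"type": "integer", "minimum": 0},
    },
    "required": ["name"],
})
print(Person(name="Alice", age=30))
```

This covers the easy 80% of cases; anything with nesting, enums, or references is where the complexity mentioned above starts.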
Updated @Alon's answer to handle nested models:
from typing import Any, Type, Optional
from enum import Enum
from pydantic import BaseModel, Field, create_model
def json_schema_to_base_model(schema: dict[str, Any]) -> Type[BaseModel]:
    type_mapping: dict[str, type] = {
        "string": str,
        "integer": int,
        "number": float,
        "boolean": bool,
        "array": list,
        "object": dict,
    }

    properties = schema.get("properties", {})
    required_fields = schema.get("required", [])
    model_fields = {}

    def process_field(field_name: str, field_props: dict[str, Any]) -> tuple:
        """Recursively processes a field and returns its type and Field instance."""
        json_type = field_props.get("type", "string")
        enum_values = field_props.get("enum")

        # Handle enums
        if enum_values:
            enum_name: str = f"{field_name.capitalize()}Enum"
            field_type = Enum(enum_name, {v: v for v in enum_values})
        # Handle nested objects
        elif json_type == "object" and "properties" in field_props:
            field_type = json_schema_to_base_model(
                field_props
            )  # Recursively create submodel
        # Handle arrays with nested objects
        elif json_type == "array" and "items" in field_props:
            item_props = field_props["items"]
            if item_props.get("type") == "object":
                item_type: type[BaseModel] = json_schema_to_base_model(item_props)
            else:
                item_type: type = type_mapping.get(item_props.get("type"), Any)
            field_type = list[item_type]
        else:
            field_type = type_mapping.get(json_type, Any)

        # Handle default values and optionality
        default_value = field_props.get("default", ...)
        nullable = field_props.get("nullable", False)
        description = field_props.get("title", "")

        if nullable:
            field_type = Optional[field_type]
        if field_name not in required_fields:
            default_value = field_props.get("default", None)

        return field_type, Field(default_value, description=description)

    # Process each field
    for field_name, field_props in properties.items():
        model_fields[field_name] = process_field(field_name, field_props)

    return create_model(schema.get("title", "DynamicModel"), **model_fields)
Example Schema
schema = {
    "title": "User",
    "type": "object",
    "properties": {
        "name": {"type": "string"},
        "age": {"type": "integer"},
        "is_active": {"type": "boolean"},
        "address": {
            "type": "object",
            "properties": {
                "street": {"type": "string"},
                "city": {"type": "string"},
                "zipcode": {"type": "integer"},
            },
        },
        "roles": {
            "type": "array",
            "items": {
                "type": "string",
                "enum": ["admin", "user", "guest"]
            }
        }
    },
    "required": ["name", "age"]
}
Generate the Pydantic model
DynamicModel = json_schema_to_base_model(schema)
Example usage
print(DynamicModel.schema_json(indent=2))
One solution is to hack the utils out of datamodel-code-generator, specifically their JsonSchemaParser. This generates an intermediate text representation of all pydantic models which you can then dynamically import. You might reasonably balk at this, but it does allow for self-referencing and multi-model setups at least:
import importlib.util
import json
import re
import sys
from contextlib import contextmanager
from pathlib import Path
from tempfile import NamedTemporaryFile
from types import ModuleType
from datamodel_code_generator.parser.jsonschema import JsonSchemaParser
from pydantic import BaseModel
NON_ALPHANUMERIC = re.compile(r"[^a-zA-Z0-9]+")
UPPER_CAMEL_CASE = re.compile(r"[A-Z][a-zA-Z0-9]+")
LOWER_CAMEL_CASE = re.compile(r"[a-z][a-zA-Z0-9]+")
class BadJsonSchema(Exception):
    pass


def _to_camel_case(name: str) -> str:
    if any(NON_ALPHANUMERIC.finditer(name)):
        return "".join(term.lower().title() for term in NON_ALPHANUMERIC.split(name))
    if UPPER_CAMEL_CASE.match(name):
        return name
    if LOWER_CAMEL_CASE.match(name):
        return name[0].upper() + name[1:]
    raise BadJsonSchema(f"Unknown case used for {name}")


def _load_module_from_file(file_path: Path) -> ModuleType:
    spec = importlib.util.spec_from_file_location(
        name=file_path.stem, location=str(file_path)
    )
    module = importlib.util.module_from_spec(spec)
    sys.modules[file_path.stem] = module
    spec.loader.exec_module(module)
    return module


@contextmanager
def _delete_file_on_completion(file_path: Path):
    try:
        yield
    finally:
        file_path.unlink(missing_ok=True)


def json_schema_to_pydantic_model(json_schema: dict, name_override: str) -> BaseModel:
    json_schema_as_str = json.dumps(json_schema)
    pydantic_models_as_str: str = JsonSchemaParser(json_schema_as_str).parse()

    with NamedTemporaryFile(suffix=".py", delete=False) as temp_file:
        temp_file_path = Path(temp_file.name).resolve()
        temp_file.write(pydantic_models_as_str.encode())

    with _delete_file_on_completion(file_path=temp_file_path):
        module = _load_module_from_file(file_path=temp_file_path)

    main_model_name = _to_camel_case(name=json_schema["title"])
    pydantic_model: BaseModel = module.__dict__[main_model_name]

    # Override the pydantic model/parser name for nicer ValidationError messaging and logging
    pydantic_model.__name__ = name_override
    pydantic_model.parse_obj.__func__.__name__ = name_override

    return pydantic_model
The main drawback, as I see it: datamodel-code-generator has non-dev dependencies (isort and black), which are not ideal to have in your deployments.
If I understand correctly, you are looking for a way to generate Pydantic models from JSON schemas. Here is an implementation of a code generator - meaning you feed it a JSON schema and it outputs a Python file with the Model definition(s). It is not "at runtime" though. For this, an approach that utilizes the create_model function was also discussed in this issue thread a while back, but as far as I know there is no such feature in Pydantic yet.
If you know that your models will not be too complex, it might be fairly easy to implement a crude version of this yourself. Essentially the properties in a JSON schema are reflected fairly nicely by the __fields__ attribute of a model. You could write a function that takes a parsed JSON schema (i.e. a dictionary) and generates the Field definitions to pass to create_model.
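As a quick sketch of that correspondence (assuming Pydantic v2, where __fields__ is exposed as model_fields): the keys of a model's field map line up with the "properties" keys of its generated schema, which is what makes the reverse direction tractable for simple cases.

```python
from pydantic import create_model

# Build a model dynamically: each field is a (type, default) tuple,
# with ... marking a required field
User = create_model("User", name=(str, ...), age=(int, 0))

# The generated schema's "properties" mirror the model's field map
assert set(User.model_json_schema()["properties"]) == set(User.model_fields)
assert User.model_fields["name"].is_required()
assert not User.model_fields["age"].is_required()
```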
After digging a bit deeper into the pydantic code I found a nice little way to prevent this. There is a method called field_title_should_be_set(...) in GenerateJsonSchema which can be subclassed and provided to model_json_schema(...).
I'm not sure if the way I've overwritten the method is sufficient for each edge case but at least for this little test class it works as intended.
from pydantic import BaseModel
from pydantic._internal._core_utils import is_core_schema, CoreSchemaOrField
from pydantic.json_schema import GenerateJsonSchema
class Test(BaseModel):
    a: int


class GenerateJsonSchemaWithoutDefaultTitles(GenerateJsonSchema):
    def field_title_should_be_set(self, schema: CoreSchemaOrField) -> bool:
        return_value = super().field_title_should_be_set(schema)
        if return_value and is_core_schema(schema):
            return False
        return return_value
json_schema = Test.model_json_schema(schema_generator=GenerateJsonSchemaWithoutDefaultTitles)
assert "title" not in json_schema["properties"]["a"]
You can do it in following way with Pydantic v2:
from typing import Any

from pydantic import BaseModel, ConfigDict


def my_schema_extra(schema: dict[str, Any]) -> None:
    for prop in schema.get('properties', {}).values():
        prop.pop('title', None)


class Model(BaseModel):
    a: int

    model_config = ConfigDict(
        json_schema_extra=my_schema_extra,
    )


print(Model.model_json_schema())