It would be the files listed here: https://open.fda.gov/apis/drug/event/download/
I have been watching YouTube videos on how to bulk download, but I haven't figured it out.
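One possible approach, sketched under assumptions: openFDA publishes a downloads manifest at https://api.fda.gov/download.json, and the drug event file URLs appear to live under a nested "partitions" list. The exact keys below are my reading of that page and should be verified against the live manifest before relying on them.

```python
import json
import urllib.request

def partition_urls(manifest):
    """Collect the drug/event partition file URLs from a parsed manifest.

    Assumes the manifest nests them under results -> drug -> event -> partitions,
    each partition holding a "file" URL; verify against the real download.json.
    """
    partitions = manifest["results"]["drug"]["event"]["partitions"]
    return [p["file"] for p in partitions]

def bulk_download(dest_dir="."):
    # Fetch and parse the manifest, then download each partition file.
    with urllib.request.urlopen("https://api.fda.gov/download.json") as resp:
        manifest = json.loads(resp.read().decode())
    for url in partition_urls(manifest):
        filename = url.rsplit("/", 1)[-1]
        urllib.request.urlretrieve(url, f"{dest_dir}/{filename}")
```

Calling bulk_download() would pull every partition into the destination directory; the full dataset is large, so you may want to filter the URL list (e.g. by quarter) first.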
Get data from the URL and then call json.loads, e.g.:
Python 3 example:
import urllib.request, json

with urllib.request.urlopen("http://maps.googleapis.com/maps/api/geocode/json?address=google") as url:
    data = json.loads(url.read().decode())
    print(data)
Python 2 example:
import urllib, json

url = "http://maps.googleapis.com/maps/api/geocode/json?address=google"
response = urllib.urlopen(url)
data = json.loads(response.read())
print data
The output would result in something like this:
{
"results" : [
{
"address_components" : [
{
"long_name" : "Charleston and Huff",
"short_name" : "Charleston and Huff",
"types" : [ "establishment", "point_of_interest" ]
},
{
"long_name" : "Mountain View",
"short_name" : "Mountain View",
"types" : [ "locality", "political" ]
},
{
...
I'll take a guess that you actually want to get data from the URL:
from urllib.request import urlopen

jsonurl = urlopen(url)
text = json.loads(jsonurl.read())  # <-- read from it
Or, check out the JSON decoder in the requests library:
import requests

r = requests.get('someurl')
print(r.json())  # if the response body is JSON, this parses it for you automatically
I'm trying to download JSON files from a URL. When I attempt to open the saved JSON files in Python, I get the error:
"raise JSONDecodeError("Expecting value", s, err.value) from None"
Whenever I try opening one of the JSON URLs in the browser, I see this:
"SyntaxError: JSON.parse: unexpected character at line 1 column 1 of the JSON data"
Below is a simplified version of my code. Is there a way to download JSON files correctly?
import time
import urllib.request
from urllib.request import Request, urlopen
from bs4 import BeautifulSoup

def read_url(url):
    urls = []
    psfiles = []
    url = url.replace(" ", "%20")
    req = Request(url)
    a = urlopen(req).read()
    soup = BeautifulSoup(a, 'html.parser')
    x = soup.find_all('a')
    for i in x:
        file_name = i.extract().get_text()
        url_new = url + file_name
        url_new = url_new.replace(" ", "%20")
        if file_name[-1] == '/' and file_name[0] != '.':
            read_url(url_new)
        if url_new.endswith('json'):
            urls.append(url_new)
    for i in urls:
        psfile = i.replace('url', '')
        psfiles.append(psfile)
    for j in range(len(psfiles)):
        urllib.request.urlretrieve("url", "path to directory" + psfiles[j])

if __name__ == '__main__':
    while True:
        read_url("url")
        time.sleep(1800)
Ok, so I had a lot of trouble interacting with the site. I decided to just go with the webbrowser library.
import webbrowser

chrome_path = "C:xxx\\Google\\Chrome\\Application\\chrome.exe"
webbrowser.register('chrome', None, webbrowser.BackgroundBrowser(chrome_path))
url = 'http://testsite/csv?date=2019-07-18'
webbrowser.get('chrome').open(url)
Setting Chrome to download files automatically populates my download folder, from where I can automate everything else :)
You need to read the data out of the object that urlopen returns.
Try
import json
import urllib.request

with urllib.request.urlopen("http://test.com/csv?date=2019-07-17") as f:
    jsonl = f.read()
data = json.loads(jsonl)
Don't use Beautiful Soup to process a JSON HTTP response. Use something like requests:
import requests

url = "https://www.daraz.pk/womens-kurtas-shalwar-kameez/?pathInfo=womens-kurtas-shalwar-kameez&page=2&YII_CSRF_TOKEN=31eb0a5d28f4dde909d3233b5a0c23bd03348f69&more_products=true"
header = {'x-requested-with': 'XMLHttpRequest'}
t = requests.get(url, headers=header)
newDictionary = t.json()
print(newDictionary)
A Beautiful Soup object can't be parsed with json.loads() that way.
If some of those JSON values contain HTML, you can use Beautiful Soup to parse those string values individually. For example, if your JSON has a key called content holding HTML, you can parse it like so:
BeautifulSoup(newDictionary['content'], "lxml")
You may need to experiment with different parsers if you have fragmentary HTML.
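To illustrate that split (JSON envelope first, then HTML inside a value), here is a minimal sketch with a made-up payload. The "content" key and the fragment are invented for the example, and it uses the stdlib "html.parser" backend so no lxml install is required:

```python
import json
from bs4 import BeautifulSoup  # pip install beautifulsoup4

# Made-up JSON payload whose "content" value holds an HTML fragment.
payload = '{"id": 1, "content": "<div><p>Hello <b>world</b></p></div>"}'

record = json.loads(payload)  # 1) parse the JSON envelope
soup = BeautifulSoup(record["content"], "html.parser")  # 2) parse the HTML value
print(soup.get_text())  # -> Hello world
```

Swapping "html.parser" for "lxml" works the same way if lxml is installed, and is more forgiving with badly broken fragments.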
The following is an example of how to use various JSON data that has been loaded as an object with json.loads().
Working example - tested with Python 2.6.9, 2.7.10, 3.3.5, and 3.5.0
import json
json_data = '''
{
"array": [
1,
2,
3
],
"boolean": true,
"null": null,
"number": 123,
"object": {
"a": "b",
"c": "d",
"e": "f"
},
"string": "Hello World"
}
'''
data = json.loads(json_data)
list_0 = [
data['array'][0],
data['array'][1],
data['array'][2],
data['boolean'],
data['null'],
data['number'],
data['object']['a'],
data['object']['c'],
data['object']['e'],
data['string']
]
print('''
array value 0 {0}
array value 1 {1}
array value 2 {2}
boolean value {3}
null value {4}
number value {5}
object value a value {6}
object value c value {7}
object value e value {8}
string value {9}
'''.format(*list_0))
Output
array value 0 1
array value 1 2
array value 2 3
boolean value True
null value None
number value 123
object value a value b
object value c value d
object value e value f
string value Hello World
For example: https://www.reddit.com/r/learnpython/about.json
Trying to fetch it with urllib or requests always returns HTTP error 429 (Too Many Requests).
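Reddit's API rules ask clients to send a descriptive, unique User-Agent, and requests arriving with the default urllib/requests User-Agent are commonly rejected with 429. A hedged sketch (the User-Agent string below is a placeholder you should replace with your own); it builds the request without sending it, so the headers can be inspected offline:

```python
import requests

url = "https://www.reddit.com/r/learnpython/about.json"
# Placeholder User-Agent; Reddit asks for something descriptive and unique.
headers = {"User-Agent": "my-json-downloader/0.1 (contact: example@example.com)"}

# Prepare the request without sending it, to show what will go on the wire.
prepared = requests.Request("GET", url, headers=headers).prepare()
print(prepared.headers["User-Agent"])

# To actually fetch and parse the JSON:
# data = requests.get(url, headers=headers).json()
```

If a custom User-Agent still yields 429, you are likely being rate limited and should slow down or use the official API with authentication.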