python subprocess encoding

docs.python.org › 3 › library › subprocess.html

subprocess — Subprocess management

3 weeks ago - The input argument is passed to Popen.communicate() and thus to the subprocess’s stdin. If used it must be a byte sequence, or a string if encoding or errors is specified or text is true.
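The rule in this snippet can be sketched as follows. This is a minimal illustration, not taken from the documentation: the child process is a hypothetical Python one-liner (started through `sys.executable` only for portability) that copies stdin to stdout, and the sample text is made up.

```python
import subprocess
import sys

# Hypothetical child: copies raw bytes from stdin to stdout unchanged.
CHILD = [sys.executable, "-c",
         "import sys; sys.stdout.buffer.write(sys.stdin.buffer.read())"]

sample = "h\u00e9llo"  # made-up sample text containing a non-ASCII character

# Binary mode (the default): input must be bytes, and stdout comes back as bytes.
raw = subprocess.run(CHILD, input=sample.encode("utf-8"),
                     stdout=subprocess.PIPE, check=True)

# Text mode: passing encoding= lets input be a str, and stdout is decoded to str.
txt = subprocess.run(CHILD, input=sample, encoding="utf-8",
                     stdout=subprocess.PIPE, check=True)

print(type(raw.stdout).__name__, type(txt.stdout).__name__)
```

Passing a `str` as `input` without `encoding`, `errors`, or `text=True` would fail, because the pipe is opened in binary mode.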

stackoverflow.com › questions › 49150550 › python-subprocess-encoding

python subprocess encoding - Stack Overflow

1 of 2

Following the instructions in this post: How to make Unicode charset in cmd.exe by default?

It's possible to bypass this encoding problem:

import subprocess
output = subprocess.check_output("chcp 65001 | powershell \"Get-ChildItem -LiteralPath 'HKLM:SOFTWARE\\\\Microsoft\\\\Windows\\\\CurrentVersion\\\\Uninstall' -ErrorAction 'Stop' -ErrorVariable '+ErrorUninstallKeyPath'\"", shell=True, stderr=subprocess.STDOUT)

2 of 2

The output is of type bytes, so you need to decode it to a string with .decode('utf-8') (or with whatever codec the command actually emits). Note that str() on a bytes object returns its repr (b'...') rather than the decoded text. Example:

import subprocess
output_bytes = subprocess.check_output("powershell \"Get-ChildItem -LiteralPath 'HKLM:SOFTWARE\\\\Microsoft\\\\Windows\\\\CurrentVersion\\\\Uninstall' -ErrorAction 'Stop' -ErrorVariable '+ErrorUninstallKeyPath'\"", shell=True, stderr=subprocess.STDOUT)

# decode the bytes; str(output_bytes) would only give the repr "b'...'"
output_string = output_bytes.decode('utf-8')

# there are lots of \r\n in the output I encountered, so you can split
# to get a list of lines
output_list = output_string.splitlines()

# once you have a list, you can loop through it and print (or whatever you want)
for e in output_list:
    print(e)

The key here is to decode with the codec that matches the command's actual output encoding, so that the correct characters are produced when printing.
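A small sketch of why the codec choice matters. The bytes below are made-up sample data standing in for what `check_output()` might return, and the two codecs are only examples:

```python
# Made-up sample bytes standing in for check_output() output.
output_bytes = "Name: \u00f6\r\nSize: 5\r\n".encode("utf-8")

# str() on bytes returns the repr (b'...'), not the decoded text.
as_repr = str(output_bytes)

# Decoding with the codec that produced the bytes gives the real text.
lines = output_bytes.decode("utf-8").splitlines()

# Decoding with a different codec may succeed but yields mojibake.
wrong_lines = output_bytes.decode("cp1252").splitlines()

print(as_repr)
print(lines)
print(wrong_lines)
```

`splitlines()` handles `\r\n` and `\n` alike, so no manual splitting on escape sequences is needed.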

bugs.python.org › issue6135

Issue 6135: subprocess seems to use local encoding and give no choice - Python tracker

This issue tracker has been migrated to GitHub, and is currently read-only. For more information, see the GitHub FAQs in the Python's Developer Guide · This issue has been migrated to GitHub: https://github.com/python/cpython/issues/50385

Python.org

discuss.python.org › python help

Choosing correct encoding for subprocess.Popen() - Python Help - Discussions on Python.org

February 22, 2020 - I’m working on a Python wrapper around a 3rd-party command-line tool and need to exchange data with it via stdin/stdout. So I use subprocess.Popen() to start a process and then write()/readline() to send data or retrieve results. Here is simplified code: import subprocess command = ['/path/to/executable', 'arg1', 'arg2', 'arg3'] instance = subprocess.Popen(command, stdin=subprocess.PIPE, stdout=subprocess.PIPE, std...

stackoverflow.com › questions › 33283603 › python-popen-communicate-str-encodeencoding-utf-8-errors-ignore-cr

Python popen() - communicate( str.encode(encoding="utf-8", errors="ignore") ) crashes - Stack Overflow

1 of 3

universal_newlines=True enables text mode. Combined with stdout=PIPE, it forces decoding of the child process' output using locale.getpreferredencoding(False) that is not utf-8 on Windows. That is why you see UnicodeDecodeError.

To read the subprocess' output using utf-8 encoding, drop universal_newlines=True:

#!/usr/bin/env python3
from subprocess import Popen, PIPE

with Popen(r'C:\path\to\program.exe "arg 1" "arg 2"',
           stdout=PIPE, stderr=PIPE) as p:
    output, errors = p.communicate()
lines = output.decode('utf-8').splitlines()

str.encode("utf-8") is equivalent to "utf-8".encode(). There is no point in passing it to .communicate() unless you set stdin=PIPE and the child process expects the b'utf-8' bytestring as input.

str.encode(encoding="utf-8", errors="ignore") has the form klass.method(**kwargs). The .encode() method expects self (a string object), which is why you see TypeError.

>>> str.encode("abc", encoding="utf-8", errors="ignore") #XXX don't do it
b'abc'
>>> "abc".encode(encoding="utf-8", errors="ignore")
b'abc'

Do not use klass.method(obj) instead of obj.method() without a good reason.
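As an alternative to decoding by hand, Popen has accepted encoding= and errors= since Python 3.6, which enables text mode without relying on the locale default. A minimal sketch, assuming a hypothetical child (a Python one-liner standing in for program.exe) that always writes UTF-8:

```python
import locale
import subprocess
import sys

# Hypothetical child that writes UTF-8 bytes regardless of the locale.
CHILD = [sys.executable, "-c",
         "import sys; sys.stdout.buffer.write('gr\\u00fc\\u00df\\n'.encode('utf-8'))"]

# The encoding that universal_newlines=True would fall back to by default.
default_encoding = locale.getpreferredencoding(False)

# An explicit encoding= turns on text mode and overrides the locale default.
with subprocess.Popen(CHILD, stdout=subprocess.PIPE, encoding="utf-8") as p:
    output, _ = p.communicate()

lines = output.splitlines()
print(default_encoding, lines)
```

This only helps when the child really emits UTF-8; if it writes in another code page, that codec has to be passed instead.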

2 of 3

You are not supposed to call .encode() on the class itself. What you probably want to do is something like

p1.communicate("FOOBAR".encode("utf-8"))

The error message you're getting means that the encode() function has nothing to encode, since you called it on the class, rather than on an instance (that would then be passed as the self parameter to encode()).
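The difference between the two call styles can be reproduced without any subprocess at all. A minimal sketch:

```python
message = ""
try:
    # Called on the class with keyword arguments only: there is no string to encode.
    str.encode(encoding="utf-8", errors="ignore")
    failed = False
except TypeError as exc:
    failed = True
    message = str(exc)

# Called on an instance: the string itself is passed as self.
payload = "FOOBAR".encode("utf-8")

print(failed, message)
print(payload)
```

The resulting `payload` is a bytes object and is what `communicate()` expects when the pipes are in binary mode.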

github.com › python › cpython › issues › 105312

subprocess.run() defaults to the wrong text encoding under Windows · Issue #105312 · python/cpython

June 5, 2023 -

Python 3.11.3 (tags/v3.11.3:f3909b8, Apr 4 2023, 23:49:59) [MSC v.1934 64 bit (AMD64)] on win32
>>> import subprocess
>>> subprocess.run("echo ö", shell=True, text=True, stdout=subprocess.PIPE).stdout.strip("\n")
'”'

As you can see, there is codepage confusion. You don't get back what you wrote out. Windows has different codepage settings applied, depending on context. File encoding (also called ANSI codepage) is not necessarily identical with console encoding (also called OEM codepage), see https://stackoverflow.com/a/43194047.
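The mismatch can be reproduced on any platform with plain codecs. The sketch below assumes an OEM code page of cp850 and an ANSI code page of cp1252, which is a common Western European Windows setup; the issue does not state which code pages were actually in effect.

```python
# Assumed OEM (console) code page: cp850. The shell's echo writes bytes in it.
oem_bytes = "\u00f6".encode("cp850")  # the character ö

# Assumed ANSI code page: cp1252. text=True decodes the pipe with it.
misread = oem_bytes.decode("cp1252")

# Decoding with the code page that produced the bytes round-trips correctly.
correct = oem_bytes.decode("cp850")

print(oem_bytes, repr(misread), repr(correct))
```

Under these assumptions the single byte for ö is read back as a right double quotation mark, matching the output shown in the report.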

Author kunom

JetBrains

youtrack.jetbrains.com › issue › PY-24760

subprocess.check_output(..., encoding='utf-8') inspector ...

Python for Network Engineers

pyneng.readthedocs.io › en › latest › book › 16_unicode › convert_examples.html

Examples of converting between bytes and strings - Python for network engineers

In [6]: result = subprocess.run(['ping', '-c', '3', '-n', '8.8.8.8'],
   ...:                         stdout=subprocess.PIPE, encoding='utf-8')
   ...:

In [7]: result.stdout
Out[7]: 'PING 8.8.8.8 (8.8.8.8) 56(84) bytes of data.\n64 bytes from 8.8.8.8: icmp_seq=1 ttl=43 time=55.5 ms\n64 bytes from 8.8.8.8: icmp_seq=2 ttl=43 time=54.6 ms\n64 bytes from 8.8.8.8: icmp_seq=3 ttl=43 time=53.3 ms\n\n--- 8.8.8.8 ping statistics ---\n3 packets transmitted, 3 received, 0% packet loss, time 2003ms\nrtt min/avg/max/mdev = 53.368/54.534/55.564/0.941 ms\n'

In [8]: print(result.stdout)
PING 8.8.8.8 (8.8.8.8) 56(84) bytes of data.

Python Forum

python-forum.io › thread-24580.html

subprocess.Popen() and encodings

I need to call a 3rd-party command-line tool from Python and communicate with it: pass commands and read their results. The tool is started with subprocess.Popen(), and then I write to stdin and read from stdout. Here is simplified code: import subprocess ...

github.com › pyinstaller › pyinstaller › issues › 1325

Set input and output encoding when calling subprocesses · Issue #1325 · pyinstaller/pyinstaller

July 5, 2015 - Pass some env-variables to the subprocess.* calls to ensure a given encoding is used by the subprocess ... Do not use universal_newlines, because its behavior has changed in Python 3 and it now sets an encoding derived from the current locale.
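One way to read this suggestion, sketched under the assumption that the child is itself a Python interpreter: PYTHONIOENCODING is honored only by Python processes, so other tools need their own variables or flags. The child command here is a hypothetical one-liner.

```python
import os
import subprocess
import sys

# Copy the parent's environment and ask a Python child to use UTF-8 for stdio.
env = dict(os.environ, PYTHONIOENCODING="utf-8")

# Hypothetical child: prints a non-ASCII character through its text-mode stdout.
child = [sys.executable, "-c", "print('\\u00f6')"]

# Capture raw bytes (no universal_newlines) and decode them explicitly.
raw = subprocess.run(child, env=env, stdout=subprocess.PIPE, check=True).stdout
text = raw.decode("utf-8").strip()

print(raw, text)
```

Because both sides now agree on UTF-8, the result no longer depends on the locale of the machine running the parent.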

Author htgoebel

gist.github.com › codeforkjeff › d9c15f224c7163131c38

subprocess_utf8.py · GitHub

October 4, 2022 - subprocess_utf8.py

github.com › python › cpython › issues › 50385

subprocess seems to use local encoding and give no choice · Issue #50385 · python/cpython

May 28, 2009 - BPO 6135 Nosy @gpshead, @amauryfa, @ncoghlan, @pitrou, @vstinner, @mark-summerfield, @merwok, @bitdancer, @mightyiam, @andyclegg, @cjerdonek, @vadmium, @eryksun, @zooba, @davispuh PRs #5564, #5572, #5573 Files subprocess.patch: Add encoding ...

Published May 28, 2009

Author mark-summerfield

Python.org

discuss.python.org › core development

Deprecating `text` option in subprocess - Core Development - Discussions on Python.org

March 15, 2022 - This thread is a spin-off from "JEP 400: UTF-8 by Default and future of Python". The subprocess module has a text=False option. When text=True is passed, the locale encoding is used for now. Instead of changing the default encoding, we can deprecate the ...

stackoverflow.com › questions › 47946134 › subprocess-run-argument-encoding

python - subprocess.run() argument encoding - Stack Overflow

1 of 2

(Answering my own question, hoping it could be helpful to others)

I made a short test program. This is what I have found:

File system encoding is the key point.
Monkey patching does not work. Well, that's OK. It is not acceptable as a solution anyway.
LANG=C.UTF-8 requires the locale installed and it was not on my system (checked with locale -a). But on a second system where it was available, it worked.

I can make the encoding explicit and pass bytes as one of the args:

cmdresult = sub.run(
    [SCRIPT, tid, days, name.encode('utf-8')],
    ...

This works, but one question remained:

Does it comply with the docs?

All I could find is:

args should be a sequence of program arguments or else a single string

I understood that as one string or a list of strings, but it does not actually specify what types the list may contain. I also passed an int to see what would happen. I got this error:

expected str, bytes or os.PathLike object

So my solution seems to be fine.
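The approach from this answer can be checked with a small round trip. This is a sketch under assumptions: the child is a hypothetical Python one-liner standing in for SCRIPT, the argument value is made up, and the byte-for-byte pass-through described in the comments is the POSIX behavior.

```python
import subprocess
import sys

name = "J\u00fcrgen"  # made-up argument value

# Hypothetical child: reports the raw bytes of its first argument as hex.
child_code = "import os, sys; print(os.fsencode(sys.argv[1]).hex())"

# str and bytes items can be mixed in args; on POSIX the bytes item is
# handed to the child unchanged.
result = subprocess.run(
    [sys.executable, "-c", child_code, name.encode("utf-8")],
    stdout=subprocess.PIPE, check=True,
)
received_hex = result.stdout.decode("ascii").strip()

print(received_hex == name.encode("utf-8").hex())
```

Encoding the argument explicitly removes the dependency on the file system encoding that the answer identifies as the key point.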

2 of 2

In the context of mod_wsgi, you should ensure you are using mod_wsgi daemon mode and set the lang/locale for the mod_wsgi daemon process group. For a much more detailed explanation which is too much to repeat here, see:

http://blog.dscpl.com.au/2014/09/setting-lang-and-lcall-when-using.html

bugs.python.org › issue34618

Issue 34618: Encoding error running in subprocess with captured output - Python tracker

Deanishe

deanishe.net › alfred-workflow › guide › text-encoding.html

Encoded strings and Unicode — Alfred-Workflow 1.39.0 documentation

Best practice in Python programs is to use Unicode internally and decode all text input and encode all text output at IO boundaries (i.e. right where it enters/leaves your program). On macOS, UTF-8 is almost always the right encoding. Be sure to decode all input from and encode all output to the system (in particular via subprocess and when passing a {query} to a subsequent workflow action).

Fransiska

fransiska.github.io › 2021 › 08 › 13 › subprocess-encoding

Subprocess without shell

Traceback (most recent call last):
  File "ledger.py", line 32, in <module>
    res = subprocess.run(command, capture_output=True, encoding="utf8") #UnicodeDecodeError: 'utf-8' codec can't decode byte 0xe3 in position 164: invalid continuation byte
  File "/Users/fransiska/.pyenv/versions/3.7.3/lib/python3.7/subprocess.py", line 474, in run
    stdout, stderr = process.communicate(input, timeout=timeout)
  File "/Users/fransiska/.pyenv/versions/3.7.3/lib/python3.7/subprocess.py", line 939, in communicate
    stdout, stderr = self._communicate(input, endtime, timeout)
  File "/Users/fransiska/.pyenv/versions/3.7.3
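An error of this kind can be reproduced, and tolerated, with a short sketch. The child below is a hypothetical Python one-liner that emits a byte sequence that is not valid UTF-8; it is not the ledger command from the traceback, and the behavior shown is what happens on POSIX systems such as the macOS machine in the paths above.

```python
import subprocess
import sys

# Hypothetical child: 0xe3 starts a multi-byte UTF-8 sequence but is
# followed by a plain space, so the output is not valid UTF-8.
child = [sys.executable, "-c",
         "import sys; sys.stdout.buffer.write(b'caf\\xe3 ok')"]

try:
    subprocess.run(child, capture_output=True, encoding="utf8")
    strict_failed = False
except UnicodeDecodeError:
    strict_failed = True

# errors="replace" substitutes U+FFFD for undecodable bytes instead of raising.
res = subprocess.run(child, capture_output=True, encoding="utf8",
                     errors="replace")

print(strict_failed, repr(res.stdout))
```

Replacing bytes hides the symptom only; if the child really writes in another encoding, passing that codec as `encoding=` recovers the original text.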

bugs.python.org › issue27179

Issue 27179: subprocess uses wrong encoding on Windows - Python tracker

June 2, 2016 - This issue tracker has been migrated to GitHub, and is currently read-only. For more information, see the GitHub FAQs in the Python's Developer Guide · This issue has been migrated to GitHub: https://github.com/python/cpython/issues/71366

bugs.python.org › issue33339

Issue 33339: Using default encoding with `subprocess.run()` is not obvious - Python tracker

stackoverflow.com › questions › 58522863 › subprocess-command-encoding

python - Subprocess command encoding - Stack Overflow