Hello, I would like to learn enough to eventually run a Python script that looks at whatever is on my monitor, locates known pictures, and interacts with them.
For example, I have a folder with pictures of four animals: fish.png, deer.png, bear.png, and horse.png.
I want the script to scan whatever is on my monitor and check whether any of the pictures I am interested in are present. Ideally I would also get the X, Y coordinates on the screen so I can automate some mouse movement and clicking as well.
These are static images. I don't need to identify ANY deer or ANY fish; I just need to find the known picture.
I'm not asking for code help, but can anyone point me in the right direction on what to research, or to any guides?
So far I have been able to use NumPy and Matplotlib by following this series, https://pythonprogramming.net/image-recognition-python/, and I can load my saved image and work with its array, but I am really not sure how to examine my screen and get the x, y pixel coordinates of a positive match.
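To make the goal concrete, here is a rough sketch of the kind of search I imagine, using only NumPy on arrays. It assumes the picture appears on screen pixel for pixel (no scaling or compression) and that both arrays have the same channel layout. The function name find_exact and the fake "screen" array are made up for illustration. I assume a real screenshot could come from something like Pillow's ImageGrab.grab() converted with np.array(), but I have not got that far.

```python
import numpy as np

def find_exact(screen, template):
    """Return (x, y) of the top-left corner of the first exact match, or None.

    Hypothetical helper: both arguments are image arrays shaped
    (height, width) or (height, width, channels) with matching channels.
    """
    screen = np.atleast_3d(screen)
    template = np.atleast_3d(template)
    sh, sw = screen.shape[:2]
    th, tw = template.shape[:2]
    if th > sh or tw > sw or screen.shape[2] != template.shape[2]:
        return None

    # Cheap pre-filter: keep only positions whose pixel equals the
    # template's top-left pixel, then verify the full block at each one.
    first_pixel = template[0, 0]
    region = screen[: sh - th + 1, : sw - tw + 1]
    candidates = np.all(region == first_pixel, axis=-1)
    for y, x in zip(*np.nonzero(candidates)):
        if np.array_equal(screen[y : y + th, x : x + tw], template):
            return int(x), int(y)
    return None

# Fake "screen" standing in for a screenshot (assumption: a real one
# might come from np.array(ImageGrab.grab()) with Pillow installed).
screen = np.zeros((50, 80, 3), dtype=np.uint8)
template = (np.arange(4 * 5 * 3).reshape(4, 5, 3) + 1).astype(np.uint8)
screen[12:16, 30:35] = template  # paste the known picture at x=30, y=12

match = find_exact(screen, template)
missing = find_exact(np.zeros((50, 80, 3), dtype=np.uint8), template)
print(match, missing)
```

If a match is found, adding half the template's width and height to the returned x, y would give the centre of the picture, which is presumably where I would want the mouse to click.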