🌐
spaCy
spacy.io › usage › spacy-101
spaCy 101: Everything you need to know · spaCy Usage Documentation
Most of the tags and labels look pretty abstract, and they vary between languages. spacy.explain will show you a short description – for example, spacy.explain("VBZ") returns “verb, 3rd person singular present”.
🌐
spaCy
spacy.io › api › example
Example · spaCy API Documentation
An Example holds the information for one training instance. It stores two Doc objects: one for holding the gold-standard reference data, and one for holding the predictions of the pipeline.
🌐
spaCy
spacy.io › usage › linguistic-features
Linguistic Features · spaCy Usage Documentation
Most of the tags and labels look pretty abstract, and they vary between languages. spacy.explain will show you a short description – for example, spacy.explain("VBZ") returns “verb, 3rd person singular present”.
🌐
Real Python
realpython.com › natural-language-processing-spacy-python
Natural Language Processing With spaCy in Python – Real Python
February 1, 2025 - Here’s an example where an ellipsis (...) is used as a delimiter, in addition to the full stop, or period (.): ... >>> ellipsis_text = ( ... "Gus, can you, ... never mind, I forgot" ... " what I was saying. So, do you think" ... " we should ..." ... ) >>> from spacy.language import Language >>> @Language.component("set_custom_boundaries") ...

software library for natural language processing

spaCy Tailored Pipelines
Advanced NLP with spaCy: A free online course
pypi Version
conda Version
spaCy (/speɪˈsiː/ spay-SEE) is an open-source software library for advanced natural language processing, written in the programming languages Python and Cython. The library is published under the MIT license and its main … Wikipedia
Factsheet
spaCy
Original author Matthew Honnibal
Developers Explosion AI, various
Factsheet
spaCy
Original author Matthew Honnibal
Developers Explosion AI, various
🌐
spaCy
spacy.io
spaCy · Industrial-strength Natural Language Processing in Python
To use it with 'spacy train' # you can run spacy init fill-config to auto-fill all default settings: # python -m spacy init fill-config ./base_config.cfg ./config.cfg [paths] train = null dev = null vectors = null [system] gpu_allocator = null [nlp] lang = "en" pipeline = [] batch_size = 1000 [components] [corpora] [corpora.train] @readers = "spacy.Corpus.v1" path = ${paths.train} max_length = 0 [corpora.dev] @readers = "spacy.Corpus.v1" path = ${paths.dev} max_length = 0 [training] dev_corpus = "corpora.dev" train_corpus = "corpora.train" [training.optimizer] @optimizers = "Adam.v1" [training
🌐
Penn Libraries
guides.library.upenn.edu › penntdm › python › spacy
SpaCy Package - Text Analysis - Guides at Penn Libraries
Where is_alpha detects if the token consists of alphabetic characters; is_punct detects if the token is a punctuation symbol; is_space detects if the token is a space. Stop words are the most common words in a language. Examples of stop words are the, who, too, and is.
🌐
Analytics Vidhya
analyticsvidhya.com › home › spacy tutorial to learn and master natural language processing (nlp)
spaCy Tutorial to Learn and Master Natural Language Processing (NLP)
November 27, 2023 - For example, consider the following sentence: Dependency Parsing spaCy In this sentence, the entities are “Donald Trump”, “Google”, and “New York City”. Let’s now see how spaCy recognizes named entities in a sentence.
🌐
Medium
medium.com › @krishnusai › mastering-natural-language-processing-with-spacy-a-comprehensive-guide-5e67ce30d6ab
SpaCy in Python. Natural Language Processing (NLP) has… | by krishna sai | Medium
April 3, 2024 - Open a terminal or command prompt and run the command: pip install spacy. Additionally, download the language model you intend to use. For example, python -m spacy download en_core_web_sm downloads the English language model.
Find elsewhere
🌐
spaCy
spacy.io › usage › projects
Projects · spaCy Usage Documentation
If not, spaCy will show an error and the command won’t run. Setting no_skip: true means that the command will always run, even if the dependencies (the trained pipeline) haven’t changed. This makes sense here, because you typically don’t want to skip your tests. Your project commands can include any custom scripts – essentially, anything you can run from the command line. Here’s an example of a custom script that uses typer for quick and easy command-line arguments that you can define via your project.yml:
🌐
TutorialsPoint
tutorialspoint.com › spacy › index.htm
spaCy Tutorial
spaCy, developed by software developers Matthew Honnibal and Ines Montani, is an open-source software library for advanced NLP (Natural Language Processing). It is written in Python and Cython (C extension of Python which is mainly designed to give C like performance to the Python language programs)
🌐
GitHub
github.com › explosion › spaCy
GitHub - explosion/spaCy: 💫 Industrial-strength Natural Language Processing (NLP) in Python
import spacy import en_core_web_sm nlp = en_core_web_sm.load() doc = nlp("This is a sentence.") 📖 For more info and examples, check out the models documentation.
Starred by 32.9K users
Forked by 4.6K users
Languages   Python 54.1% | MDX 31.2% | Cython 10.5% | JavaScript 2.6% | Sass 0.8% | TypeScript 0.4%
🌐
Domino Data Lab
domino.ai › blog › natural-language-in-python-using-spacy
Using spaCy for natural language processing (NLP) in Python
August 13, 2025 - For example, in an early section, we parsed the sentence The gorillas just went wild and were able to show that the lemma for the word went is the verb go. At this point we can use a venerable project called WordNet which provides a lexical ...
🌐
freeCodeCamp
freecodecamp.org › news › getting-started-with-nlp-using-spacy
NLP using spaCy – How to Get Started with Natural Language Processing
June 26, 2023 - In the above code, we're asking the spaCy model to find the entities from the sentence "Apple is looking at buying U.K. startup for $1 billion". We're then iterating through each entity, and displaying the entity, start and end characters index ...
🌐
Machine Learning Plus
machinelearningplus.com › spacy-tutorial-nlp
spaCy Tutorial - Learn all of spaCy in One Complete Writeup | ML+
May 27, 2021 - NER Application 1: Extracting brand names with Named Entity Recognition 12. NER Application 2: Automatically Masking Entities 13. Rule based Matching Token Matcher Phrase Matcher Entity Ruler 14. Word Vectors and similarity 15. Merging and Splitting Tokens with retokenize 16. spaCy pipelines 17.
🌐
DataCamp
datacamp.com › cheat-sheet › spacy-cheat-sheet-advanced-nlp-in-python
spaCy Cheat Sheet: Advanced NLP in Python | DataCamp
August 1, 2021 - from spacy.tokens import Doc, Token, Span doc = nlp("The sky over New York is blue")
🌐
spaCy
spacy.io › usage › training
Training Pipelines & Models · spaCy Usage Documentation
However, there can still be scenarios where you may want to override config settings when you run spacy train. This includes file paths to vectors or other resources that shouldn’t be hard-coded in a config file, or system-dependent settings. For cases like this, you can set additional command-line options starting with -- that correspond to the config section and value to override. For example, --paths.train ./corpus/train.spacy sets the train value in the [paths] block.
🌐
spaCy
spacy.io › models
Trained Models & Pipelines · spaCy Models Documentation
For example, en_core_web_sm is a small English pipeline trained on written web text (blogs, news, comments), that includes vocabulary, syntax and entities. Additionally, the pipeline package versioning reflects both the compatibility with spaCy, ...
🌐
spaCy
spacy.io › api › top-level
Top-level Functions · spaCy API Documentation
Your application or pipeline package can also expose a spacy_displacy_colors entry point to add custom labels and their colors automatically. By default, displaCy links to # for entities without a kb_id set on their span. If you wish to link an entity to their URL then consider using the kb_url_template option from above. For example if the kb_id on a span is Q95 and this is a Wikidata identifier then this option can be set to https://www.wikidata.org/wiki/{}. Clicking on your entity in the rendered HTML should redirect you to their Wikidata page, in this case https://www.wikidata.org/wiki/Q95.
🌐
spaCy
spacy.io › usage › processing-pipelines
Language Processing Pipelines · spaCy Usage Documentation
Disabled and excluded component names can be provided to spacy.load as a list. The disable mechanism makes it easy to distribute pipeline packages with optional components that you can enable or disable at runtime. For instance, your pipeline may include a statistical and a rule-based component for sentence segmentation, and you can choose which one to run depending on your use case. For example, spaCy’s trained pipelines like en_core_web_sm contain both a parser and senter that perform sentence segmentation, but the senter is disabled by default.