langchain_community.document_loaders.parsers.msword
.MsWordParser¶
- class langchain_community.document_loaders.parsers.msword.MsWordParser[source]¶
Parse the Microsoft Word documents from a blob.
Methods
__init__
()lazy_parse
(blob)Parse a Microsoft Word document into the Document iterator.
parse
(blob)Eagerly parse the blob into a document or documents.
- __init__()¶
- lazy_parse(blob: Blob) Iterator[Document] [source]¶
Parse a Microsoft Word document into the Document iterator.
- Parameters
blob – The blob to parse.
Returns: An iterator of Documents.
- parse(blob: Blob) List[Document] ¶
Eagerly parse the blob into a document or documents.
This is a convenience method for interactive development environment.
Production applications should favor the lazy_parse method instead.
Subclasses should generally not over-ride this parse method.
- Parameters
blob – Blob instance
- Returns
List of documents