langchain_community.document_loaders.parsers.msword.MsWordParser
- class langchain_community.document_loaders.parsers.msword.MsWordParser
Parse Microsoft Word documents from a blob.
Methods
__init__()
lazy_parse(blob): Parse a Microsoft Word document into an iterator of Documents.
parse(blob): Eagerly parse the blob into a document or documents.
- __init__()
- lazy_parse(blob: Blob) → Iterator[Document]
Parse a Microsoft Word document into an iterator of Documents.
- Parameters
blob – The blob to parse.
- Returns
An iterator of Documents.
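A minimal usage sketch of lazy_parse follows. It is not taken from this page: the helper name iter_word_texts and the file name in the trailing comment are hypothetical, and the Blob import path and the Blob.from_path constructor are assumptions about the surrounding library. Depending on the installed version, the parser may also need an optional dependency (for example unstructured) at call time.

```python
from typing import Iterator


def iter_word_texts(path: str) -> Iterator[str]:
    """Yield the text of each Document parsed from the Word file at `path`.

    Hypothetical helper for illustration; not part of the library.
    """
    # Imports are deferred to call time so this module can be imported
    # even where LangChain is not installed.
    from langchain_community.document_loaders.blob_loaders import Blob
    from langchain_community.document_loaders.parsers.msword import MsWordParser

    # Assumption: Blob.from_path wraps the file and guesses its MIME type
    # from the file extension.
    blob = Blob.from_path(path)
    parser = MsWordParser()

    # lazy_parse returns an iterator, so Documents are produced on demand.
    for document in parser.lazy_parse(blob):
        yield document.page_content


# Example call (hypothetical file name):
#     for text in iter_word_texts("report.docx"):
#         print(text[:200])
```

Because the helper is itself a generator, nothing is read or parsed until the caller starts iterating.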
- parse(blob: Blob) → List[Document]
Eagerly parse the blob into a document or documents.
This is a convenience method for interactive development environments.
Production applications should favor the lazy_parse method instead.
Subclasses should generally not override this parse method.
- Parameters
blob – The Blob instance to parse.
- Returns
List of documents
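The relationship between the eager and lazy methods can be illustrated with a self-contained sketch. Everything in it is a hypothetical stand-in (FakeBlob, FakeDocument, ParagraphParser), not the library's classes, and it assumes that parse simply collects the output of lazy_parse into a list, which the description above suggests but does not state outright.

```python
from dataclasses import dataclass
from typing import Iterator, List


@dataclass
class FakeBlob:
    """Hypothetical stand-in for Blob: holds raw text only."""
    data: str


@dataclass
class FakeDocument:
    """Hypothetical stand-in for Document."""
    page_content: str


class ParagraphParser:
    """Toy parser that emits one FakeDocument per non-empty paragraph."""

    def lazy_parse(self, blob: FakeBlob) -> Iterator[FakeDocument]:
        # Documents are produced one at a time, only when requested.
        for paragraph in blob.data.split("\n\n"):
            if paragraph.strip():
                yield FakeDocument(page_content=paragraph.strip())

    def parse(self, blob: FakeBlob) -> List[FakeDocument]:
        # Eager variant: drain the lazy iterator into a list.
        return list(self.lazy_parse(blob))


blob = FakeBlob("First paragraph.\n\nSecond paragraph.\n\n")
parser = ParagraphParser()

first = next(parser.lazy_parse(blob))  # only the first document is built
everything = parser.parse(blob)        # every document is held in memory
```

Under this assumption a subclass only needs to implement lazy_parse, and the eager method follows from it, which is one plausible reason overriding parse is discouraged.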