@xoxoharsh/multiparser
Version:
A Text extracting package docx, pdf and pptx files
61 lines (41 loc) • 1.2 kB
Markdown
# MultiParser
A powerful npm package for parsing text from PowerPoint, PDF, and Word documents. This tool seamlessly extracts text, making it easier to analyze, process, and integrate with your applications.
## Features
- Parse text from PPT, PDF, and DOCX files
- Easy-to-use API
- High performance and accuracy
- Supports multiple file formats
- Lightweight and fast
## Installation
Install the package via npm:
```bash
npm install @xoxoharsh/multiparser
```
## Usage
Here's how to use the package in your project:
- For parsing whole file:
```bash
import Parser from '@xoxoharsh/multiparser';
const parser = new Parser(filePath);
parser.extractAll().then((text) =>{
console.log(text);
}).catch((error) => {
console.error("Error extracting text:", error);
});
```
- For parsing a particular page:
```bash
import Parser from '@xoxoharsh/multiparser';
const parser = new Parser(filePath);
parser
.extractPage(pageNo)
.then((text) => {
console.log("Page 3 text:", text);
})
.catch((error) => {
console.error("Error extracting text:", error);
});
// Currently this feature is not available for word documents
```
## Contributing
We welcome contributions!