Skip to content

Denvermax19/Markdown-Converter

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Markdown Converter 📄

A sleek, rapid-prototyped web tool built with Python, Streamlit, and Microsoft's MarkItDown library.


✨ Features

  • Multi-format Support: Convert .pdf, .docx, and .pptx files.
  • One-Click Conversion: Simple UI that handles the heavy lifting in the background.
  • Instant Preview: Preview your Markdown content directly in the browser before downloading.
  • Clean Output: Utilizes Microsoft’s MarkItDown for high-fidelity text extraction.

🛠️ Installation & Local Setup

If you want to run this project locally, follow these steps:

  1. Clone the repository:

    git clone https://github.com/Denvermax19/Markdown-Converter.git
    cd markdown-converter
  2. Install dependencies: This project requires Python 3.9+. Install the necessary packages using the requirements file:

    pip install -r requirements.txt

    Note: Ensure your requirements.txt includes streamlit and markitdown[pdf,docx,pptx].

  3. Run the application:

    streamlit run app.py

🚀 How to Use

  1. Upload: Drag and drop your PDF, Word, or PowerPoint file into the uploader.
  2. Convert: Click the "Convert to Markdown" button. The tool will process the file using the MarkItDown engine.
  3. Download: Once the conversion is finished, a download button will appear. Click it to save your new .md file.

Upcoming Features

  1. Images (OCR and llm based description)
  2. Audio (speech transcription)

📦 Dependencies

The tool relies on the following major libraries:

  • Streamlit: For the user interface.
  • MarkItDown: Microsoft's library for converting various files to Markdown.

🔗 Live Demo

Check out the live tool here: Markdown Converter

About

This tool allows you to instantly convert complex documents (PDF, Word, and PowerPoint) into clean, web-ready Markdown format

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages