Ever found yourself battling with equations while writing papers in LaTeX, wishing there was a way to just snap a picture and boom—LaTeX code? Well, I did too. After too many late nights and too much coffee, I finally decided to do something about it. The result? Quack2Tex.
Quack2Tex is a handy tool that facilitates turning images of math equations and charts into LaTeX code, quickly and easily!. But it doesn't stop there! It also has cool features like guessing the location from a photo, identifying recipes from food pictures, and explaining code from images. Rendered as a floating menu on your screen, Quack2Tex is always at your fingertips, making it easy to access anytime you need it.
-
2024-10-18: Quack2Tex v1.0.0 is out! 🎉
- Added functionality for users to customize the rubber-duck menu. A new "Settings" option allows users to create and manage custom actions.
- Action Grouping: Users can now group actions in the settings menu for better organization and streamlined access.
- Multimodal Selection: Users can now choose which model or combination of models to use for generating outputs, enhancing flexibility and multimodel inference.
- New Input Mode: A "Clipboard" input mode has been introduced, enabling users to use clipboard content as input for selected actions.
-
2024-9-10: Quack2Tex was released! 🎉
- Image to LaTeX: Convert pictures of equations or symbols into LaTeX code—no more manual typing!
- Location Guessing: Upload a photo, and Quack2Tex will try to figure out where it was taken.
- Recipe Finder: Snap a picture of your meal, and Quack2Tex will tell you what dish it is.
- Code Explainer: Got a screenshot of code? Quack2Tex can explain what it does.
See the video below for a demo of Quack2Tex in action:
Under the hood, Quack2Tex leverages state-of-the-art multimodal models like Gemini, GPT-4o, and Lava to analyze the content in the images. Whether you're converting handwritten notes into LaTeX or identifying the location of a stunning sunset photo, Quack2Tex has you covered.
To get started with Quack2Tex, follow these steps:
pip install quack2tex
In the terminal, run the following command:
quack2tex --google_api_key <GOOGLE_API_KEY> --openai_api_key <OPENAI_API_KEY>
make you set the env variables GOOGLE_API_KEY
and/or OPENAI_API_KEY
for using google and openai models respectively.
You can also use LLava models through the ollama api.
You can also run the app from python, see the main.py
file for an example.
- Support clipboard copy to be used in the prompt
- Support gemini, gpt-4o, and lava models
- Allow user add custom actions to the rubber-duck menu
- Persist images and results in a database
- Create window, mac, and linux executables
Want to help make Quack2Tex better? Feel free to contribute by following these steps:
- Fork the repo.
- Create a new branch.
- Make your changes.
- Commit and push your changes.
- Open a Pull Request.
If you run into any problems, check out the Issues section on GitHub.
Quack2Tex is open-source and available under the MIT License—see the LICENSE file for more details.
Got questions? You can reach out to me at henryruiz22@gmail.com.