Automate browser-based workflows with LLMs and Computer Vision
Updated May 19, 2025 - Python
Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving)
Open source hardware and software platform to build a small scale self driving car.
[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.
The Open Source Framework for Machine Vision
Python code to fuse multiple RGB-D images into a TSDF voxel volume.
Train robotic agents to learn to plan pushing and grasping actions for manipulation with deep reinforcement learning.
Deepdrive is a simulator that allows anyone with a PC to push the state of the art in self-driving.
Let Home Assistant see!
High-performance Vision library in Python. Scale your research, not boilerplate.
Implementation of Bottleneck Transformer in PyTorch
Train robotic agents to learn pick and place with deep learning for vision-based manipulation in PyBullet. Transporter Nets, CoRL 2020.
Code for "Dense Object Nets: Learning Dense Visual Object Descriptors By and For Robotic Manipulation"
CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
Code for Neural Motifs: Scene Graph Parsing with Global Context (CVPR 2018)
Rewriting a Deep Generative Model, ECCV 2020 (oral). Interactive tool to directly edit the rules of a GAN to synthesize scenes with objects added, removed, or altered. Change StyleGANv2 to make extravagant eyebrows, or horses wearing hats.
[NeurIPS 2021] Official implementation of the paper "HRFormer: High-Resolution Transformer for Dense Prediction".
LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in an efficient manner.