vision

Here are 584 public repositories matching this topic...

Skyvern-AI / skyvern

Automate browser-based workflows with LLMs and Computer Vision

python api workflow automation browser computer vision gpt browser-automation rpa playwright llm

Updated May 19, 2025
Python

PaddlePaddle / PaddleHub

Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving)

nlp awesome deep-learning model vision text2image

Updated Aug 7, 2024
Python

autorope / donkeycar

Open source hardware and software platform to build a small scale self driving car.

python raspberry-pi tensorflow keras vision self-driving-car cv2 donkeycar jetson-nano

Updated Sep 15, 2024
Python

VainF / Torch-Pruning

[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.

transformers vision pruning model-compression efficient-deep-learning llm

Updated Apr 25, 2025
Python

sightmachine / SimpleCV

The Open Source Framework for Machine Vision

python computer-vision cv image-processing vision visionprocessing

Updated Dec 20, 2024
Python

andyzeng / tsdf-fusion-python

Python code to fuse multiple RGB-D images into a TSDF voxel volume.

cuda artificial-intelligence vision rgbd 3d 3d-reconstruction depth-camera volumetric-data 3d-deep-learning tsdf kinect-fusion

Updated Feb 18, 2023
Python

lucidrains / mlp-mixer-pytorch

An All-MLP solution for Vision, from Google AI

deep-learning vision

Updated Sep 13, 2024
Python

andyzeng / visual-pushing-grasping

Train robotic agents to learn to plan pushing and grasping actions for manipulation with deep reinforcement learning.

computer-vision deep-learning robotics deep-reinforcement-learning artificial-intelligence vision manipulation grasping 3d pushing

Updated May 11, 2021
Python

deepdrive / deepdrive

Deepdrive is a simulator that allows anyone with a PC to push the state-of-the-art in self-driving

python competition control reinforcement-learning deep-learning simulation tensorflow deep-reinforcement-learning vision gym self-driving-car unreal-engine transfer-learning sensorimotor

Updated Oct 3, 2023
Python

valentinfrlch / ha-llmvision

Let Home Assistant see!

ai vision home-assistant image-analysis hacs-integration llm

Updated May 19, 2025
Python

jasmcaus / caer

High-performance Vision library in Python. Scale your research, not boilerplate.

Updated Oct 13, 2023
Python

lucidrains / bottleneck-transformer-pytorch

Implementation of Bottleneck Transformer in PyTorch

deep-learning transformers artificial-intelligence vision image-classification attention-mechanism

Updated Sep 20, 2021
Python

google-research / ravens

Train robotic agents to learn pick and place with deep learning for vision-based manipulation in PyBullet. Transporter Nets, CoRL 2020.

reinforcement-learning computer-vision deep-learning robotics tensorflow openai-gym artificial-intelligence vision manipulation imitation-learning pybullet rearrangement pick-and-place transporter-nets

Updated Jul 30, 2024
Python

anki / vector-python-sdk

Anki Vector Python SDK

robot ai robotics vector vision anki

Updated Jan 17, 2023
Python

RobotLocomotion / pytorch-dense-correspondence

Code for "Dense Object Nets: Learning Dense Visual Object Descriptors By and For Robotic Manipulation"

computer-vision deep-learning robotics pytorch artificial-intelligence vision manipulation 3d self-supervised-learning

Updated May 9, 2023
Python

mees / calvin

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

natural-language-processing computer-vision deep-learning robotics pytorch vision manipulation vision-and-language grounding vision-language

Updated Feb 14, 2025
Python

rowanz / neural-motifs

Code for Neural Motifs: Scene Graph Parsing with Global Context (CVPR 2018)

pytorch vision scene-graph visual-genome

Updated Aug 9, 2019
Python

davidbau / rewriting

Rewriting a Deep Generative Model, ECCV 2020 (oral). Interactive tool to directly edit the rules of a GAN to synthesize scenes with objects added, removed, or altered. Change StyleGANv2 to make extravagant eyebrows, or horses wearing hats.

machine-learning research deep-learning graphics vision hci gans

Updated Oct 29, 2023
Python

HRNet / HRFormer

[NeurIPS 2021] This is an official implementation of our paper "HRFormer: High-Resolution Transformer for Dense Prediction".

transformer vision classification segmentation pose-estimation hrnet

Updated Oct 19, 2022
Python

ictnlp / LLaVA-Mini

LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in an efficient manner.

video efficient vision llama multimodal large-language-models vision-language-model llava visual-instruction-tuning multimodal-large-language-models gpt4v large-multimodal-models gpt4o

Updated Jan 13, 2025
Python
