Clarifai’s Post

View organization page for Clarifai, graphic

72,763 followers

1mo Edited

Object Detection Using Florence-2 🔥 The recently released Florence-2 model demonstrates strong zero-shot capabilities across tasks such as captioning, object detection, grounding, and segmentation. Below is an example of using the model for an object detection task and getting the bounding box coordinates for an image. The model is available on the Clarifai Platform. Try it out: https://lnkd.in/gxqa9nYp #objectdetection #computervision #vllm

To view or add a comment, sign in

More Relevant Posts

Valerio Marsocci

postdoc at KU Leuven - SSL for EO
4mo Edited
Report this post
#10 DOFA Multimodality is one of the keys to remote sensing foundation models. Most of the methods, however, focus on just one of them. DOFA employs an innovative approach utilizing wavelength as a unifying parameter across various EO modalities to achieve a more cohesive multimodal representation. DOFA is trained using a masked image modeling strategy, and a distillation loss is included to further optimize its performance. what I liked the most: the wavelength approach paper: https://lnkd.in/dTBSEHk4 #50papers #AI4EO #remotesensing #deeplearning #GeoAI
6 Comments
Like Comment
To view or add a comment, sign in
Tanat Tonguthaisri, CISSP®

enabling digital services for Student Loan related activities while maintaining the highest security standard, the most compliant personal data protection and customer-centric data-driven innovation.
7mo
Report this post
🚀 Excited to share our latest blog post on Gaussian-SLAM: Photo-realistic Dense SLAM with Gaussian Splatting. Our new dense SLAM method uses Gaussian splats as a scene representation, enabling interactive-time reconstruction and photo-realistic rendering of real-world and synthetic scenes. We propose novel strategies for seeding and optimizing Gaussian splats, extending their use to sequential monocular RGBD input data setups. Check out the full article here: https://bit.ly/41pSYJn. #SLAM #ComputerVision #GaussianSplatting
Like Comment
To view or add a comment, sign in
SQUARS

536 followers
11mo
Report this post
What happens during the process of augmenting physical objects? How does augmentation actually work? Before the camera can augment the image or the object, it has to recognize it first. Image and object recognition are broad terms that cover various computer vision tasks. Image recognition includes such components as image detection, pattern matching, content overlay, and tracking with interaction. Object recognition in its turn consists of object detection, feature extraction, matching & tracking, and content overlay with interaction. Slide into this week’s Knowledge Bites and check out our Knowledge Base for more information 👉 https://lnkd.in/d9NCMYik #SQUARS #WebAR #KnowledgeBites #ARmarketing #AR #education #technology
Like Comment
To view or add a comment, sign in
Surinder Bharti

Educationist
3mo
Report this post
🔍 Unlock Geometry Secrets! 📐 Dive into today's question: In the figure, altitudes AD and CE of ∆ABC intersect at point P. Discover the fascinating relationships: (i) ∆AEP ~ ∆CDP (ii) ∆ABD ~ ∆CBE (iii) ∆AEP ~ ∆ADB (iv) ∆PDC ~ ∆BEC. Explore more at www.qmaths.com! #Geometry #QOTD #Mathematics
Like Comment
To view or add a comment, sign in
Patrick Hearin Ph. D.
10mo
Report this post
My capstone on object detection and image classification on stroke images.
Like Comment
To view or add a comment, sign in
Letitia Parcalabescu

PhD. 👩💻 AI Researcher | 📺 YouTuber @AICoffeeBreak | 🧠 Machine Learning
5mo
Report this post
We simply explain and illustrate Mamba and (Selective) State Space Models – SSMs. SSMs match performance of transformers, but are faster and more memory-efficient than them. This is crucial for long sequences! 📺 https://lnkd.in/en6Vapn9 Incredible work by Albert Gu and Tri Dao! 👏 #SSM #researchpaper #video #artificialintelligence

MAMBA and State Space Models explained | SSM explained

https://www.youtube.com/

2 Comments
Like Comment
To view or add a comment, sign in
Remote Sensing MDPI

36,429 followers
2w
Report this post
🔥 #hottopic Multiscale Pixel-Level and Superpixel-Level Method for Hyperspectral Image Classification: Adaptive Attention and Parallel Multi-Hop Graph Convolution by Junru Yin, et al. ➡️ https://brnw.ch/21wLoTt
Like Comment
To view or add a comment, sign in
The Classes

2 followers
2mo
Report this post
A powerful tool for solving systems of equations, transforming geometric objects, and modeling real-world phenomena. #Mathematics #Matrix #Algebra #DataScience#TheClasses
Like Comment
To view or add a comment, sign in
Bruno Neri

Technical Leader - Artificial Intelligence and Deep Learning Enthusiast - Senior Software Engineer at ALTEN Italia
11mo
Report this post
"SEM-GAT: Explainable Semantic Pose Estimation using Learned Graph Attention" by Efimia Panagiotaki, Daniele De Martini, Georgi Pramatarov, Matt G., and Lars Kunze "This paper proposes a GNN-based method for exploiting semantics and local geometry to guide the identification of reliable pointcloud registration candidates. Semantic and morphological features of the environment serve as key reference points for registration, enabling accurate lidar-based pose estimation. Our novel lightweight static graph structure informs our attention-based keypoint node aggregation GNN network by identifying semantic instance-based relationships, acting as inductive bias to significantly reduce the computational burden of pointcloud registration. By connecting candidate nodes and exploiting cross-graph attention, we identify confidence scores for all potential registration correspondences, estimating the displacement between pointcloud scans. Our pipeline enables introspective analysis of the model's performance by correlating it with the individual contributions of local structures in the environment, providing valuable insights into the system's behavior. We test our method on the KITTI odometry dataset, achieving competitive accuracy compared to benchmark methods and a higher track smoothness while relying on significantly fewer network parameters." Paper: https://lnkd.in/dE2DXNHg #graphneuralnetworks #computervision
Like Comment
To view or add a comment, sign in
Reynel Rodriguez

Reality Capture / Surface Reconstruction and Innovation R&D at Eolian.
1mo Edited
Report this post
This video depicts the difference in Surface Reconstruction quality between Photogrammetry, Neural Radiance Fields (NeRFs), 3D Gaussian Splats (SuGAR) and the latest 2D Gaussian Splats. I have been testing the code for different approaches and thought it would provide a visual reference as to where all of these technologies stand as of this moment. The dataset contains 27 images. I will write a more detailed comparison between the pipelines in a later post.
Like Comment
To view or add a comment, sign in

Clarifai

72,763 followers

View Profile Follow

More from this author

Explore topics