Meta Unveils CoTracker3 to Improve Tracking by Pseudo-Labelling Real Videos

  • Last updated October 17, 2024
  • In AI News

CoTracker3 is equipped to self-label parts of the data, increasing the quality and quantity of training information without requiring fully annotated datasets.

On October 16, Meta announced CoTracker3, a point-tracking model for video and an upgrade to its CoTracker series. CoTracker3 is designed to handle cases where tracked points move out of view or are temporarily occluded, a common challenge when tracking objects across complex scenes.

The code is available in the project's GitHub repository.

CoTracker3 is trained with a semi-supervised learning method called ‘pseudo labelling’ on real videos. This lets the model self-label parts of the data, increasing the quality and quantity of training information without requiring fully annotated datasets.
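To make the idea concrete, here is a minimal sketch of pseudo labelling for point tracking. It is not Meta's training code: `teacher_track`, `pseudo_label` and `track_loss` are hypothetical names, and the "teacher" is a toy function that moves each point at a fixed velocity, standing in for whatever tracker produces the labels. The point is only that unlabelled videos become training examples once a tracker's own predictions are used as targets.

```python
import random

def teacher_track(video, query):
    """Toy stand-in for a tracker that produces pseudo-labels.
    It moves the query point by a fixed velocity on every frame."""
    x, y = query
    return [(x + 1.0 * t, y + 0.5 * t) for t in range(len(video))]

def pseudo_label(videos, queries_per_video, seed=0):
    """Turn unlabelled videos into (video, query, pseudo_track) triples."""
    rng = random.Random(seed)
    dataset = []
    for video in videos:
        for _ in range(queries_per_video):
            # Sample a query point in an assumed 64x64 frame.
            query = (rng.uniform(0, 63), rng.uniform(0, 63))
            dataset.append((video, query, teacher_track(video, query)))
    return dataset

def track_loss(predicted, target):
    """Mean absolute error between two tracks of equal length."""
    total = sum(abs(px - tx) + abs(py - ty)
                for (px, py), (tx, ty) in zip(predicted, target))
    return total / len(target)

# Unlabelled "videos": only the frame count matters in this toy example.
videos = [[None] * 8, [None] * 12]
dataset = pseudo_label(videos, queries_per_video=3)

# A student that predicts no motion is penalised against the pseudo-labels.
video, query, target = dataset[0]
static_prediction = [query] * len(video)
loss = track_loss(static_prediction, target)
```

In a real pipeline, the loss would drive gradient updates of a student network, and the videos would be actual footage rather than placeholders.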

According to the researchers, CoTracker3's simple semi-supervised training protocol lets it surpass trackers trained on 1,000 times more videos. They also say that, by tracking points jointly, CoTracker3 handles occlusions better than any other model, particularly when run offline.

Meta said this model can be used as a building block for tasks requiring motion estimation, such as 3D tracking, controlled video generation, or dynamic 3D reconstruction.

The researchers also said that CoTracker3 outperformed the state of the art on TAP-Vid and other benchmarks, as its architecture combines several ideas from recent trackers and eliminates unnecessary components.

Available in both online and offline versions, the model can be explored live by developers and researchers on Hugging Face. Its utility spans multiple domains, such as augmented reality, robotics, and sports analytics, where accurately tracking object motion is essential.

Meta has also made the model and associated resources available under an A-NC (Attribution-NonCommercial) licence to facilitate further research.

Earlier this year, Meta also introduced the Video Joint Embedding Predictive Architecture (V-JEPA), which predicts the missing parts of videos without needing to recreate every detail. It learns from unlabelled videos, so it doesn’t require data that humans have categorised to start learning. This improves machines’ understanding of the world by analysing interactions between objects in video.

Tanisha Bhattacharjee

Journalist with a passion for art, technological development and travel. Discovering the dynamic world of AI, one article at a time.

25th Oct 2024

© Analytics India Magazine Pvt Ltd & AIM Media House LLC 2024
