Resources

Blog

What does it mean to realize the next era of AI development? Learn more on our blog.

Featured Post

Read More

  • All
  • AI
  • AI Practitioners
  • Autonomous Transportation
  • Best Of
  • Blog
  • Company News
  • Consumer & Media
  • Customer Story
  • Data Annotation
  • Ethical AI
  • Events
  • Impact
  • Machine Learning
  • Product
  • Retail & E-commerce
  • Sama Engineering
  • Security & Trust
  • Training Data
  • Vector Annotation

It’s Time to Redefine Data Quality for Machine Learning

In ML, 100% data quality isn’t necessary – it isn’t even feasible. Learn how to find the quality “sweet spot” to get your models performing as desired.

Overcoming Data Labeling Challenges for Smart Checkout

More shopping doesn’t need to lead to longer lines. Many retailers have implemented self-checkout lines to help shoppers get their errands done more quickly.

ML Object Tracking for Video Annotation: Greater Throughput, Zero Sacrifice

Sama is investing in ML-powered tools to optimize efficiency for your video annotation projects; introducing ML object tracking to increase throughput without sacrificing quality.

New from Sama: Faster, More Cost-Effective Drivable Area Annotation for 3D Point Cloud Data

Sama is thrilled to announce new and improved features for 3D point cloud annotation to provide our clients with the most secure, cost-effective, and fastest path to quality 3D drivable area annotation at scale.

virtual-try-on-fiiting-rooms

Virtual Fitting Rooms: Solving Data Labeling Challenges for Fashion and Footwear

Many online shopping experiences end in disappointment. In 2020, $102 billion worth of products purchased online were returned to retailers — about 18% of all online sales.

ensuring-data-quality-autonomous-vehicles

Defining, Measuring, and Guaranteeing Quality for Autonomous Driving

Good annotation and testing practices are the foundations of building a great model. However, understanding what constitutes quality data is a tricky question.

Sama’s Contribution to Advancing AI Recognized

Almost 15 years ago, Leila Janah founded Sama on the belief that ​​“talent is equally distributed, but opportunity is not.” Her vision — to give work, not aid — has since produced ripple effects throughout East Africa and the world.

Data Labeling Smart Homes

Accurate Data Labeling Makes Your Smart Homes Smarter

Smart homes and appliances empower homeowners to remotely control and program appliances. They wouldn’t be so “smart” without high-quality labeled data.

customer-shopping

4 Game-Changing Applications of LiDAR in Retail

Customers walk around stores, browse displays, walk down aisles, and stop to consider products. From the outside, it may not appear there is much to glean from these behaviors.

Data Curation Capabilities, Enhanced 3D & LiDAR Annotation Features, and Self-Service Platform Access

Today, Sama is thrilled to announce a host of new offerings which will allow us to continue to power our customers’ ML models with unmatched accuracy, scale, and efficiency.

Hitting Revenues Out of the Park with Accurate Data for Sports Analytics

In sports, AI can drive ticket and merchandise sales by studying purchase patterns from a large number of fans. AI is used in sports both on and off-pitch.

10 Frequently Asked Data Labeling Questions

Here are some of the most frequently asked data labeling questions, along with recommendations for approaching your data annotation strategy holistically.

Building a Robust Automotive LiDAR Annotation Quality Rubric

The purpose of a LiDAR annotation quality rubric is simple: it ensures that two people will score the same object in identical ways.

A Message From Our CEO

“Like so many immigrant children, I learned to believe in a dream that is as much American as it is universal: a dream of equal opportunity.” – Leila Janah

What TIME Got Wrong

Continuous improvement and excellence are at the heart of everything we do, and we invite anyone, in and outside of the company, to help us on our journey.

Sama by the Numbers

Sama is committed to providing our workforce with professional development and upskilling opportunities, benefits, and a living wage.

Orbisk is Using Accurate AI to Help Restaurants Reduce Food Waste Up to 70%

Orbisk is focused on limiting the amount of food waste produced in restaurants, hotels, and cafes with their AI-powered food waste monitoring solution.

In-House vs Outsourcing Data Annotation for ML: Pros & Cons

In-house data annotation can be expensive but can be helpful in early stages of ML production. Outsourcing data annotation is cheaper but security can be compromised.

Sama’s Experiment-Driven Approach to Solving for High-Quality Labels at Scale

A recent academic paper outlines how Sama uses Experiment-Driven Development to measure how improvements made to our platform increase annotation efficiency.

Swift Medical is Reimagining Patient Care with AI-Powered Wound Monitoring

Swift partnered with Sama to cost-effectively scale data annotations for their CV model, without compromising the clinician-level accuracy they needed.

Sama Partners with Mila to Solve Key Problems in AI Development

Sama is thrilled to partner with research center Mila, to help further the mission of creating innovative technologies to drive the AI industry forward.

Orbisk’s Sama-Powered Food Waste Solution Named to Fast Company’s First-Ever List of the Next Big Things in Tech

Sama has been recognized in Fast Company’s list of the Next Big Things in Tech for our work with our partners at Orbisk.

Accurate Data Labeling Powers the Volumental Shoe Recommendation App — Helping Retailers Convert Mobile Customers

Learn how Volumental partnered with Sama to accurately label the datasets that fuel the computer vision technology for their mobile foot scanning app.

Getting to Series B: How Sama is Proving Impact is a Strategy for Business Success

Last week, Sama secured Series B, cementing the viability of purpose-driven companies—and hopefully inspiring others to pursue purpose and profitability.

Sama Raises $70M Series B to Build a More Accurate, More Ethical, End-to-End AI Development Pipeline

Sama has raised $70M Series B funding to build the first end-to-end AI platform that enables teams to manage the complete AI lifecycle.

How More Accurate Data Labeling is Helping PolyPerception Advocate for Responsible Waste Management

Find out how accurately labeled data is helping PolyPerception provide material recovery facilities with better visibility into their waste streams.

ML Assisted Annotation Powered by MICROMODEL Technology

ML Assisted Annotation can help you generate high-quality pre-labeled and human-assisted annotations, for predictably higher quality data in half the time.

New White Paper by Partnership on AI: Responsible Sourcing of Data Enrichment Services

This whitepaper by Partnership on AI examines the working conditions of data enrichment professionals, and what can be done to improve them.

How Sama’s Accurate AI is Helping Blind Runners Run Independently

Project Guideline by Google partnered with Sama to help people who are blind run without a guide, using only a smartphone, headphones, and a yellow guideline.

Innovation Week: How Sama Builds a Culture of Experimentation

Sama’s third annual Innovation Week is coming to a close, and once more, our teams have given us plenty to be excited about.

RCT Results from MIT: Evaluating the Impact of Sama’s Training and Job Programs

This week, researchers at MIT released a white paper evaluating Sama’s impact through a three-year Randomized Controlled Trial study. Here are their findings.

How Sama Powers Tribe Dynamics to Measure Your Influencer Marketing Efforts

Tribe Dynamics helps customers get better ROI from influencer programs. Find out how partnering with Sama helped Tribe better serve clients and expand into new markets.

Challenges & Solutions for 3D LiDAR Annotation & 3D Data Sets

In this webinar, discover the challenges & solutions to 3D LiDAR annotation & 3D data sets for solving autonomous driving or driver assistance.

Experts Explain: How to Think About Human-Centered Machine Learning

We asked experts working in the field about their thoughts on the role of humans in Machine Learning, and humans and the future of ML. 

How to Define and Measure Your Training Data Quality

How do you define training data quality and measure it? How do you improve it? We go into defining, measuring, and reviewing your training data quality.

10 Experts on the Biggest Roadblocks to Bringing ML Models to Production

87% of AI projects will never make it into production. Why? We asked ML experts.

Sama’s Gold Tasks: ML Training Data with Gold-Standard Quality

Sama is an expert in efficiently designing annotation guidelines that enhance data quality. Gold Tasks refer to tasks that have been annotated perfectly.

Part 3: A/B Testing with Python

In this series of three we’ll go into Experiment Driven Development and A/B Testing. EDD is fact-based development: based on evidence, not intuition.

Supercharge Your Data Quality with Automated Quality Accelerators

Automated quality accelerators are technology innovations that are focused on reducing the amount of manual quality assurance time spent in QA processes.

Part 2: A/B Testing

In this series of three we’ll go into Experiment Driven Development and A/B Testing. EDD is fact-based development: based on evidence, not intuition.

Part 1: Experiment Driven Development

In this series of three we’ll go into Experiment Driven Development. EDD is fact-based development: based on evidence, not intuition.

10 Experts Give Reasons Why High-Quality Training Data is so Important

We reached out to various ML experts, asking them the questions: Why is high-quality training data so important? Why do so many projects fail in ML?

Factotum: Containerizing DevOps Tools for Cloud Native Engineering and CI/CD

Introducing Factotum: an MIT-licensed, open source, kubernetes-oriented, general purpose docker container for devs/devops and custom CI/CD pipelines.

What’s next? 17 Machine Learning Predictions for 2021

2021 Predictions: We asked a range of ML experts about what they believe will be the next big thing in AI and Machine Learning.

The Sama MLOps Pipeline: Automating Model Training on the Cloud

The Sama MLOps Pipeline: At Sama we decided to build our own automated training pipeline in order to limit costs, and to avoid tying ourselves to a particular cloud provider. 

We Are Now Sama: Accurate Data For Ambitious AI

Samasource is now Sama, the same team that powers the world’s most ambitious AI projects with high-quality and accurate data, but with a new name that represents our vision moving forward.

10 Must-Read Machine Learning Books

There’s no shortage of literature about ML, but we’ve compiled a list of 10 must-read books to add to your list!

Fast Vector Annotation with Machine Learning Assisted Annotation

In this article we summarize an approach that we have developed to speed up polygonal instance segmentation using machine learning.

Meet Our New CEO: Wendy Gonzalez

Sama is thrilled to announce that Wendy Gonzalez is our new Chief Executive Officer.

Custom Keypoint Shapes for Vector Image & Video Annotation

Announcing our support for custom keypoint shapes in our training data platform trusted by the world’s leading AI teams, for vector image and video annotation.

12 Women in Machine Learning to Watch

Here’s a celebratory list of some of the women we look up to and have spearheaded development in AI and Machine Learning in 2020.

Code.Jam(2020)-McGill Hackathon: and the winner is A Virtual Fitting Room

For the second consecutive year, Sama was a Terabyte partner of the McGill Engineering Hackathon, the largest annual hackathon run by the McGill Electrical, Computer, and Software Engineering Student Society.

From Dreams to Reality: Our Journey to Becoming a Certified B Corporation

We’re humbled and honored to announce that, as of today, Sama is the first AI company to receive the prestigious B Corp certification.

5 Examples of AI You Didn’t Know You Used

AI is often framed as something that’ll change our future, but many people aren’t aware of quite the extent to which AI currently used in society and everyday life.

Sama Wins 2020 Artificial Intelligence Breakthrough Award for Best Image Processing Solution

Through our work with Vulcan, we have been awarded the winner of the 2020 AI Breakthrough Awards for the Best Image Processing Solution. 

We made the Inc. 5000 list!

Sama is honored on Inc. Magazine’s Annual List of America’s Fastest-Growing Private Companies—the Inc. 5000

The Training Data Challenge: 5 AI Fails

From accidental Alexa purchasing to bias in recruitment, we have gathered 5 AI fails from the last few years.

“Samasource Opens Kampala”

Underscoring Our Commitment to Racial Justice and Equality

We’ve decided to take action both inside our organization and in partnership with local organizations to drive toward a more equitable society.

Data Protection and Privacy for Training Data

The growth of popularity in AI has been mirrored by a growing number of concerns surrounding privacy, security and ethical use of data.

Introducing Chloe: A ChatBot to Help You Get Accurate Health Information During COVID-19

We’re excited to introduce Chloe, an automated public chatbot service to support in the fight against the coronavirus.

Fast Company Names Sama a 2020 World Changing Ideas Finalist in AI and Data

Fast Company has recognized Sama as a finalist in the AI and Data category as part of their 2020 World Changing Ideas Awards.

traffic light

The Traffic Light Problem for Autonomous Vehicles

The traffic light problem for autonomous vehicles is critical for all vehicle safety, and unlike human-drivers, AVs rely solely on computer vision systems to navigate the world around us.

Celebrating 50 Years of Earth Day: Honoring Leila Janah’s Legacy

Creating lasting social impact, understanding our carbon footprint and being environmentally sustainable are just few of our goals for sustainability and impact.

Object Tracking with Frame Level-Labeling

Sama video and 3D object tracking with frame-level labeling assists companies in quickly building models that better reflect real-world behavior.

How Sama is Closing the Gender Pay Gap for Women in the AI Supply Chain

Sama’s commitment to an ethical AI supply chain enables us to close the gender pay gap, by paying a living wage to all of our employees. 

4 Ways AI Makes a Positive Impact on Communities in East Africa

As AI adoption expands, untapped communities are finding work at the cutting edge of AI. Here are 4 ways AI makes a positive impact on communities in East Africa.

Fighting AI Bias by Obtaining High-Quality Training Data

During the REWORK Deep Learning Summit, Sama shared how top organizations obtain secure, high-quality training data, fighting AI bias in the process.

8 Answers to Your Questions About AI and Machine Learning

In this interview, we chat with Head of AI at Sama about AI trends to expect in 2020, as well as frequently asked questions about AI and machine learning.

Highlights from McGill CodeJam 2019

Sama was proud to sponsor CodeJam 2019, an annual hackathon at McGill University, from November 15 – 17, 2019.

High-Quality Labels Power Accurate Search for Walmart’s 385 Million Online Visitors

When Walmart set out to improve e-commerce search relevance, it needed a labeling partner who could scale with demand without sacrificing quality.

4 Compelling Reasons to Use AI in E-commerce

When we work with companies that want to use AI in e-commerce, we notice a few common barriers in AI adoption. Here are 4 reasons e-commerce needs AI to stay ahead.

3 Takeaways from ICCV 2019

ICCV 2019 provided a welcoming platform for the distribution and discussion of scholarly and technical work in computer vision.

What Nike’s Use of AR in E-commerce Means for the Retail Industry

From improved shopping experiences to increased buyer’s confidence, here’s what Nike’s use of AR in e-commerce means for the retail industry.

Computer Vision Insights From Around the Web

This list of computer vision insights shares how artificial intelligence is learning to understand and relate to the intensely visual world around us.

4 Things That Make a Difference in Data Security

Here are a few things to consider, to strengthen your data security practices.

6 TED Talks to Watch on AI Ethics

Here are 6 TED talks on AI Ethics anyone working in artificial intelligence should watch.

What’s Holding Back Artificial Intelligence?

Data isn’t the only thing holding back artificial intelligence. Read more about some of the challenges and trends in AI.

Revamped 2D Vector Segmentation

Sama’s revamped toolset for 2D image vector segmentation is ideal for computer vision projects using vector shapes to structure training data.

The Ethical AI Supply Chain: Protecting the Soul of AI

Most media coverage discuss bias, fairness, and ethical use of AI, but the humanitarian aspect of the AI supply chain is often overlooked.

Highlights from CVPR 2019

From facial recognition to AR/VR, computer vision is changing the way we interact with the world around us. Here are some highlights from CVPR 2019.

From Around the Web: 5 Computer Vision Infographics

We’ve collected 6 computer vision infographics on everything from visual search to the history of computer vision from around the web.

10 Organizations Leading the Way in Computer Vision

Here are 7 organizations attending CVPR 2019 who are leading the way in computer vision, plus 3 noteworthy companies from around the web.

56 Students Use Data Science to Reduce Poverty and Income Inequality

The Sama Hackathon in Costa Rica encouraged 56 students to look for ways to use data science to help reduce poverty and income inequality.

13 Open Source Datasets for Machine Learning

13 open source datasets for machine learning, including one dataset featured in the Fine-Grained Visual Categorization (FGVC) workshop at CVPR 2019.

Moving Toward Level 4 Autonomous Driving

Kirk Boydston, Training Data Specialist at Sama shares five considerations to move your machine learning model toward level 4 autonomous driving.

How Vulcan is Using AI for Wildlife Conservation

AI-enabled products come with their share of challenges. Here’s how Vulcan partnered with Sama to use artificial intelligence for wildlife conservation.

Highlights from the 2019 Embedded Vision Summit

Sama exhibited and presented at the venerable 2019 Embedded Vision Summit in Santa Clara, California.

Measuring Impact for a Social Enterprise

As a social business, we often face the dual challenge of generating social impact while balancing business needs. Learn how we’re measuring impact.

4 Training Data Strategies to Avoid Bias

During Embedded Vision Summit 2019, Audrey Boguchwal will share four training data strategies that help AI teams avoid training data bias.

Training Your AI in 3D

Today, we’re announcing the production availability of our new 3D annotation engine for the Sama.

AI Events to Attend in Fall of 2018

To strike the balance between the latest insights from respected industry leaders and practical machine learning tactics to apply to real use cases, here’s our list of AI events to attend.

Introducing Object Tracking with Video Annotation

Today, Sama announces the availability of our latest image annotation toolset for advanced video object tracking.

Machine Learning 101

In this post, we’ll present a simple overview of machine learning and how it helps computers solve complex problems.

Why ISO Certification Matters: Choosing the Right Training Data Partner

ISO certification is a clear indicator that products and services meet the expectations of customers and that your data partner is qualified.

How Quid Creates Reliable Business Intelligence

We’ll showcase how Sama’s web research and data cleaning services help create training data for Quid to build their NLP-powered data platform.

What is Data Collection and Why Do You Care?

Data collection is a systematic strategy for gathering and measuring information from a variety of sources to get an accurate picture about a specific area of interest.

What’s the Latest with Lidar and Point Cloud Annotation?

What is LiDAR and why is it such a hot topic? What are some of the key challenges around point cloud annotation? Find out in this post.

Takeaways from AutoAI Conference 2018

Last week, Sama visited the Auto.AI event, which bills itself as the platform bringing together the stakeholders who play an active role in the deep driving, computer vision, and sensor fusion.

The Advantages and Limitations of Synthetic Data

With increased buzz around synthetic data, it is important to understand the advantages and limitations of this solution, and the overall affect on the application.

What is Synthetic Data?

Synthetic data is system-generated data that mimics real data in terms of essential parameters set by the user.

What is Training Data?

The best way for a computer to gain knowledge is to start by showing it exactly what it is you want it to do. For this, we use training data.

Better Algorithms, Better Lives: Reducing Poverty Through Training Data

We spotlight one of our clients, Markable, to show how our image annotation work can be used in ML to train image recognition algorithms.

How Sama Moves People Out of Poverty with Digital Work

Sama’s mission is to connect low-income people to digital work. This post explains the specifics of how this is done.

3 Computer Vision Applications You Haven’t Heard Of…Yet

While certain applications of ML are prevalent in the media, there are lesser-known applications that are similarly revolutionary

Winning Customers with Algorithms: How Teams in Nairobi Help Shape Your Shopping Experience

Termed “planogramming,” visual merchandising is key in retail stores. The best stores find a balance between exciting customers without overwhelming them with deals shouting from every corner.

Davos: It’s Not Enough to Do Less Bad (5 Tech and Impact Trends)

Here are five key tech and social impact trends from the 2016 World Economic Forum, starting with: It’s Not Enough to Do Less Bad.

High-Quality Training Data From Start to Scale.