Các công ty AI đang chuyển từ thuê lao động giá rẻ ở châu Phi và châu Á sang chi mạnh cho chuyên gia cao cấp

Các công ty AI hàng đầu như Scale AI, Turing và Toloka đang từ bỏ mô hình thuê lao động giá rẻ tại châu Phi và châu Á để chuyển sang tuyển dụng chuyên gia được trả lương cao trong các lĩnh vực như sinh học, tài chính, vật lý và lập trình.

Lao động dán nhãn truyền thống từng được trả dưới 2 USD/giờ để thực hiện các nhiệm vụ đơn giản như xác định vật thể trong ảnh, loại bỏ nội dung phản cảm, mô tả hình ảnh, hoặc chọn câu trả lời trôi chảy.

Tuy nhiên, các mô hình AI mới như OpenAI o3 và Google Gemini 2.5 yêu cầu loại dữ liệu huấn luyện có độ phức tạp cao, dẫn đến sự thay đổi hướng sang “dữ liệu từ chuyên gia thật sự”.

Dữ liệu chất lượng từ chuyên gia hiện đóng vai trò thiết yếu giúp các mô hình AI giải quyết bài toán tư duy (reasoning), lập luận theo chuỗi (chain-of-thought) và đưa ra lời giải như con người.

Toloka cho biết phần lớn công việc gán nhãn thủ công có thể tự động hóa, và các dự án mới cần chuyên gia kiểm tra chất lượng nội dung AI tạo ra.

Turing AI cho biết họ trả cho chuyên gia cao hơn 20-30% so với lương hiện tại để thu hút nhân lực hàng đầu từ nhiều lĩnh vực.

Ví dụ, một bài toán vật lý cần cả nhà vật lý, kỹ sư phần mềm và nhà khoa học dữ liệu hợp tác xây mô phỏng, viết mã, kiểm thử, và phân tích kết quả.

Meta đã đầu tư 15 tỉ USD vào Scale AI vào tháng 6/2025, nâng định giá công ty lên 29 tỉ USD. Turing gọi vốn 111 triệu USD vào tháng 3, và Toloka được Bezos rót 72 triệu USD vào tháng 5.

Turing nhấn mạnh mục tiêu hiện nay là mô phỏng quá trình con người thực hiện công việc tri thức để huấn luyện AI làm tốt hơn cả chuyên gia đa ngành.

📌 Ngành AI đang bước vào giai đoạn thay máu toàn diện khi lao động giá rẻ dán nhãn dữ liệu bị thay thế bởi chuyên gia được trả lương cao nhằm tạo ra bộ dữ liệu huấn luyện tinh vi hơn. Việc này giúp các mô hình mới vượt qua rào cản lập luận phức tạp và tiến gần hơn đến siêu trí tuệ. Các công ty AI sẵn sàng chi hàng tỷ USD cho dữ liệu chất lượng, không chỉ cho máy tính và mô hình.

--> Việt Nam cần chuyển dịch từ cung cấp nhân công giá rẻ sang đào tạo lực lượng chuyên gia có khả năng tham gia chuỗi giá trị AI cao cấp, đồng thời hỗ trợ khởi nghiệp AI và tăng cường hợp tác liên ngành để không bị bỏ lại phía sau.

https://www.ft.com/content/e17647f0-4c3b-49b4-a031-b56158bbb3b8

AI groups spend to replace low-cost ‘data labellers’ with high-paid experts

Industry moves away from paying gig economy workers in Africa and Asia in push to build ‘smarter’ models

Melissa Heikkilä in London

Top artificial intelligence groups are replacing low-cost “data labellers” in Africa and Asia with highly paid industry specialists, in the latest push to build “smarter” and more powerful models.

Companies such as Scale AI, Turing and Toloka are hiring top experts in fields such as biology and finance to help AI groups create more sophisticated training data that is crucial for developing the next generation of AI systems.

The rise of so-called “reasoning” models such as OpenAI’s o3 and Google’s Gemini 2.5 has accelerated the move away from employing thousands of low-cost workers in countries such as Kenya and the Philippines, who are typically paid less than $2 an hour to undertake the time-consuming task of annotating the huge datasets used to train AI models.

“The AI industry was for a long time heavily focused on the models and compute, and data has always been an overseen part of AI,” said Olga Megorskaya, chief executive and co-founder of Dutch group Toloka. “Finally, [the industry] is accepting the importance of the data for training.”

This shift has led to a surge of investor interest in data labelling start-ups. In June, Meta invested $15bn in the US group Scale AI, doubling its valuation to $29bn, as part of a push to catch up with its rivals.

In March, California-based Turing AI raised $111mn at a $2.2bn valuation, while Jeff Bezos’ personal firm Bezos Expeditions in May led a $72mn investment round for Toloka.

Previously data labellers would handle simple tasks, such as drawing boxes on images to identify objects, describing what images represent, selecting fluent ways to express things and weeding out bad answers from data sets that often contained violent or graphic content.

Because AI models need more data to perform better, these workers were expected to process tasks in seconds and complete hundreds of tasks during a work day to create vast datasets.

Now, the demand for these tasks has dropped significantly as many of these tasks can be automated, said Megorskaya.

Joan Kinyua, the president of the Data Labelers Association in Kenya, said they were now being tasked with jobs that relied on localised language skills and knowledge. The group has also seen jobs where human labellers were tasked with conducting a final quality control check for AI-generated content.

As leading AI groups such as OpenAI, Anthropic and Google attempt to develop models that they claim will exceed human intelligence, there is a new push to focus on the quality of these datasets and hiring experts to examine complex problems.

“What these models now need is data of a real human using the models to do knowledge work, and getting feedback on when the model is failing,” said Jonathan Siddharth, co-founder and chief executive of data labelling company Turing AI.

To ensure that models perform well in a wide variety of fields from coding to physics and finance, deep-pocketed AI companies are now willing to pay for more sophisticated datasets and experts from around the world.

In order to attract talent from different industries, Turing pays experts 20-30 per cent more than their current jobs, said Siddharth. While budgets for data are only around 10-15 per cent of the hundreds of billions of dollars AI companies spend on computing power, it remains an “enormous amount of money”, he added.

New features and capabilities, such as chain-of-thought, which shows how AI models solve problems step-by-step, are developed by having human experts show how they break down problems, said Toloka’s Megorskaya.

Experienced software engineers might also be asked to come up with tasks that are relevant for their field, and then solve them by writing code, debugging it and checking for security vulnerabilities.

Meanwhile, validating a physics theory would require contributions from a physicist to articulate how to build a simulator to test whether the theory is true, a software engineer to code the simulator, and a data scientist to analyse the results of the simulation.

“The result of this is the model’s not just going to be better than a physicist. It’s going to be better than a superposition of somebody who’s at the top in physics, computer science and data science,” said Turing’s Siddharth.

SongAI

Tin nóng

Các công ty AI đang chuyển từ thuê lao động giá rẻ ở châu Phi và châu Á sang chi mạnh cho chuyên gia cao cấp

AI groups spend to replace low-cost ‘data labellers’ with high-paid experts

Thảo luận

Follow Us

Tin phổ biến

TAG