Tech Firms Can Easily Identify You Using Anonymized Data On Yourself
Chitanis
Researchers have recently pointed out that even if your personal information has been anonymized, advanced technology can still identify you.
- 3.2 Billion Email And Password Pairs Have Been Leaked, Here's How To Check If You Are Affected
- Company Uses Smart Seat Cushions For Staff Monitoring
- IBM And Fujifilm Team Up To Create Magnetic Tape With World-Record 580TB Storage
Just by living in this modern world, you are giving up a lot of your personal info to many services and institutions. Many places promise that they will keep your data as private and secure as possible, but in fact, they often share your anonymized data to some third parties either for profit or for research. But the new research shows that anonymized data isn’t so anonymous.
Recently, the Imperial College London’s researchers published their paper to show techniques currently used to anonymize data sets are insufficient. Before sharing a dataset, companies will delete identifying information (names, e-mail addresses, etc.). But even if identifiable factors were excluded from the dataset, it isn’t difficult to match definite information and find out who is the user of that data set, with high accuracy.
The researchers used 210 datasets for the analyses. These datasets were collected from 5 sources. It also includes the US government, which has over 11 million individuals’ information. According to the study, by using a machine learning model along with datasets including 15 identifiable factors (gender, birth date, age, marital status, ZIP code, etc.), the researchers can reidentify up to 99.98% of people in an anonymized data set.
The study offered a hypothesis, a health insurance company issues a data set of 1,000 anonymous customers, which is 1% of the total customers of the company in California. This data set includes the ZIP code, gender, date of birth and diagnosis of breast cancer. One of these individuals’ boss finds out that there was a man, who has the same date of birth and ZIP code, and base on the data set, is having breast cancer and his stage IV treatments didn't succeed. However, the health insurance company is able to say that, even if this unique data of the employer and the record in their company’s file match, it could be anyone else among tens of thousands of people insured at that company.
There are a lot of companies are now collecting data sets that can provide enough information to identify someone, and the fact that the researchers are able to reidentify users by using only 15 identifiable characteristics shows that we really need to reevaluate what creates an ethical anonymized dataset.
According to the researchers, policymakers have the responsibility to make better standards for all of the anonymization techniques to make sure that the sharing of data sets will stop becoming an invasion of privacy
Featured Stories
Features - Jul 01, 2025
What Are The Fastest Passenger Vehicles Ever Created?
Features - Jun 25, 2025
Japan Hydrogen Breakthrough: Scientists Crack the Clean Energy Code with...
ICT News - Jun 25, 2025
AI Intimidation Tactics: CEOs Turn Flawed Technology Into Employee Fear Machine
Review - Jun 25, 2025
Windows 11 Problems: Is Microsoft's "Best" OS Actually Getting Worse?
Features - Jun 22, 2025
Telegram Founder Pavel Durov Plans to Split $14 Billion Fortune Among 106 Children
ICT News - Jun 22, 2025
Neuralink Telepathy Chip Enables Quadriplegic Rob Greiner to Control Games with...
Features - Jun 21, 2025
This Over $100 Bottle Has Nothing But Fresh Air Inside
Features - Jun 18, 2025
Best Mobile VPN Apps for Gaming 2025: Complete Guide
Features - Jun 18, 2025
A Math Formula Tells Us How Long Everything Will Live
Features - Jun 16, 2025