A machine learning tool that helps firms share confidential data easily

The tool will make data sharing convenient and safe, at a time when organisations need to flexibly utilise all available data to participate in a data-driven and automated attack landscape.

November 05, 2020 04:00 pm | Updated 04:13 pm IST

The tool is said to help companies share data with third-party vendors to develop products or services.

A new tool called 'DoppelGANger' employs machine learning techniques to enable companies to exchange data with one another without revealing confidential information.

Developed by researchers at Carnegie Mellon University and technology company IBM, the tool uses generative adversarial networks (GANs), which employ machine learning techniques to synthesise datasets that have the same statistical properties as the original data. A GAN is a system of two neural network models trained against each other: a generator that produces synthetic data, and a discriminator that tries to tell the synthetic data apart from the real data.

On the datasets provided, models trained with DoppelGANger-produced synthetic data had up to 43% higher accuracy than models trained with synthetic data from competing tools, the team said in a study titled 'Using GANs for Sharing Networked Time Series Data: Challenges, Initial Promise, and Open Questions'.

The CMU and IBM team says the tool requires no prior knowledge of the dataset and its configurations, as the GANs themselves are able to generalise across different datasets and use cases. This makes the tool highly flexible, the researchers say, and that flexibility is key to data sharing in cybersecurity situations.

The tool will make data sharing convenient and safe, at a time when organisations need to flexibly utilise all available data to participate in a data-driven and automated attack landscape, the team stated. The team aims to expand the tool's capabilities soon to enable it to handle more complex datasets.
