• Home
  • Startup
  • Money & Finance
  • Starting a Business
    • Branding
    • Business Ideas
    • Business Models
    • Business Plans
    • Fundraising
  • Growing a Business
  • More
    • Innovation
    • Leadership
Trending

Why Conversational Commerce is the Future of Shopping

May 29, 2025

10 Leadership Myths You Need to Stop Believing

May 29, 2025

Tesla’s Layoffs Won’t Solve Its Growing Pains

May 29, 2025
Facebook Twitter Instagram
  • Newsletter
  • Submit Articles
  • Privacy
  • Advertise
  • Contact
Facebook Twitter Instagram
InDirectica
  • Home
  • Startup
  • Money & Finance
  • Starting a Business
    • Branding
    • Business Ideas
    • Business Models
    • Business Plans
    • Fundraising
  • Growing a Business
  • More
    • Innovation
    • Leadership
Subscribe for Alerts
InDirectica
Home » Navigating The Data Risk Minefield In Generative AI Adoption
Innovation

Navigating The Data Risk Minefield In Generative AI Adoption

adminBy adminOctober 26, 20230 ViewsNo Comments4 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email

CEO and cofounder of one of the first data intelligence platforms, BigID, and a privacy, security and identity expert.

In today’s data-driven business landscape, the role of artificial intelligence (AI) and machine learning (ML) has never been more prominent. While these technologies offer unprecedented opportunities for innovation and efficiency, they also introduce a host of new challenges including increasing risk, governance and management of unstructured data. This is especially pertinent when dealing with large language models (LLMs), such as those that power generative AI.

Unstructured Data: A Double-Edged Sword

For years, businesses have amassed vast repositories of unstructured data—email archives, chat logs, PDF files and more—that sit in systems like Microsoft Office 365 or Slack. While this unstructured data is increasingly becoming a powerful input for AI-driven solutions, it also represents a potential Achilles’ heel.

Unlike structured data, which resides in well-defined formats and databases, unstructured data often exists in a more chaotic state, making it harder to govern and secure. This is the data that often contains some of the most sensitive information: personal and customer information, credit card numbers, social security numbers, intellectual property, trade secrets and more.

The Risk Vector: Data Misuse And Exposure

LLMs are trained on large sets of unstructured data—and that amplifies risks. Training generative models on sensitive or regulated data, whether it be client-specific, customer-related or otherwise, introduces the risk of violating data privacy regulations.

Failure to properly govern this data could result in data leaks, financial penalties and reputational damage.

Moreover, incorporating confidential intellectual property into the training sets for these models might inadvertently expose the organization to data breaches or unauthorized dissemination of proprietary information.

Simply put, the training process of LLMs could become a liability if the data isn’t carefully curated and managed.

Data Governance: A Proactive Solution

Before deploying generative AI in any business application—be it customer service automation, analytics, marketing, general efficiency or cyber threat detection—it’s imperative to establish robust data governance procedures.

The first step involves identifying, classifying and tagging datasets that contain sensitive or regulated information. Categories to consider include personal information (PI), personally identifiable information (PII), confidential business strategies or intellectual property.

When looking for solutions to accelerate AI governance, companies need to make sure they’re built to scale in their environment, have enterprise-grade security incorporated and can easily address the breadth and depth to cover the range and scope of their data by type, sensitivity and location.

When implementing these solutions, companies should start by categorizing what data is safe for use, what’s regulated and what data requires additional controls. They should avoid leveraging data to fuel AI training sets until they’ve verified that the data—even training sets—have been marked as safe for use and don’t contain any potentially compromising information, regulated data or customer data.

Ensuring The Right Data For The Right Use

Once datasets have been classified, they can be partitioned accordingly for specific applications. For instance, organizations might choose to exclude sensitive human resources data from the training sets for customer service LLMs. Likewise, they could guide the models to rely on publicly available, nonconfidential data, thus further mitigating the risks associated with data misuse.

When choosing the right data to use to fuel AI adoption, make sure it doesn’t include customer or employee information, intellectual property, secrets and credentials or anything that may expose your company to unwanted data breaches.

A Future-Ready Approach

As AI and ML technologies continue to evolve and integrate more deeply into organizational processes, proactive data governance will become a nonnegotiable facet of responsible business operation.

As we grow increasingly reliant on complex AI models like LLMs for a wide array of applications, the unstructured data that powers these models must be managed with an equal, if not greater, level of scrutiny and care. While the capabilities of generative AI offer a plethora of opportunities for business innovation, they should not be deployed without a rigorous data governance strategy in place.

Failure to manage the risks associated with unstructured data can have serious legal and financial repercussions, rendering the promising advantages of AI null and void. As we tread further into this uncharted territory, it’s more critical than ever to know your data and control your data—wherever it lives.

Forbes Technology Council is an invitation-only community for world-class CIOs, CTOs and technology executives. Do I qualify?

Read the full article here

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related Articles

Going Eco Benefits Planet And This Hotel’s Bottom Line

Innovation May 29, 2025

What IBM’s Deal For HashiCorp Means For The Cloud Infra Battle

Innovation April 25, 2024

Is Telepathy Possible? Perhaps, Due To New Technology

Innovation April 24, 2024

Luminar Launches Production For Volvo, Shows Next-Gen Halo Lidar

Innovation April 23, 2024

Turning Customers Into Investors – Tiny Health’s Experience

Innovation April 22, 2024

Netflix’s Best New Original Series Is Stressing Me Out

Innovation April 21, 2024
Add A Comment

Leave A Reply Cancel Reply

Editors Picks

Why Conversational Commerce is the Future of Shopping

May 29, 2025

10 Leadership Myths You Need to Stop Believing

May 29, 2025

Tesla’s Layoffs Won’t Solve Its Growing Pains

May 29, 2025

Going Eco Benefits Planet And This Hotel’s Bottom Line

May 29, 2025

What IBM’s Deal For HashiCorp Means For The Cloud Infra Battle

April 25, 2024

Latest Posts

The Future of Football Comes Down to These Two Words, Says This CEO

April 25, 2024

This Side Hustle Is Helping Land-Owners Earn Up to $60,000 a Year

April 25, 2024

A Wave of AI Tools Is Set to Transform Work Meetings

April 25, 2024

Is Telepathy Possible? Perhaps, Due To New Technology

April 24, 2024

How to Control the Way People Think About You

April 24, 2024
Advertisement
Demo

InDirectica is your one-stop website for the latest news and updates about how to start a business, follow us now to get the news that matters to you.

Facebook Twitter Instagram Pinterest YouTube
Sections
  • Growing a Business
  • Innovation
  • Leadership
  • Money & Finance
  • Starting a Business
Trending Topics
  • Branding
  • Business Ideas
  • Business Models
  • Business Plans
  • Fundraising

Subscribe to Updates

Get the latest business and startup news and updates directly to your inbox.

© 2025 InDirectica. All Rights Reserved.
  • Privacy Policy
  • Terms of use
  • Press Release
  • Advertise
  • Contact

Type above and press Enter to search. Press Esc to cancel.