What happened
Enterprise Data Strategy Balancing Compute is at the center of this update. Ahead of the AI & Big Data Expo, HP’s AI & Data Science Business Development Manager discusses the challenges enterprises face in AI data ingestion, the choice between local and cloud compute, and how HP’s hardware portfolio supports autonomous AI lifecycles while addressing governance and cost concerns.
Challenges in Enterprise AI Data Management
In advance of the AI & Big Data Expo held at San Jose McEnery Convention Center on May 18-19, HP’s Jerome Gabryszewski, AI & Data Science Business Development Manager, shared insights on the complexities enterprises encounter when integrating AI and processing data for AI ingestion.
Despite abundant first-party data, many organizations struggle to harness it effectively due to fragmented data ownership, inconsistent schemas, and legacy infrastructures that hinder interoperability. Gabryszewski emphasizes that the architectural and governance challenges often outweigh the technical difficulties in automating data ingestion.
Governance and Risk Management in Continuous AI Learning
With AI models increasingly adopting continuous learning, risks such as concept drift and data poisoning become critical. HP advises treating model updates with the same rigor as software code deployments, incorporating validation gates, automated drift detection, and human oversight prior to retraining. Data provenance is highlighted as essential to prevent poisoning, ensuring clear knowledge of data sources and access control within the organization.
HP’s Hardware Solutions for Autonomous AI Lifecycles
HP leverages its deep experience with the Z series workstations, designed for demanding professional compute tasks for over 15 years. Rather than a single machine solution, HP offers a range of hardware tailored to varying AI workflow needs.
- Developer-level: Mobile and compact workstations like the ZBook Ultra and Z2 Mini support local experimentation and running large language models without cloud dependency.
- AI-first teams: The ZGX Nano, an AI supercomputer powered by NVIDIA GB10 Grace Blackwell Superchip, fits in the palm of a hand yet manages models up to 200 billion parameters locally, scalable to 405 billion parameters when linked.
- Power-user teams: The Z8 Fury supports up to four NVIDIA RTX PRO 6000 Blackwell GPUs and enables full model development cycles on-premises.
- Frontier scale: The ZGX Fury delivers trillion-parameter inference at the deskside, ideal for continuous fine-tuning of sensitive data, often proving cost-effective within a year compared to cloud alternatives.
Importantly, HP’s portfolio supports clustering and rack-ready deployment for scalable, secure IT environments, emphasizing that the main challenge is governance and latency rather than compute capacity.
Addressing the Rising Costs of Generative AI
Enterprise generative AI expenditures surged to an estimated $37 billion in 2025, with many companies overshooting budgets by over 25%. Gabryszewski notes the fundamental tension: while unit inference costs decline, overall spending grows due to rapidly increasing usage.
HP recommends a disciplined approach separating exploratory AI tasks from production workloads. Early experimentation should utilize local hardware investments, avoiding ongoing cloud operational expenses without clear ROI. The optimal strategy involves a three-tier model:
- Cloud for burst training and cutting-edge model access earned through validation.
- On-premises HP Z infrastructure for stable, high-volume inference.
- Edge computing to meet latency-sensitive applications.
Independent analyses suggest on-premises deployments may offer up to 18 times cost savings per million tokens over five years compared to cloud alternatives.
Making Proprietary Data AI-Ready While Ensuring Security
Many enterprises confuse AI readiness with purely data engineering challenges, overlooking data sovereignty and governance complexities. Transmitting sensitive data to cloud services risks compliance breaches, especially in regulated sectors.
HP advocates for Retrieval-Augmented Generation (RAG) architectures on local infrastructure, enabling AI models to query internal knowledge bases at runtime without exposing data externally or training on it. This approach maintains data within controlled environments, supported by robust access controls enforcing role-based permissions, mirroring document management systems.
The Evolving Role of Enterprise IT Teams
Jensen Huang’s perspective highlights a shift from manual IT tasks to higher-value governance and architectural responsibilities as AI agents embed into enterprise applications. Gartner forecasts 40% of enterprise apps will include AI agents by end of 2026, propelling IT teams to oversee agent trust and infrastructure integrity rather than routine operations.
HP stresses that local-first AI infrastructure is critical to provide visibility and control over autonomous agents, a capability diminished in fully cloud-based environments. The future IT team will focus on strategic AI governance aligned with business resilience and ethical considerations.
Fonte: ver artigo original
Related coverage: AI Chronicle analysis and updates.
Why it matters
This update influences the AI race across model providers, infrastructure leaders, and enterprise adoption decisions.

China Approves DeepSeek’s Purchase of NVIDIA H200 AI Chips Under Specific Conditions
China Approves DeepSeek’s Purchase of NVIDIA H200 AI Chips Amid Trade Tensions
ZAYA1 AI Model Achieves Milestone Using AMD GPUs for Large-Scale Training
‘Crypto Yolka’ Blockchain and AI Summit Set for December in Nizhny Novgorod