Workshop @ IEEE Big Data 2026

Building Trustworthy AI Pipelines for Big Data

Verification, Provenance, and Reproducibility in Shared AI-Augmented Data Pipelines

Co-located with IEEE Big Data 2026 · December 14–17, 2026

Paper Submission

Oct 31, 2026

Acceptance Notice

Nov 14, 2026

Camera-Ready

Nov 23, 2026

Workshop Dates

Dec 14–17, 2026

Submit Paper Learn More

About the Workshop

Why TrustAI-BD?

Modern big data pipelines do not operate in isolation — they increasingly span teams, institutions, and organizational boundaries. Datasets, intermediate artifacts, trained models, and downstream decisions flow across organizational boundaries, between teams, and through multi-institution collaborations, as seen in federated healthcare analytics, financial risk platforms, and large-scale scientific consortia.

Unlike traditional deterministic query engines, learned models are inherently stochastic. When an AI-augmented pipeline flags a fraudulent transaction, recommends a treatment plan, or triggers an infrastructure response, the inability to audit the chain of data, model versions, and intermediate decisions is not an academic limitation — it is an operational risk.

This workshop addresses the foundational question: in a world of shared, AI-augmented data pipelines, how do we verify that results are correct, and how do we ensure they can be reproduced?

Research Topics

Page Limit

IEEE

Proceedings

2026

Big Data Conference

Call for Papers

Research Topics

🔗

Data Provenance and Lineage in Shared Big Data Pipelines

🔁

Reproducibility of AI/ML Model Decisions at Scale

✅

Verification of Cross-Organizational Data and Model Artifacts

📉

Model Drift and Pipeline Staleness Detection

🏛️

Auditing and Accountability Across Multi-Party and Cross-Organizational Workflows

🔒

Federated Auditing and Privacy-Preserving Verification

🧠

Explainability and Auditability of Black-Box Pipeline Components

🗂️

Versioning Strategies for Data, Models, and Code in Multi-Party Workflows

🤝

Trust Propagation and Accountability Across Pipeline Stages

🤖

Reproducibility Challenges in Foundation-Model, RAG, and Agentic Data Workflows

📊

Benchmarking and Evaluation Under Real-World Deployment Conditions

📋

Case Studies in Deployed Systems: Failures, Reproducibility Incidents, and Lessons Learned

Important Dates

Key Deadlines

October 31, 2026

Full Paper Submission Deadline

Submit via the online submission system

November 14, 2026

Notification of Acceptance

Authors notified of paper acceptance decisions

November 23, 2026

Camera-Ready Submission

Final version of accepted papers due

December 14–17, 2026

Workshop @ IEEE Big Data 2026

Workshop sessions held at the conference

Organization

People

Program Chairs

Anantaa Kotal

Program Co-Chair

Assistant Professor
The University of Texas at El Paso

Aritran Piplai

Program Co-Chair

Assistant Professor
The University of Texas at El Paso

Prajit Das

Program Co-Chair

Security Researcher & Software Engineering Leader
Cisco Systems Inc.

Program Committee

Ted Bedwell

Distinguished Engineer · Cisco Systems Inc.

Changxue Deng

Software Engineering Technical Leader · Cisco Systems Inc.

Paper Submission

How to Submit

📄 Submission Guidelines

Full-length papers up to 10 pages (IEEE 2-column format)
References counted within the 10-page limit
IEEE Computer Society Proceedings formatting required
All accepted papers included in IEEE Big Data Proceedings

IEEE Formatting Templates

🚀 Submit Your Paper

Submit through the official online system
Deadline: October 31, 2026
Notifications sent by November 14, 2026
Camera-ready due November 23, 2026

Submit via CyberChair →