Priyambada Jain

Priyambada Jain headshot
Blackrock logo
Senior Data Scientist
BlackRock

Presentation Title:

Agentic AI for Financial Data
Automating Insights from Large-Scale Tables

Presentation Summary:

1. Problem Statement:
Financial datasets are vast, complex, and difficult to process manually. Extracting key insights from large financial tables remains a bottleneck for analysts. Unlike free-form text, tabular data is hierarchical, relational, and context-dependent, requiring specialized techniques for accurate summarization and insight generation. Traditional LLMs struggle with numerical reasoning, row-column relationships, and multi-step financial calculations, making agentic AI architectures essential for structured data interpretation.

2. Agentic AI Approach:
Built an autonomous AI agent leveraging LLMs and domain-specific fine-tuning to analyze, summarize, and extract key financial insights dynamically.

3. Core Capabilities:

  • LLM Fine-Tuning: Trained on financial documents to improve context understanding and numerical reasoning.
    • Bayesian Optimization for hyperparameter tuning.
    • Contrastive Learning to improve structured data representation.
    • Quantile Regression to enhance financial trend prediction.
    • Expectation-Maximization (EM) for better handling of missing data in tabular structures.
  • Named Entity Recognition (NER) & Ontology Learning:
    • Developed a NER model to learn the ontology of financial use cases, enabling precise identification of different columns and their relationships.
    • Fine-tuned an embedding model to capture semantic relationships between financial entities, improving contextual understanding of structured data.
  • Evaluation Framework: Supports researchers in assessing different prompting techniques like Chain-of-Thought (CoT) and Chain-of-Table (CoTab) to improve reasoning on structured financial data.
  • Tabular Data Extraction: AI pipeline identifies patterns, trends, and anomalies in financial statements.
  • Synthetic Data Generation: Capable of creating synthetic financial datasets for benchmarking, evaluation, and fine-tuning the framework.
  • Explainability & Auditability: Ensures AI-generated insights are traceable and interpretable for decision-makers.

4. Impact & Business Value:

  • Efficiency Gains: Reduces manual processing time from hours to minutes.
  • Enhanced Decision-Making: Empowers financial professionals with concise, data-driven summaries.
  • Scalability: Adaptable across different financial reports, including earnings statements, balance sheets, and regulatory filings.

Conclusion:
This agentic AI architecture bridges the gap between raw financial data and actionable intelligence, offering a research-friendly framework for advancing structured reasoning in AI. By addressing the unique complexities of tabular financial data, it enables more accurate, interpretable, and scalable AI-driven financial analysis.

Picture of About | Priyambada Jain

About | Priyambada Jain

Priyambada Jain is a data science and artificial intelligence professional based in California, currently working in AI at BlackRock in Palo Alto. She has extensive experience in Generative AI, machine learning, and statistical modeling, applying these technologies to solve real-world business problems.

She holds a degree from the University of Southern California (USC) and has contributed to academic research in data science education. She co-authored papers such as Democratizing Data Science through Data Science Training and worked as a Research Assistant at USC on an NIH-funded project aimed at extending machine learning education to biomedical practitioners.

As part of this initiative, she contributed to the Educational Resource Discovery Index (ERuDIte), which focused on organizing online data science materials for broader accessibility.