Last quarter, we ran a pilot that I want to describe precisely. The summary I just gave you doesn't capture how different it felt to see it actually running in production.
A recruiter opened a requisition for a mid-level product manager role at an enterprise customer. Over the next 72 hours — without any further manual intervention from the recruiter — an AI agent initiated a massive parallel operation:
- It sourced 340 candidates from relevant talent pools.
- It filtered the list down to 68 candidates who met all hard criteria.
- It sent personalized outreach to those 68 candidates and received 31 responses.
- It conducted structured asynchronous screening interviews with all 31, evaluated them, and delivered a shortlist of 8 candidates with comprehensive structured evaluation summaries.
The recruiter reviewed the shortlist, adjusted two rankings with brief notes, and scheduled first-round interviews.
This is not a demo scenario. It's production infrastructure. And it represents a qualitative shift in what "AI in recruiting" means — moving from AI as a tool the human operates, to AI as an agent the human supervises.
What Changed Architecturally?
- Function-Call AI (Previous Wave): single, independent request-response calls that the human triggers one at a time; the recruiter drives every step of the workflow.
- Agentic AI (Current Wave): a multi-step agent loop that plans, calls tools, and carries state across hours or days, with the human supervising at defined checkpoints.
The shift from function-call to agent-loop changes the engineering problem in fundamental ways.
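To make that distinction concrete, here is a minimal sketch of the two control flows. The `llm` and `tools` objects, their methods (`complete`, `decide`), and the decision format are assumptions for illustration, not any particular framework's API.

```python
def summarize_resume(llm, resume_text: str) -> str:
    # Function-call AI: one request, one response; the recruiter decides what happens next.
    return llm.complete(f"Summarize this resume:\n{resume_text}")


def run_agent(llm, tools: dict, goal: str, max_steps: int = 20) -> dict:
    # Agentic AI: a plan-act-observe loop that carries state between steps and
    # only stops when the goal is met or the step budget runs out.
    history: list[dict] = []
    for _ in range(max_steps):
        decision = llm.decide(goal=goal, history=history, tools=list(tools))
        if decision["action"] == "finish":
            return {"result": decision["output"], "history": history}
        observation = tools[decision["action"]](**decision["arguments"])
        history.append({
            "action": decision["action"],
            "arguments": decision["arguments"],
            "observation": observation,
        })
    return {"result": None, "history": history, "stopped": "max_steps"}
```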
The New Engineering Challenges
1. State Management
An agent maintains state across a multi-step process that may run over hours or days. That state needs to be persistent, recoverable if the agent process is interrupted, and auditable. Every action the agent took, every tool call it made, and every decision point it navigated needs to be fully reconstructable from the state log.
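One way to get all three properties is an append-only event log that the agent writes before acting and replays on restart. The sketch below assumes a simple JSONL file; the class and field names are illustrative, not a production schema.

```python
import json
import time
import uuid
from pathlib import Path


class AgentEventLog:
    """Append-only log: every decision and tool call is recorded so the run
    can be resumed after an interruption and audited after the fact."""

    def __init__(self, run_id: str, log_dir: Path):
        self.path = log_dir / f"{run_id}.jsonl"
        self.path.parent.mkdir(parents=True, exist_ok=True)

    def append(self, event_type: str, payload: dict) -> None:
        record = {
            "event_id": str(uuid.uuid4()),
            "ts": time.time(),
            "type": event_type,   # e.g. "decision", "tool_call", "checkpoint"
            "payload": payload,
        }
        with self.path.open("a") as f:
            f.write(json.dumps(record) + "\n")

    def replay(self) -> list[dict]:
        # Rebuild the full history; a restarted agent resumes from the last event.
        if not self.path.exists():
            return []
        with self.path.open() as f:
            return [json.loads(line) for line in f]
```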
2. Tool Design
Agents are only as capable as their tools:
- The sourcing agent's effectiveness is bounded by the quality of the candidate search index.
- The outreach agent's effectiveness is bounded by the quality of the personalization logic and the deliverability of the communication layer.
In function-call AI, tool quality doesn't cascade — each call is independent. In agentic AI, tool quality compounds across the workflow.
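As a sketch of why that compounding matters, consider a narrow, typed contract for the candidate search tool; everything downstream inherits whatever it returns. The `Candidate` fields and the `CandidateSearchTool` protocol here are hypothetical.

```python
from dataclasses import dataclass
from typing import Protocol


@dataclass
class Candidate:
    candidate_id: str
    name: str
    skills: list[str]
    source: str


class CandidateSearchTool(Protocol):
    def search(self, criteria: dict, limit: int) -> list[Candidate]: ...


def source_and_filter(search_tool: CandidateSearchTool,
                      criteria: dict,
                      hard_skills: set[str]) -> list[Candidate]:
    # Whatever the search index misses here silently degrades every later stage:
    # thin skills data means weaker filtering, weaker outreach, weaker screening.
    found = search_tool.search(criteria, limit=500)
    return [c for c in found if hard_skills.issubset(set(c.skills))]
```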
3. Human Checkpoint Design
Where you interrupt the agent loop is one of the most consequential design decisions in an agentic system.
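One way to make a checkpoint a hard gate rather than an advisory notification is to model it as explicit state the workflow cannot pass without a recorded approval. This is an illustrative sketch with hypothetical names, not a prescribed design.

```python
from dataclasses import dataclass, field
from enum import Enum


class CheckpointStatus(Enum):
    PENDING = "pending"
    APPROVED = "approved"
    REJECTED = "rejected"


@dataclass
class Checkpoint:
    name: str                    # e.g. "shortlist_review"
    payload: dict                # what the recruiter is asked to review
    status: CheckpointStatus = CheckpointStatus.PENDING
    reviewer_notes: list[str] = field(default_factory=list)


def require_approval(checkpoint: Checkpoint) -> dict:
    # The agent loop blocks here; downstream steps only run once a human approves.
    if checkpoint.status is not CheckpointStatus.APPROVED:
        raise RuntimeError(f"Checkpoint '{checkpoint.name}' is awaiting human review")
    return checkpoint.payload
```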
The Three Components of an Agentic Recruiting System
Our architecture breaks the problem space down into three distinct, cooperating agents:
1. The Sourcing Agent
Handles candidate discovery and initial filtering. Its job is to:
- Interpret a job description
- Translate it into structured search criteria
- Execute searches across talent pools
- Enrich candidate profiles with publicly available data
- Filter them to a set that meets hard requirements
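Interpreting the job description means committing to an explicit, structured representation that the downstream searches and filters can execute against. A minimal sketch of what that might look like, with illustrative field names and values:

```python
from dataclasses import dataclass, field


@dataclass
class SearchCriteria:
    titles: list[str]                     # e.g. ["Product Manager"]
    must_have_skills: list[str]           # hard requirements used for filtering
    nice_to_have_skills: list[str] = field(default_factory=list)
    min_years_experience: int = 0
    locations: list[str] = field(default_factory=list)
    industries: list[str] = field(default_factory=list)


criteria = SearchCriteria(
    titles=["Product Manager", "Senior Product Manager"],
    must_have_skills=["B2B SaaS", "roadmap ownership", "stakeholder management"],
    min_years_experience=4,
    locations=["Remote (US)", "New York"],
)
```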
2. The Outreach and Qualification Agent
Handles communication with candidates. This is the most sensitive component from a candidate experience perspective.
Personalized outreach that references specific aspects of a candidate's background performs significantly better than generic templates — but the personalization has to be highly accurate.
This agent also handles response classification (interested, not interested, asking questions) and routes conversations accordingly. Follow-up orchestration, scheduling coordination, and handoff to the screening stage all live here.
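A sketch of the classification-and-routing step, assuming a three-label taxonomy; the labels and downstream step names are illustrative, not the production workflow.

```python
from enum import Enum


class CandidateResponse(Enum):
    INTERESTED = "interested"
    NOT_INTERESTED = "not_interested"
    QUESTION = "question"


def route_response(classification: CandidateResponse, candidate_id: str) -> str:
    # Each label maps to a distinct next step in the workflow.
    if classification is CandidateResponse.INTERESTED:
        return f"schedule_screening:{candidate_id}"       # hand off to the screening agent
    if classification is CandidateResponse.QUESTION:
        return f"draft_reply_for_review:{candidate_id}"   # answer drafted, human-reviewed
    return f"close_out_politely:{candidate_id}"           # opt out, suppress future outreach
```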
3. The Scheduling and Screening Agent
Manages the structured interview and evaluation. For our system, this is a voice- or chat-based asynchronous interview: the agent conducts a structured conversation, probes responses that warrant deeper investigation, and evaluates the candidate against the role's rubric.
The output isn't just a transcript — it's a structured JSON evaluation with:
- Dimension scores
- Evidence citations
- Confidence levels
- A clean summary the recruiter can act on
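A hypothetical example of that shape; the dimension names, scores, evidence strings, and summary text are invented for illustration.

```python
evaluation = {
    "candidate_id": "cand_0042",
    "rubric_version": "pm-midlevel-v3",
    "dimensions": [
        {
            "name": "stakeholder_management",
            "score": 4,                       # on the rubric's 1-5 scale
            "evidence": ["Described aligning engineering and sales on a pricing launch"],
            "confidence": "high",
        },
        {
            "name": "analytical_rigor",
            "score": 3,
            "evidence": ["Cited an A/B test but couldn't quantify its impact"],
            "confidence": "medium",
        },
    ],
    "summary": "Strong communicator with solid execution; probe data fluency in round one.",
}
```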
What Makes This Hard?
Building this infrastructure exposes you to complexities that simple LLM wrappers never see:
| Challenge | Why It's Hard | Mitigation |
|---|---|---|
| Agent Reliability at Scale | Edge cases appear at 100+ candidates: unexpected languages, API 503s, missing schema fields | Error handling that runs deep through every step of the reasoning chain |
| Human Oversight Design | A shortlist "review" that takes 45 seconds because the output is overwhelming isn't real oversight | Purpose-built review UI that supports meaningful engagement |
| Candidate Experience | Candidates know when they're interacting with AI; failures break trust | Transparency, clear communication, technical reliability |
| Compliance | Autonomous hiring actions carry non-trivial legal weight | Meticulous logging of every agent action touching a candidate record |
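For the compliance row in particular, meticulous logging means every agent action that touches a candidate record becomes a structured, queryable entry. A minimal sketch, with illustrative field names rather than any specific regulatory schema:

```python
from dataclasses import dataclass, asdict
from datetime import datetime, timezone
import json


@dataclass
class CandidateActionRecord:
    candidate_id: str
    actor: str          # which agent (or human) acted, e.g. "agent:outreach"
    action: str         # e.g. "outreach_sent", "screening_scored"
    rationale: str      # why the agent took this action
    occurred_at: str    # ISO-8601 timestamp, queryable for audits


record = CandidateActionRecord(
    candidate_id="cand_0042",
    actor="agent:outreach",
    action="outreach_sent",
    rationale="Met all hard criteria; outreach referenced recent B2B SaaS roadmap work",
    occurred_at=datetime.now(timezone.utc).isoformat(),
)
print(json.dumps(asdict(record)))
```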
What the Organization Needs
Agentic AI amplifies exactly what you already have: a good foundation becomes a superpower, and a bad foundation becomes a catastrophe.
Before jumping into agentic capabilities, an organization needs to cover some decidedly unglamorous prerequisites:
- Invest heavily in data quality
- Build structured rubric libraries
- Guarantee integration reliability
- Stand up scalable audit infrastructure
- Design fast human escalation paths
The Deployment Curve
Most enterprise HR tech vendors are somewhere in the awkward transition from function-call AI to agent loops.
The vendors who move to agentic systems backed by robust infrastructure — data quality, evaluation rigor, auditability — will build a compounding advantage. Their agents get better continuously through recruiter feedback, and their customers gradually build deep institutional dependency on the capability.