How to Follow Data Through the Search Pipeline

Use this topic to follow data through the search pipeline from the client environment to the server environment.

Before you start

Is this a single-agent scan or a distributed scan?
- Distributed: One endpoint is the “discovery host” and others are “search workers.”
What is the scan ID / scan start time?
- Helps find the right section of logs and the correct per-scan queue table (search_queue_<scanId>).

Goal: Confirm the “to-do list” was created in the Job Queue.

On the discovery host, review identityfinderCMD.exe (IDF/SystemSearch) logs for evidence it: created the per-scan queue table, and inserted jobs as Pending.

Goal: Confirm search workers are claiming and processing jobs.

On one or more search agents, review identityfinderCMD.exe logs for ‘read the queue’ messages and LOCATION SEARCHED events

Goal: Confirm results were written to shipper_queue locally.

On the endpoint doing the searching, review identityfinderCMD.exe logs for evidence it published results into the shipper queue (results are staged before jobs are considered fully done).

Goal: Confirm the IDFMessagingSvc.exe (IFS/Shipper) is draining shipper_queue and posting to Ingress.

Review IDFMessagingSvc.exe IFS logs for successful POST to Ingress (HTTP success) and retries/failures if Ingress is unreachable.

Goal: Ingress received the shipped payload and streamed it to Kafka.

Review Ingress service logs (match up timestamps). Confirm you see batches are received and produced/streamed to Kafka.

Goal: SearchPersistence consumed the message from Kafka and uploaded/wrote it into SQL Server.

Search SearchPersistence logs for the CorrelationID also found in the IFS log.

Not discovered → Discovery logs never show targets → issue in Discovery stage.
Discovered but not scanned → job queue created, but workers never claim → worker connectivity/queue access/timing issue.
Scanned but no results staged → search completes, but no shipper queue inserts → result staging/publishing issue.
Staged but not delivered → shipper queue grows, IFS shows failures → shipping/connectivity issue.
Delivered but not visible in UI → IFS shipped OK, but Ingress/SearchPersistence missing CorrelationID → server-side ingestion/persistence issue.