Muhammad Bilal
2 indexed papers
Publications per year
Top categories
Frequent co-authors
Research Timeline
The paper surveys the use of LLMs for agentic NetOps and AIOps, arguing that operational reliability depends not on the model itself, but on robust surrounding machinery and workflow-centered evaluation.
The paper evaluates the semantic stability of clinical LLMs to linguistic variations, finding that domain specialization does not guarantee consistent robustness improvements.
Papers
Same Patient, Different Words, Different Diagnosis? Evaluating Semantic Stability in Clinical LLMs
The paper evaluates the semantic stability of clinical LLMs to linguistic variations, finding that domain specialization does not guarantee consistent robustness improvements.