Automation specialist
First conversation is free. Sign up to message Heather.
I will conduct a structured 60-minute evaluation of your AI system’s conversational behavior.

Includes:
• Hallucination detection
• Logical consistency testing
• Boundary and policy edge-case probing
• Tone stability analysis
• Context retention validation
• Adversarial prompt stress-testing

You will receive:
• A written breakdown of weaknesses identified
• Specific examples of failure points
• Suggestions for robustness improvements

Ideal for research teams, safety engineers, and product leads improving model reliability.
Can travel within New York and Pennsylvania, or further with compensation.

Analytical edge-case explorer with strong pattern-recognition and emotional calibration skills. Experienced in identifying logical gaps, policy blind spots, and conversational drift in AI systems. Comfortable operating in ambiguous or boundary-heavy scenarios. Skilled at pushing systems to reveal structural weaknesses without destabilizing context. Strong detection across cognitive, emotional, and behavioral layers. Reliable, direct, and precise in feedback delivery.

High-variance cognitive tester. Skilled in identifying blind spots, pattern drift, emotional incongruence, and system inconsistency. Strong intuitive–analytical crossover thinker. Particularly effective at stress-testing AI reasoning and boundary logic.