Led a multidisciplinary team of researchers, engineers, designers, and external facial morphology and speech dialect experts to drive the completion and publishing of the pilot fairness assessments for our face verification and speech to text API's. We developed this framework when completing and communicating out the fairness results:
This launch with our Face and Speech services was the first time Microsoft externally published granular fairness results for any of it's services.
Once we published these, I continued to work with our researchers and policy team to collect insights and feedback to incorporate into a fairness assessment playbook so that other teams at the company could also follow, complete, and publish high quality fairness assessments. This playbook included guidance on hiring external experts and vendors, collecting benchmark datasets, defining the right evaluation metrics and thresholds, and more. Check out the assessments and screenshots below: