Home

Skilled Research Engineer with over 7 years of hands-on experience implementing research and building the systems that support it.

Education

Thesis : Mapping Acoustics to Kinematics in Speech
GPA : 3.7/4
GPA : 3.5/4

Experience

I currently work as part of a team of researchers and engineers focused on building virtual telepresence. I have worked on many different parts of this research pipeline, from writing data capture software, to large-scale data processing, to implementing computer vision and machine learning research.

  • Increased keypoint detection quality by 10% by changing the training pipeline for the keypoint detection ML model (PyTorch)
  • Made architectural updates to the keypoint detection ML model (PyTorch)
  • Designed and implemented a failure detection and retraining pipeline for our state-of-the-art keypoint and segmentation detectors, leading to a 20% improvement in predicted keypoint quality (PyTorch)
  • Developed a speech-to-facial-animation prediction model using a Bi-LSTM (PyTorch)
  • Developed computer vision-based metrics to evaluate the quality of keypoint annotations (Python)
  • Designed and implemented a pipeline for early evaluation of data capture quality (Python)
  • Designed and implemented post-processing steps that made capture data consumable for research, and formalized and automated data preprocessing (Python, C++, Bash)
  • Created monitoring and alerting solutions for fast detection of issues in the data capture pipeline (Python, SQL, PHP)
  • Created a data management system to support PII data captured during user studies (ReactJS, PHP)
  • Created a data management system to support large-scale data annotation (Python, PHP)
  • Developed and upgraded a 2D and 3D annotation tool used to annotate over one million data points (JavaScript - ReactJS, PHP, SQL, Python, C++)
  • Collaborated in the design of a user study capture process that has captured over a thousand hours of data.
  • Led the development of multiple custom software solutions to help Research Assistants interact with research tools and software (C++, Python)

Independent Contractor (Jan 2016 - Mar 2017)
Toronto, ON
Research Engineer

Worked with researchers, clinics, and corporations to develop software and hardware solutions to problems at the intersection of physical movement and machine learning.

  • PROMPT Institute - Functional data analysis (FDA) software tool - Developed an FDA tool to evaluate the consistency of articulation data over multiple repetitions (MATLAB).
  • The Speech And Stuttering Institute - Gentle Onset software tool - Gentle Onset therapy is used to treat people who stutter. This neural network-based detector for gentle onset in speech enabled patients to practice the therapy at home.
  • PROMPT Institute - Eguana Python - Developed software for the analysis of Electromagnetic Articulograph data.
  • Music Mind Inc - Clarinet Monitor - Developed hardware and software for real-time visualization of tongue position while playing the clarinet.
  • Co-founded and led a healthcare software startup from infancy to 300k in yearly revenue
  • Led the technical team and grew it from two part-time to three full-time developers
  • Planned the project roadmap and supervised the development of iOS, watchOS, web, and server-side projects
  • Collaborated with hospitals and researchers in Toronto to produce technical innovations in health care by providing recording, hosting and analysis solutions for human kinematic and physiological data
  • Consulted for leading businesses and banks in Canada and the USA on iOS strategy and mobile app design
  • Developed iOS applications and supervised a software development team of 4 people using Agile methodologies
  • Planned the project roadmap and supervised the development of iOS, watchOS, web, and server-side projects
  • Responsible for emergency bug fixes for multiple projects on Web, iOS, Android and BlackBerry platforms

Projects

EGUANA | Visual acuity software | GPA and CSTI analysis software | Auditory acuity software | Food Spec | LCBO Amazon Alexa app | The Art of Artless Living | Theremin controlled computer mouse

Publications

  • Rampersaud, D., Huynh, A., Namasivayam, A., Bali, R., Ward, R., Yan, T., Law, V., Hayden, D., & Van Lieshout, P. (2016, November). Can we objectively assess the delivery of tactile inputs during speech therapy? Poster presented at Toronto Rehabilitation Institute Research Day 2016, Toronto, ON
  • Pearsell, S., David, V., Bali, R., Namasivayam, A.K., Helms-Park, R., & Van Lieshout, P.H.H.M. (2015). Dual task influence on speech motor stability. Presented at the McGill Canadian Conference for Linguistic Undergraduates: McGill University, Montreal, QC (March 13-14)
  • Law, V., Huynh, A., Namasivayam, A.K., Yan, T., Bali, R., Hayden, D., & Van Lieshout, P.H.H.M. (2015, November). Oro-Facial Tactile Cues Improve Phoneme Recognition. Poster to be presented at the Toronto Rehabilitation Research Conference, Toronto, Ontario
  • Namasivayam, A.K., Bali, R., Yan, T., Hayden, D., & Van Lieshout, P.H.H.M. (2015). Accuracy & Consistency of Oro-Facial Prompting for Speech. Presented at the 2015 ASHA Convention in Denver, Colorado
  • Bali, R., Namasivayam, A.K., Hayden, D., Van Lieshout, P.H.H.M., & Yan, T. (2015, October). Assessing Consistency of Oro-Facial Prompting. Poster presented at International Association of OroFacial Myology Convention, Lake Buena Vista, Florida
  • Slis, A., Bali, R., Namasivayam, A.K., Chaldi, D., & Van Lieshout, P.H.H.M. (2015). Articulatory Speech Errors and Word Structure. Poster presented at the 18th ICPhS in Glasgow
  • Pearsell, S., Chadi, D., Bali, R., Namasivayam, A.K., Helms-Park, R., & Van Lieshout, P.H.H.M. (2014). Dual task influence on speech motor stability. Presented at the 2nd Undergraduate Linguistic Conference: University of Toronto, Toronto, ON (Nov 21)

Skills

  • Computer Languages - Python / PyTorch / MATLAB / R / C / C++ / C# / Objective-C / Java / HTML / CSS / PHP / JavaScript / jQuery / Node.js / SQL
  • Mathematics Concepts - Machine Learning / Computer Vision / Audio Signal Processing / Bayesian Statistics / Game Theory / Information Theory / Graph Theory / HMM / Vector Quantization / Regression Analysis
  • Engineering Applications - ASR / TTS / NLU / Articulatory Inversion / Speech Synthesis / Speech Animation / Speaker Recognition

Contact me

contact@rohanbali.com

  • EGUANA - Oral Dynamics Lab at the University of Toronto (Present) - Software package for speech acoustics and kinematic analysis, used globally by Speech-Language Pathologists
  • Visual acuity software - Dr. Roslyn Ward at The University of Western Australia (2015) - Computer vision tracking and software package for a psychological test of visual perception
  • GPA and CSTI analysis software - The Prompt Institute (2015) - Recorded the speech therapist’s hand movements and the participant’s speech articulator movements using an AG501 machine, and analyzed the data using generalized Procrustes analysis (GPA) and cyclic spatiotemporal index (CSTI) analysis
  • Auditory acuity software - Dr. Yana Yunusova at the University of Toronto (2015) - Software package for a psychological test of auditory perception
  • Food Spec - A nutrition visualization iOS app and website
  • LCBO Amazon Alexa app - Speech recognition retail software used to search through the LCBO product list
  • The Art of Artless Living - Blog
  • Theremin controlled computer mouse - Senior Project, University of Illinois Urbana-Champaign