Comparastore is a software platform that automates self-storage operations, making it easy for individuals and businesses across Canada to find, compare, and reserve the storage solution best suited to their needs.
As a data analyst intern, I worked on the project "Unlocking AI's Potential: An Exploration of Generative Models and the Power of Quality Data."
The primary goal of this project was to explore generative AI and large language models and to ensure the quality of the training data. I experienced the main stages of AI implementation, discussed the work with AI experts in the field, and contributed to the implementation of the AI model.
I focused on the text data labeling aspect. My team and I were responsible for the quality of the data used to train the AI model. My role included verifying, cleaning, and preprocessing data; developing and presenting the automatic labeling verification solution together with our AI professional; and improving the verification process.
I verified and corrected 20,000 chunks of storage website data.
7 SKILLS DEVELOPED THROUGH THIS ROLE:
Ensuring the quality of the data used to train the self-labeling AI model.
Preparing and formatting data for modeling.
Presenting and communicating the verification solution to cross-functional teams within the company.
Resolving ambiguous situations through research and offering constructive feedback on process improvements to cross-functional teams within the company.
Gaining a valuable perspective on, and firsthand experience with, AI deployment in a startup.
Orchestrating project planning, execution, and coordination with team members.
Independently managing tasks and assignments in alignment with project goals and timelines.