working-with-chatgpt-photo
Image courtesy of the team.

Lablab AI, in collaboration with the Gemma team from Google, hosted the 'Gemma2 AI Challenge', inviting participants to leverage Google’s advanced Gemma 2 (Open Source models) to build cutting-edge solutions. The hackathon kicked off at the Google Cloud Space, where experts walked through various Gemma models available. Resources were shared to guide participants throughout the hackathon, along with Discord support and office hours from experts. Out of 1,377 participants and 106 teams, 35 applications were submitted, and two applications emerged as winners.

The winning team included YingLi (Head of AI, R&D at TG0 | Director at WCC), giving me the opportunity to interview her and her team to bring back their lessons, learnings, and tips for our community.

Team celebrating their win with DeepMind
Photo of the team celebrating their win with DeepMind

Ideation

A crucial aspect of any successful product is a clear idea. The team excitedly shared their ideation process. While their initial concept was to build shades with a camera that could detect and read objects, they realised that it would be more valuable to create a product with clear business value. Returning to the drawing board, the team brainstormed different ideas and ultimately settled on one.

Byte the News

The winning idea that emerged from the brainstorming session was a Chrome extension that can summarise news or text on a webpage and read it back to the user. Their target audience was software engineers who want to get the gist of news without having to stare at the screen.

Technical Deep Dive

The technical details shared in the call were fascinating. To better understand how this product works, I created a diagram that breaks down its key components and functionality.

Architecture diagram of Byte the News
Diagram explaining the system architecture

The process starts by scraping text and images from a webpage. The extracted text is sent to the Gemma model hosted on Groq, which returns a text summary. Images in the article are sent to OpenAI to generate a text description of the image. These text and image summaries are then combined and refined into a single synthesized summary of the article. Finally, this summary is scanned for harmful content using Shield Gemma, a content moderation model, based on a set of predefined criteria.

For detailed setup instructions and implementation information, please refer to the project’s well-documented GitHub repository. We can use the README file shared to run the project locally.

The team shared that if they had more time, they would have liked to make the project mobile-compatible or extend its functionality to summarise books! Very cool ideas.

Technical Skills Are Important, But Not the Only Thing That Made This Team Stand Apart

Power Skills

Perseverance Is the Key

Feeling amazed that the team built the product in under two days, I asked them about the work that went into it. Here are some expert insights from the conversation that stood out to me:

Sahana: How did you get the web crawler to work? It sounds amazing.

Joenam: Initially, we were relying on HTML tags to extract text, but we quickly realized that this approach didn’t work for every site. While iterating, I found the newspaper package, which picks up text reliably. However, it still doesn't work for sites that require a subscription to read the article.

Sahana: Amazing! I’m also curious about Shield Gemma. How was your experience using it?

Rekha: We first ran it locally. It outputs a probability based on a set of criteria to determine if content is safe or not. I had to do a lot of experimentation to settle on the probability threshold beyond which content is considered offensive. The threshold was 0.01!

Sahana: What was the experience like setting up the server?

Ying: Setting up the server took a while. Since we were all working on different parts of the product locally, we were able to verify things were running locally. To get the Chrome extension to work, running the product on a server was essential. Initially, Shield Gemma crashed on a 500MB server, and we had to spin up a larger one.

Teamwork Makes the Dream Work

Sahana: How did you manage so many moving parts?

Ying: We used Git for version control and communicated over Google Meet.

Rekha: Ying guided us well through the initial setup phase. She uses Miro to brainstorm ideas. Even with the idea we settled on, we discussed the pros and cons of different approaches. This definitely made the implementation easier by clearing the way for task distribution.

Presentation Is Key

Ying: It took more than half a day to combine slides and ensure we stuck to the requirements. For the voice-over in the slides, I recorded myself, used Whisper (an audio-to-text tool) to convert the audio to text, used OpenAI to clean up the text, and then used Eleven Labs to convert it back into audio. Joenam then added this to the video.

Sahana: Wow! I definitely found your presentation impressive. It’s inspiring to hear the thought and effort behind it. This alone could be a project!

Motivational Message to Readers

Sahana: Do we only need technical skills for a hackathon?

Joenam: You don’t need to be a pro at coding. It’s more important to be a team player. Coming up with ideas is the foundation. If you can’t plan and work as a team, it will be much more difficult. There are many valuable skills needed beyond coding, such as organizational skills, video editing, and storytelling.

Rekha: I agree with the importance of knowing how to work as a team. Having the attitude that we are in this together helps us move forward. It’s also essential to be mindful and respectful of others on the team.

Sahana: Is it scary to be overwhelmed by a hackathon, trying to find people and build products?

Joenam: It is overwhelming when you’re surrounded by brilliant people and doing something massive, especially if it’s something new or something you’ve never done before. It helped when Ying started organizing things and distributing tasks. The most important thing is recognizing that this is a weekend or a temporary event (which will end).

Rekha: I second the value of brainstorming. We went over the pros and cons, did competitive research, and broke the tasks into smaller pieces, which helped in digesting them.

Ying: It’s more important to believe in the value you can contribute. Everyone is here to learn. Let this motivation and effort bind you and the team together.

Sahana: Amazing! Let’s wrap up with our favorite motivational quote or war cry.

Rekha: "Fake it till you make it!"

Joenam: "Step out of your comfort zone; you have nothing to lose."

Ying: "When meeting experts or people you admire, you might face rejections. It’s more important than ever to be bold, approach them, and not be afraid of being treated badly."

To me, this sounded like the affirmation I’d been long due to hear.

You can find Ying at LinkedIn, Rekha at LinkedIn and Joenam at LinkedIn.

Thank you for reading. We love showcasing people and their unique products. If you have an AI system you’d like to share with the community, please reach out. We would love to feature and learn from you.