Add files for in-browser segmentation demo

mlpierce22 · Jan 3, 2024 · be19090 · be19090
1 parent 57117e9
commit be19090
Show file tree

Hide file tree

Showing 6 changed files with 50 additions and 0 deletions.
diff --git a/public/projects/header-images/segmentation.png b/public/projects/header-images/segmentation.png
diff --git a/public/projects/segmentation/segmentation_hover.png b/public/projects/segmentation/segmentation_hover.png
diff --git a/public/projects/segmentation/segmentation_popover.png b/public/projects/segmentation/segmentation_popover.png
diff --git a/src/app/projects/segmentation/layout.tsx b/src/app/projects/segmentation/layout.tsx
@@ -0,0 +1,2 @@
+import Layout from '@/app/prose-layout'
+export default Layout
diff --git a/src/app/projects/segmentation/page.mdx b/src/app/projects/segmentation/page.mdx
@@ -0,0 +1,29 @@
+import SegmentationHover from '../../../../public/projects/segmentation/segmentation_hover.png'
+import SegmentationPopover from '../../../../public/projects/segmentation/segmentation_popover.png'
+import ImageSlider from '../../../components/ImageSlider'
+
+# In-Browser Segmentation
+### 🚀 <a href="https://app.test.camio.com" target="_blank">Try it</a>
+
+## Overview
+- **Date:** 2023
+
+## Description
+A proof of concept utilizing emerging Meta research to enable real-time in-browser segmentation of images. This is massive because historically one of the most time consuming parts of training models is drawing bounding boxes around objects in images. This is a huge step towards automating/removing most of that process. I'm excited to say, Camio was the first company to not only incorporate this cutting edge research into their product, but also to convert it into a form that customers could understand and appreciate its usefulness to them.
+
+## Key Contributions
+- Reverse engineering some of how Meta was formatting the images to send to the model for embeddings
+- Running the model in the browser using WASM and onnx
+- Rendering the embeddings in real time as the user hovered over the image
+- Adding the labelling interface
+- Making it responsive and mobile friendly so we could send it to customers to play with while we were demoing it to them
+
+## Screenshots
+        <ImageSlider imageUrls={[]} images={[SegmentationHover,
+SegmentationPopover]} />
+
+## Challenges and Solutions
+One of the most challenging parts of this (and one of the things I'm most proud of) is that this was cutting edge CV research that had been released just days earlier. It was not really ready for production use, but from a business perspective, enabling Camio to be a first mover here was a competitive advantage. As a result, this was a very high-risk project because there was a tight deadline combined with the fact that I was learning as I went. One of the biggest challenges when building this project was (ironically) figuring out the best way to give the model a correctly formatted image so it could generate the embeddings, then rendering those embeddings in real time for the user as they hovered over the image. This whole process was exciting to me because it really showed me how quickly I could learn and adapt emerging research. Implementing this end to end took ~5 days.
+
+## Credits and Acknowledgements
+Gabby and Paige were massively helpful in getting the model up into the cloud in a secure way!
diff --git a/src/utils/project-helpers.tsx b/src/utils/project-helpers.tsx
@@ -8,6 +8,7 @@ import GptWeightlifting from "../../public/projects/header-images/gpt-weightlift
 import GptSitesmith from "../../public/projects/header-images/gpt-sitesmith.png";
 import GptCoverLetter from "../../public/projects/header-images/gpt-cover-letter.png";
 import GptLingoAmigo from "../../public/projects/header-images/gpt-lingo-amigo.png";
+import Segmentation from "../../public/projects/header-images/segmentation.png";
 
 type ImageId = typeof ProjectId[keyof typeof ProjectId]
 
@@ -20,6 +21,7 @@ enum ProjectId {
     GPT_SITESMITH = "gpt-sitesmith",
     GPT_COVER_LETTER = "gpt-cover-letter",
     GPT_LINGO_AMIGO = "gpt-lingo-amigo",
+    SEGMENTATION = "segmentation",
 };
 
 const getProjectImage = (imageId: ImageId) => {
@@ -40,6 +42,8 @@ const getProjectImage = (imageId: ImageId) => {
             return GptCoverLetter;
         case ProjectId.GPT_LINGO_AMIGO:
             return GptLingoAmigo;
+        case ProjectId.SEGMENTATION:
+            return Segmentation;
         default:
             return "";
     }
@@ -60,6 +64,21 @@ const projects: ProjectCardItems[] = [
             },
         ],
     },
+    {
+        title: "In-Browser Segmentation",
+        description: "A fun demo that utilizes WASM to run real-time segmentation in the browser utilizing the SAM model.",
+        imageId: ProjectId.SEGMENTATION,
+        buttons: [
+            {
+                id: "segmentation",
+                name: "Play with a demo",
+                path: "https://app.test.camio.com",
+                variant: "outline",
+                icon: <ExternalLinkIcon />,
+                isExternal: true,
+            },
+        ],
+    },
     {
         title: "Tech4Good Lab Website",
         description: "A website for the Tech4Good lab - a lab focused on using social computing to support community work.",
Original file line number	Diff line number	Diff line change
		@@ -0,0 +1,2 @@
		import Layout from '@/app/prose-layout'
		export default Layout