
Building Google Photos Alternative Using AWS Serverless

Being an avid Google Photos user, I really love some of its features, such as albums, face search, and unlimited storage. However, when Google announced the end of unlimited storage on June 1st, 2021, I started thinking about how I could build a cheaper solution that would meet my photo backup requirements.

“Taking an image, freezing a moment, reveals how rich reality truly is.”

– Anonymous

Application Architecture


Google offers 100 GB of storage for 130 INR, which can be used across various Google applications. However, I don’t use all that space in one go. I snap photos randomly: sometimes I visit places and take random snaps with my DSLR and smartphone, so in general I upload approximately 200 photos monthly. These photos range from 4 MB to 30 MB in size, so on average I use about 4 GB of storage per month backing up raw photos (even the bad ones) to my external hard drive. Photos backed up on the cloud should be visually high-quality, and it’s good to have a raw copy available at the same time so that you can make some Lightroom edits (although I never touch them 😛). So, here are my minimal requirements:

  • Should support social authentication (Google sign-in preferred).
  • Photos should be stored securely in raw format.
  • Storage should be scaled with usage.
  • Uploading and downloading photos should be easy.
  • Web view for preview would be a plus.
  • Should involve almost no operational headache, and the solution should be as cheap as possible 😉.

Selecting Tech Stack

To avoid the operational headaches of servers going down, scaling, application crashes, and overall monitoring, I opted for a serverless solution on AWS. Amazon S3 is infinitely scalable storage, and you only pay for the amount of storage you use. On top of that, you can opt for an S3 storage class that is efficient and cost-effective.

- Infrastructure Stack

1. AWS API Gateway (HTTP API)
2. AWS Lambda (for processing images and API gateway queries)
3. DynamoDB (for storing image metadata)
4. AWS Cognito (for authentication)
5. AWS S3 Bucket (for storage and web application hosting)
6. AWS Certificate Manager (to use an SSL certificate for a custom domain with API gateway)

- Software Stack

1. Node.js
2. ReactJS and Material-UI (front-end framework and UI)
3. AWS Amplify (for simplifying the auth flow with Cognito)
4. Sharp (high-speed Node.js library for converting images)
5. Express and serverless-http
6. Infinite Scroller (for gallery view)
7. Serverless Framework (for ease of deployment and Infrastructure as Code)

Sequence Diagram

Create S3 Buckets:

We will create three S3 buckets: one for hosting the frontend application (refer to the architecture diagram; more on this in the build and hosting part), a second for temporarily uploading images, and a third for actual backup and storage (enable server-side encryption on this bucket). Images uploaded to the temporary bucket will be pre-processed before reaching final storage.

During pre-processing, we will resize the original image into two different sizes: one for thumbnail purposes (400px width) and another for viewing purposes but with reduced quality (webp format). Once images are resized, we upload all three versions (raw, thumbnail, and webview) to the third S3 bucket and create a record in DynamoDB. Set up an object expiry policy of 1 day on the temporary bucket, so that uploaded objects are automatically deleted from it.
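If you let the Serverless Framework create the temporary bucket, the 1-day expiry can be declared as a lifecycle rule on the bucket resource itself. Here is a minimal sketch as a CloudFormation resource in the serverless.yml resources section (the bucket name is a placeholder; if you create the bucket manually, set the same rule in the S3 console instead):

```yaml
resources:
  Resources:
    TempUploadBucket:
      Type: AWS::S3::Bucket
      Properties:
        BucketName: my-temp-upload-bucket   # placeholder name
        LifecycleConfiguration:
          Rules:
            - Id: ExpireUploadsAfterOneDay
              Status: Enabled
              ExpirationInDays: 1           # objects auto-deleted after 1 day
```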

Setup trigger on the temporary bucket for uploaded images:

We will need to set up an S3 PUT event, which will trigger our Lambda function to download and process images. We will filter on the suffixes .jpg and .jpeg for the event trigger, meaning that any file with extension .jpg or .jpeg uploaded to our temporary bucket will automatically invoke the Lambda function with an event payload. Using the event payload, the Lambda function will download the uploaded file and process it. Your serverless function definition would look like:

CODE: https://gist.github.com/velotiotech/7825ef993e95a060e2593b3a9d1b26f0.js

Notice that in the YAML events section, we set “existing: true”. This ensures that the bucket will not be created during the serverless deployment. However, if you don’t plan to create your S3 bucket manually, you can let the framework create it for you.
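For readers who cannot open the gist, the events section of such a function definition in serverless.yml looks roughly like this (handler and bucket names are placeholders):

```yaml
functions:
  processImage:
    handler: handler.processImage         # placeholder handler
    events:
      - s3:
          bucket: my-temp-upload-bucket   # the manually created temporary bucket
          event: s3:ObjectCreated:Put
          rules:
            - suffix: .jpg
          existing: true                  # don't create the bucket on deploy
```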

DynamoDB as the Metadata DB:

AWS DynamoDB is a key-value document database that suits our use case. DynamoDB will help us retrieve the list of available photos in time order. DynamoDB uses a primary key to uniquely identify each record. A primary key can be composed of a hash key and a range key (also called a sort key); the range key is optional. We will use the federated identity ID (discussed in the authorization setup) as the hash key (partition key), with the attribute named username and type string. For the range key, we will use an attribute named timestamp with type number. The range key will let us query results as a time series (Unix epoch). We could also use DynamoDB secondary indexes to sort results more specifically; however, to keep the application simple, we’re going to opt out of this feature for now. Your serverless resource definition would look like:

CODE: https://gist.github.com/velotiotech/4b69c55c6529bd3e800e89d50d6cbfcd.js
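To see how the hash and range keys are used at read time, here is a hypothetical helper that builds the DynamoDB Query parameters for one page of a user's photos, newest first. The table name and page size of 10 are assumptions for illustration; pagination via ExclusiveStartKey is what powers the gallery view discussed later.

```javascript
// Builds DynamoDB Query params for one page of a user's photos.
// Attribute names (username, timestamp) match the schema described above;
// the table name and page size are illustrative placeholders.
function buildPhotoQueryParams(identityId, lastEvaluatedKey) {
  const params = {
    TableName: 'photos-metadata',          // placeholder table name
    KeyConditionExpression: 'username = :u',
    ExpressionAttributeValues: { ':u': identityId },
    ScanIndexForward: false,               // descending timestamp => newest first
    Limit: 10,                             // one page of results
  };
  if (lastEvaluatedKey) {
    // Resume from where the previous page stopped.
    params.ExclusiveStartKey = lastEvaluatedKey;
  }
  return params;
}
```

The returned object can be passed straight to `DocumentClient.query`; the response's `LastEvaluatedKey` feeds the next call.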

Finally, you also need to set up an IAM role so that the image-processing Lambda function has access to the S3 bucket and DynamoDB. Here is the serverless definition for the IAM role.

CODE: https://gist.github.com/velotiotech/364a73d9e4fea61b121079690877d6de.js

Setup Authentication:

Okay, to set up a Cognito user pool, head to the Cognito console and create a user pool with the config below:

1. Pool Name: photobucket-users

2. How do you want your end-users to sign in?

  • Select: Email Address or Phone Number
  • Select: Allow Email Addresses
  • Check: (Recommended) Enable case insensitivity for username input

3. Which standard attributes are required?

  • email

4. Keep the defaults for “Policies”

5. MFA and Verification:

  • I opted to manually reset the password for each user (since this is an internal app)
  • Disabled user verification

6. Keep the default for Message Customizations, tags, and devices.

7. App Clients:

  • App client name: myappclient
  • Let the refresh token, access token, and id token be default
  • Check all “Auth flow configurations”
  • Check enable token revocation

8. Skip Triggers

9. Review and create the pool

Once created, go to App Integration -> Domain Name. Create a Cognito subdomain of your choice and note it down. Next, I plan to use the Google sign-in feature with Cognito Federated Identity Providers. Use this guide to set up a Google social identity with Cognito.

Setup Authorization:

Once the user identity is verified, we need to allow them to access the S3 bucket with limited permissions. Head to the Cognito console, select Federated Identities, and create a new identity pool. Follow these steps to configure it:

1. Identity pool name: photobucket_auth

2. Keep Unauthenticated and Authentication flow settings unchecked.

3. Authentication providers:

  • User Pool ID: Enter the user pool ID obtained during authentication setup
  • App Client ID: Enter the app client ID generated during the authentication setup (Cognito user pool -> App Clients -> App client ID)

4. Setup permissions:

  • Expand view details (Role Summary)
  • For authenticated identities: edit policy document and use the below JSON policy and skip unauthenticated identities with the default configuration.

CODE: https://gist.github.com/velotiotech/7b07cf393802bdbf6b4a5f20b9600f7b.js

${cognito-identity.amazonaws.com:sub} is a special AWS policy variable. When a user authenticates with a federated identity, they are assigned a unique identity ID. The above policy means that any authenticated user has access only to objects prefixed by their own identity ID. This is how we confine each user’s authorization to a limited area within the S3 bucket.

Copy the Identity Pool ID (from the sample code section). You will need this in your backend to get the identity ID of the authenticated user via a JWT token.

Amplify configuration for the frontend UI sign-in:

This object helps you set up the minimal configuration for your application. This is all that we need to sign in via Cognito and access the S3 photo bucket.

CODE: https://gist.github.com/velotiotech/3a3cb00d52668f84124b7c58e986c20c.js

You can then use the below code to configure and sign in via social authentication.

CODE: https://gist.github.com/velotiotech/7f6dc48f9328c8a4589882de62630073.js

Gallery View:

When the application loads, we use the PhotoGallery component to load photos and view thumbnails on the page. The PhotoGallery component is a wrapper around the InfiniteScroller component, which keeps loading images as the user scrolls. The idea here is that we query a maximum of 10 images in one go. Our backend returns a list of 10 images (just the map and metadata pointing into the S3 bucket). We then load these images from the S3 bucket and show the thumbnails on-screen as a gallery view. When the user reaches the bottom of the screen, or there is empty space left, the InfiniteScroller component loads 10 more images. This continues until our backend replies with a stop marker.
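The load-until-stop-marker behaviour can be sketched independently of React. Assuming a hypothetical fetchPage(cursor) that resolves to { items, nextCursor }, with a null nextCursor acting as the stop marker, the accumulation loop looks like:

```javascript
// Accumulates pages of photo metadata until the backend signals that
// nothing is left. fetchPage is injected so this sketch stays
// independent of the actual ajax layer; all names are illustrative.
async function loadAllPages(fetchPage) {
  const photos = [];
  let cursor = null;
  let hasMore = true;
  while (hasMore) {
    const { items, nextCursor } = await fetchPage(cursor);
    photos.push(...items);           // append this page's thumbnails
    cursor = nextCursor;
    hasMore = nextCursor !== null;   // null cursor = stop marker
  }
  return photos;
}
```

In the real component, InfiniteScroller performs the equivalent of one loop iteration per scroll event rather than draining all pages at once.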

The key point here is that we need to send the JWT token as a header to our backend service via an ajax call. The JWT token is obtained after sign-in from the Amplify framework. An example of obtaining a JWT token:

CODE: https://gist.github.com/velotiotech/db4230b457b79f67c7767eeeb5f1c806.js

An example of the infinite scroller component usage is given below. Note that “gallery” is a JSX-composed array of photo thumbnails. The “loadMore” method calls our ajax function against the server-side backend, updates the “gallery” variable, and sets the “hasMore” variable to true/false so that the infinite scroller component can stop querying when there are no photos left to display on the screen.

CODE: https://gist.github.com/velotiotech/c2293d6609739f1af9ae9da3d343b196.js

The Lightbox component gives a zoom effect to the thumbnail. When a thumbnail is clicked, a higher-resolution picture (the webp version) is downloaded from the S3 bucket and shown on the screen. We use the Storage object from the Amplify library. The downloaded content is a blob and must be converted into image data. To do so, we use the native JavaScript method URL.createObjectURL. Below is sample code that downloads the object from the S3 bucket and then converts it into a viewable image for the HTML IMG tag.

CODE: https://gist.github.com/velotiotech/b83f1a4504c7e1f2b4e407270972f944.js

Uploading Photos:

The S3 SDK lets you generate a pre-signed POST URL. Anyone who gets this URL will be able to upload objects to the S3 bucket directly without needing credentials. Of course, we can set up some boundaries, like a maximum object size, the key of the uploaded object, etc. Refer to this AWS blog for more on pre-signed URLs. Here is the sample code to generate a pre-signed URL.

CODE: https://gist.github.com/velotiotech/3683da7f61c099f32b1f19892aa6300e.js

For a better UX, we can allow our users to upload more than one photo at a time. However, a pre-signed URL lets you upload only a single object. To overcome this, we generate multiple pre-signed URLs. First, we send a request to our backend asking to upload photos with the expected keys; this request originates once the user selects photos to upload. Our backend then generates the pre-signed URLs for us, and our frontend React app provides the illusion that all photos are being uploaded as a whole.
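One way to sketch the multi-file flow, with both the backend call and the actual POST injected as functions (requestPresignedPosts and postToS3 are hypothetical names standing in for the real ajax calls):

```javascript
// Requests one pre-signed POST per selected file, then runs all
// uploads in parallel so the UI can treat them as a single batch.
async function uploadPhotos(files, requestPresignedPosts, postToS3) {
  // Ask the backend for one pre-signed POST per expected key.
  const keys = files.map((f) => f.name);
  const presigned = await requestPresignedPosts(keys);
  // Fire all uploads at once; Promise.all resolves when every file is up.
  const results = await Promise.all(
    files.map((file, i) => postToS3(presigned[i], file))
  );
  return results;
}
```

Uploading in parallel also keeps the total wall-clock time close to that of the largest single file, rather than the sum of all of them.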

When the upload is successful, the S3 PUT event is triggered, which we discussed earlier. The complete flow of the application is given in a sequence diagram. You can find the complete source code here in my GitHub repository.

React Build Steps and Hosting:

The usual way to build a React app is to execute npm run build. However, we take a slightly different approach. We are not using an S3 static website for serving the frontend UI, for one reason: S3 static websites are non-SSL unless we use CloudFront. Therefore, we will make the API gateway our application’s entry point, and the UI will also be served from the API gateway. However, we want to reduce the calls made to the API gateway, so we will only deliver the index.html file with the help of API Gateway/Lambda and serve the rest of the static files (the React supporting JS files) from the S3 bucket.

Your index.html should have all its reference paths pointed to the S3 bucket. The build must explicitly specify that static files are located somewhere other than relative to the index.html file. Your S3 bucket needs to be public, with the right bucket policy and CORS set, so that end-users can only retrieve files and not upload nasty objects. Those who are confused about how an S3 static website differs from a public S3 bucket may refer here. Below are the React build steps, bucket policy, and CORS.

CODE: https://gist.github.com/velotiotech/82d5396ab375105c34b4c663ce6eafcd.js

Once the build is complete, upload index.html to the Lambda that serves your UI. Run the shell commands below to compress the static contents and host them on our static S3 bucket.

CODE: https://gist.github.com/velotiotech/26f86e4dfcb32eb1a6f624f08d3a362a.js

Our backend uses the Node.js Express framework. Since this is a serverless application, we need to wrap Express with the serverless-http framework so it works with Lambda. Sample source code is given below, along with the serverless framework resource definition. Notice that, except for the UI home endpoint ( "/" ), the rest of the API endpoints are authenticated with Cognito on the API gateway itself.

CODE: https://gist.github.com/velotiotech/776965bd23d4a17b390da46cd0ae931e.js

CODE: https://gist.github.com/velotiotech/57dfc810ad21fd2288a3260d8e8dd6a5.js

Final Steps:

Lastly, we will set up a custom domain, so that we don’t need to use the gibberish domain name generated by the API gateway, along with a certificate for our custom domain. You don’t need to use Route 53 for this part; if you have an existing domain, you can create a subdomain and point it to the API gateway. First things first: head to the AWS ACM console and generate a certificate for the domain name. Once the request is generated, you need to validate your domain by creating a DNS validation record as per the ACM console. ACM is a free service. Domain verification may take a few minutes to several hours. Once you have the certificate ready, head back to the API gateway console, navigate to “custom domain names”, and click create.

  1. Enter your application domain name
  2. Check TLS 1.2 as TLS version
  3. Select Endpoint type as Regional
  4. Select ACM certificate from dropdown list
  5. Create domain name

Select the newly created custom domain. Note the API gateway domain name from Domain Details -> Configuration tab. You will need this to map a CNAME/ALIAS record with your DNS provider. Click on the API mappings tab. Click configure API mappings. From the dropdown, select your API gateway, select stage as default, and click save. You are done here.

Future Scope and Improvements:

To improve application latency, we can use CloudFront as a CDN. That way, our entry point could be S3, and we would no longer need the API gateway regional endpoint. We can also add AWS WAF in front of our API gateway as added security, to inspect incoming requests and payloads. We can also use DynamoDB secondary indexes so that we can search metadata in the table efficiently. With a lifecycle rule, raw photos can be transitioned to the S3 Glacier storage class after a year, and you can further add a Glacier Deep Archive transition to save more on storage costs.
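The Glacier transitions could be sketched as a lifecycle configuration on the storage bucket. Note that S3 lifecycle transitions fire on object age, not last access; the key prefix and day counts below are assumptions for illustration:

```yaml
LifecycleConfiguration:
  Rules:
    - Id: ArchiveRawPhotos
      Status: Enabled
      Prefix: raw/                    # assumed key prefix for raw photos
      Transitions:
        - StorageClass: GLACIER       # cheaper storage after one year
          TransitionInDays: 365
        - StorageClass: DEEP_ARCHIVE  # cheapest tier after two years
          TransitionInDays: 730
```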
