Page cover

Overview

Welcome to the official documentation for File Deduplication for ownCloud using SHA-256 and AWS Lambdaa serverless, scalable solution for reducing redundant file storage across users, powered by the AWS ecosystem and integrated with ownCloud.

Whether you're building a cloud storage service, optimizing storage for teams, or simply want to learn how deduplication works in real-world applications — you’re in the right place.

This documentation walks you through the full project lifecycle — from setup to deployment, and all the cloud services stitched in between. Each chapter gives you code, config, and context so you can build or extend it your way.

The project is divided into multiple steps (chapters):

  1. Getting Started: Install the required tools, set up your AWS CLI, and configure environment prerequisites.

  2. EC2 Instance Setup : Launch and secure your Ubuntu EC2 instance to serve ownCloud(or your own UI) or manage deployments.

  3. ownCloud Setup (Optional) : Install and configure ownCloud on your EC2 instance. Enable public file upload access and connect it with AWS services.

  4. ownUI Setup: nstall and configure your own UI on your EC2 instance. Enable public file upload access and connect it with AWS services.

  5. S3 Bucket Setup: Create and configure an S3 bucket to store deduplicated files with versioning, encryption, and secure access for Lambda.

  6. DyanmoDB Setup : Create the DynamoDB table structure for storing file hash, user associations, and S3 keys.

  7. Lamba Function Setup : Create Lambda function toused to handle file uploads, deduplication, and deletions.

  8. IAM Roles & Policies : Create and attach secure IAM roles with the least privilege needed for Lambda to access S3, DynamoDB, and other services.

  9. API Gateway Configuration: Set up HTTP API routes to expose Lambda functions as REST endpoints for uploading and deleting files.

  10. API Testing (using Postman): Validate the upload and delete functionalities by sending structured API requests and checking the deduplication logic in action.

Overview of DeDuper.

🔗 Reach out to us:

Jump right in

Last updated