Sirui Chen·sirui.dev

Resume

Sirui Chen

Full-stack engineer building backend systems, cloud infrastructure, distributed systems, and AI/NLP applications.

Download as PDF

One-page resume, kept in sync with this page.

Download resume.pdf

Summary

University of Pennsylvania CIS M.S.E. candidate (May 2026) with a B.S. in Computer Science and a second major in Mathematical Science from Washington University in St. Louis (CS GPA 3.98/4.00). I build React web apps, backend APIs, PostgreSQL data systems, distributed search infrastructure, OS components, and ML/NLP research pipelines. I care about readable code, product workflows, scalable systems, and cross-functional execution.

Location: Philadelphia, PA · Work authorization: Green Card

Education

University of Pennsylvania

Philadelphia, PA · Aug 2024May 2026 (Expected)

M.S.E. in Computer and Information Science

  • Coursework: Internet and Web Systems, Database and Information Systems, Natural Language Processing, Machine Learning

Washington University in St. Louis

St. Louis, MO · Aug 2020May 2024

B.S. in Computer Science; Second Major in Mathematical Science

  • CS GPA: 3.98 / 4.00
  • Overall GPA: 3.84 / 4.00
  • Focus: full-stack development, systems, security, mobile apps, and OS labs

Experience

Research Assistant — Prof. Chris Callison-BurchUniversity of Pennsylvania

May 2025Aug 2025

  • Built and evaluated retrieval-augmented generation workflows for DARPA SciFy and OpenScholar using score-based filtering, contrastive model work, and domain data preparation.
  • Turned research goals into reusable scripts, datasets, and documentation so model and retriever changes could be tested consistently across the lab.

Python Developer InternCambioML

Dec 2023Mar 2024

  • Built an LLM-powered data portal with Danswer AI to search datasets from AWS, Datarade, and Snowflake across ingestion, indexing, and user-facing discovery workflows.
  • Wrote Python crawlers and cleaners for 5,000+ datasets; resolved metadata issues that affected search relevance and downstream data quality.
  • Automated AWS search infrastructure with CDK and deployment scripts for reproducible development, testing, and release support.

Lab Assistant — Vision-Assisted Self-Driving F1Tenth CarWashington University in St. Louis

Aug 2023May 2024

  • Improved model robustness in glare and shadow conditions through targeted data augmentation, experiment tracking, and evaluation.
  • Built a ROS accelerometer driver for sensor integration and applied self-attention distillation to improve convergence speed by 20%.

Data Processing InternGuwave Technology Co., Ltd.

May 2023Aug 2023

  • Designed Python ETL modules and a unified schema for 10+ semiconductor clients, supporting analytics and reporting workflows.
  • Reduced data processing time by 50% through modularization, stronger error handling, and reusable parsing logic.

Toolchain Development InternBlack Sesame Technology Co., Ltd.

Jun 2022Jul 2022

  • Built test automation for quantized neural networks on embedded systems, with MD5 checks, concurrent multi-device deployment, and structured client reports.

Projects

sirui.dev — Full-Stack Portfolio on AWS EC2

A typed Next.js portfolio shipped end-to-end: app, Nginx reverse proxy, PM2 process manager, Let's Encrypt HTTPS, and a GitHub Actions deploy pipeline running on Ubuntu EC2.

Next.js 16React 19TypeScript (strict)Tailwind CSS 4shadcn/uiAWS EC2 (Ubuntu)NginxPM2CertbotGitHub Actions

SafeCommerce-LA

Authenticated dashboards, maps, charts, filters, and CSV export over a PostgreSQL schema that joins LA crime data with California retail sales records. Graded 99/100.

ReactPythonPostgreSQLJWTSQL OptimizationREST APIs

Distributed Web Search Engine

End-to-end distributed search platform with a custom crawler, indexer, PageRank, replicated key-value store, multithreaded web server, and an RDD-style compute layer named Flame.

JavaConcurrencyNetworkingDistributed SystemsCustom KVSRDD-style Compute

PennOS — A Unix-Like Educational OS

C implementation of Unix-like OS features: shell commands, process scheduling, signal handling, and a PennFAT filesystem with full error-path testing.

COperating SystemsShellsSchedulingSignalsFilesystems

MasterCard Onboarding Automation

Full-stack onboarding tool with structured questionnaires, input validation, standardized submissions, and backend APIs for client intake.

ReactREST APIsForm ValidationNode.js

Kinexus — WUSTL Medical Research Application

Fixed Apple Watch sync issues, refactored REST APIs, optimized AWS RDS queries, and wrote developer documentation that reduced onboarding time for new contributors.

REST APIsAWS RDSApple WatchSQLDocumentation

Skills

Languages

JavaScript, TypeScript, Python, Java, C/C++, SQL, Shell

Frontend & Product

React, Next.js, Tailwind CSS, UI workflows, Structured forms, Input validation, API integration, Product debugging

Backend & Data

Node.js, Express, Tornado, REST APIs, PostgreSQL, MySQL, Snowflake, ETL, Dashboards, Query optimization

Cloud & Systems

AWS, AWS CDK, EC2, Docker, Nginx, PM2, Distributed systems, Concurrency, Networking, Operating systems, ROS

Engineering Quality

Testing, Debugging, Performance checks, Documentation, Maintainable code, Reproducible scripts