Software Engineer at Meta

Building backend, infrastructure, and AI-native product systems.

I'm Peng Zhang, a software engineer focused on backend and platform problems, product-facing systems, and developer tooling. My work spans API design, orchestration, retrieval pipelines, streaming interfaces, and applied AI systems that need to work reliably in practice.

I bring a builder's mindset with research depth: I like shipping useful systems, reducing complexity, and staying close to fast-moving areas like LLM applications, foundation models, and trustworthy ML.

Portrait of Peng Zhang
Current Software Engineer, Meta
Focus Infra, Product, Foundation Models
Background Systems building + applied ML research

About

I build software that combines systems thinking, product sense, and applied AI.

The work I enjoy most sits at the intersection of backend infrastructure, developer productivity, and intelligent user-facing systems. This site highlights a few projects that best reflect that mix.

Focus

What I build

Backend Systems

APIs, orchestration layers, data pipelines, persistence, and service architecture.

Developer Tools

Workflow automation, code intelligence, review systems, and tools that improve engineering leverage.

AI Applications

LLM-powered products, agent pipelines, retrieval systems, and pragmatic model-driven UX.

Profile

How I work

Systems Mindset

I like clean interfaces, clear abstractions, and systems that stay understandable as they grow.

Product Taste

I care about building things people can actually use, not just technically interesting demos.

Research Fluency

I can move between implementation and research contexts, especially around LLMs, retrieval, and ML systems.

Stack

Tech Stack

C++
Java
Python
PHP
TypeScript
Linux
FastAPI
Next.js
React
SQL
Docker
Kubernetes
LLM Systems
ML Infrastructure

Selected Work

Featured projects

FastAPI / Next.js / DSPy / Qdrant / SQLite

ResearchPilot

A full-stack research copilot that retrieves papers, extracts structured findings, synthesizes literature, and drafts citation-aware related work through a live, streaming workflow.

  • Built a production-style architecture with FastAPI, Next.js, SSE streaming, report history, and semantic search.
  • Designed an agent pipeline across retrieval, extraction, synthesis, and writing with persistent storage and vector search.
Python / FastAPI / LangGraph / Next.js

RepoReviewer

A local-first code review system for public GitHub repositories and pull requests, built around a multi-agent review pipeline and a shared backend for both CLI and web workflows.

  • Combined CLI and web interfaces around a reusable review engine with FastAPI APIs and live status updates.
  • Added prioritization, artifact generation, and file filtering for practical large-repository review workflows.
Python / OpenRouter / NLP / Simulation

CLARA Medical Dialogue Simulation

A prototype dialogue simulation system for history taking and diagnostic reasoning, supporting both deterministic and LLM-enhanced analysis paths.

  • Built a modular pipeline spanning transcription, semantic alignment, feedback generation, and performance analytics.
  • Designed graceful fallback from LLM-enhanced flows to deterministic execution paths.

Research

Publications

2025

Hybrid Deep Learning Framework for Enhanced Melanoma Detection

Peng Zhang, Divya Chaudhary

IEEE Transactions on Computational Biology and Bioinformatics, 2025

Leveraging Clinical Record Geolocation for Improved Alzheimer's Disease Diagnosis Using DMV Framework

Peng Zhang, Divya Chaudhary

Biomedicines, 2025

Beyond Detection: Predicting Code-Switch Points in Multilingual Conversations

Jiawen Xie, Peng Zhang, Jayash Koshal, Shanu Sushmita

WiML Workshop at NeurIPS, 2025

When Pain Hides in Silence: ML-Driven Flare-Up Prediction for Hidradenitis Suppurativa Using Synthetic Patient Data

Shagun Saboo, Peng Zhang, Rishabh Jain, Adwita Arora, Krish Chopra, Sandeep Kumar, Divya Chaudhary

GenAI4HealthInfo at IJCAI, 2025

Leveraging Geolocation in Clinical Records to Improve Alzheimer's Disease Diagnosis Using DMV Framework

Peng Zhang, Divya Chaudhary

arXiv preprint, 2025

2024

Hybrid Deep Learning Framework for Enhanced Melanoma Detection

Peng Zhang, Divya Chaudhary

BIOKDD, 2024

Experience

Professional timeline

2025 - Present

Software Engineer

Meta

Working across infrastructure, product and ecosystem systems, and ML foundation model initiatives.

2024 - 2025

Research Assistant

Northeastern University, advised by Yifan Hu

Worked on applied machine learning and research engineering projects.

2024 - 2025

Research Assistant

Generative AI Research Lab, Northeastern University, advised by Shanu Sushmita

Contributed to research around generative AI, NLP, and multilingual systems.

2024 - 2025

Teaching Assistant

Northeastern University

Courses: CS6240 Large-Scale Parallel Data Processing and CS6140 Machine Learning.

2024 - 2025

Peer Mentor

Mentor Collective

Mentored students in the MSc in Computer Science Align program.

2024 - 2025

Student Chair

AAAI 2025 Workshop on Large Language Models and Generative AI for Health

Helped coordinate workshop operations for a research venue focused on LLMs and generative AI.

2020 - 2022

Software Engineer

Meltwater

Worked on dashboard platform infrastructure and data pipeline systems.

2018

Founder (of SCOPE Camp)

JA Worldwide, Shanghai, China

Education

Academic background

Northeastern University logo

Northeastern University

2022 - 2024

Khoury College of Computer Sciences

MS in Computer Science

Shanghai University logo

Shanghai University

2016 - 2020

BS in Business and Information System