Vincent (Linshen) Wu (LinkedIn)
Email: *************@*****.***
Strengths
• Professional team management experience of two Data+AI products with 60+ team members (developers, testers, etc.) for a large technology company
• 8+ years of hands-on experience with Big Data solutions in production as a Big Data Engineer/Cloud Engineer/Machine Learning Engineer
• 12+ years as a Full-Stack (Frontend + Backend) Software Engineer
• Demonstrable hands-on Cloud Engineering experience with both Microsoft Azure and Amazon Web Services (AWS)
• Expert in Machine Learning and Artificial Intelligence (including Generative AI) fields
• DevOps/AIOps Engineering in Continuous Integration and Continuous Deployment (CI/CD) for production application development and deployment automation
• High-quality coding skills in programming languages such as Python, Java, JavaScript, HTML5, C#, C#.NET, PL-SQL, T-SQL, etc.
• Daily data-centric roles involving both RDBMS (Oracle/SQL Server/PostgreSQL) and NoSQL (MongoDB/Firebase) databases
• Johns Hopkins University and Massachusetts Institute of Technology (MIT) certified
Work Experience
1. Product Manager/GenAI Architect
AsiaInfo Technologies,
Chengdu, China. June 2023-May 2024
Responsibilities:
• Product Owner/Manager of two AIGC products: Data Platform (DataOS) and Data Infrastructure (DataInfra) (managing 60+ team members)
• Propose new product features, informed by the latest AI advancements, to the CTO (Chief Technology Officer) and other senior management
• Hands-on experience with Generative AI technologies for marketing texts, images, and videos, and with Large Language Models (LLMs) and model fine-tuning (instruction tuning, SFT, RLHF, etc.)
• Build AI Agents from scratch using LangChain with direct prompting and CoT (Chain-of-Thought)
• Design, develop, and optimize RAG (Retrieval-Augmented Generation) models to facilitate effective information retrieval and generation in conversational AI systems
• Utilize Vector Databases and advanced indexing techniques to efficiently store and retrieve relevant information for conversational contexts
• Strong programming skills in Python, with experience using AI/ML libraries such as TensorFlow, PyTorch, HuggingFace, or similar frameworks
• Innovate and propose new solutions to enhance chat experiences, model outputs, and integrate new tools with Generative AI (ChatGPT, Ollama, Midjourney, Stable Diffusion, etc.)
• Experienced with training and testing Deep Learning/Artificial Intelligence models (Deep Neural Networks) such as Convolutional Neural Networks (CNN), Recurrent Neural Networks (RNN) and Reinforcement Learning (RL)
• Professional in writing Python code for API development using the Flask and FastAPI frameworks
• Run trials and MVPs (Minimum Viable Products), develop and deliver go-to-market launch plans
• Set and track KPIs (Key Performance Indicators) to measure product success
• Collect, organize, and present progress to team leaders and stakeholders
2. Palantir Support Engineer
KForce Inc. (client - Palantir Technologies, Inc.), Washington D.C., USA July 2022-June 2023
Responsibilities:
• Take ownership of reported customer issues and see problems through to resolution
• Troubleshoot technical issues or questions for Palantir Foundry’s data ingestion pipeline and machine learning platform
• Diagnose and fix technical issues, including application and network problems
• Perform root cause analysis for user errors and communicate recommended improvements to our application developers
• Debug users’ Machine Learning models using scikit-learn, TensorFlow and PyTorch in JupyterLab and Databricks Notebooks
• Document all fixes and investigations, and keep a record of issues and solutions
• Develop and implement scripts to replicate reported issues and automate output verification
• Proficient in Linux/Unix (Ubuntu, RedHat, CentOS) system administration
• Follow standard procedures for proper escalation of unresolved issues to the appropriate internal teams
3. Senior Big Data Engineer/Machine Learning Engineer
Bechtel Global Corporation,
Reston VA, USA. Oct 2019-July 2022
Responsibilities:
• Technical Lead for the end-to-end AWS-to-Azure cloud migration (entire platform and data migration)
• Design and build ingestion pipelines for Azure Data Lake using Azure Data Factory, Azure PostgreSQL and Azure Databricks
• Develop and build Microservices-based applications using Docker containers and deploy to Azure Kubernetes Service (AKS)
• Expertise in Microsoft Azure Cloud Services (PaaS & IaaS), Application Insights, Cosmos DB, Internet-of-Things (IoT), Azure Monitoring, Key Vault and SQL Azure/PostgreSQL
• Experience creating and managing self-hosted Elasticsearch clusters including creation of master/worker nodes, Kibana VM, index backup and restoration, etc.
• Hands-on experience in Azure Development, worked on Azure web applications using App Services, Azure Functions, Azure Storage, Azure Virtual Machines, Azure AD, Azure Search
• Daily data ingestion and analytics using Databricks (Apache Spark)
• Apply OpenCV solutions for OCR digital recognition
• Demonstrate a systematic, disciplined and analytical approach to problem solving
• Clean, transform, and analyze vast amounts of raw data from various systems using Spark (PySpark and Scala) to provide ready-to-use data to data scientists and business analysts
• Deep experience in developing data processing tasks using PySpark, such as reading data from external sources, merging data, performing data enrichment, and loading into the data lake and data warehouse
4. Big Data Engineer
Bechtel Global Corporation,
Reston VA, USA. Oct 2016-Oct 2019
Responsibilities:
• Data ingestion from various data sources (RDBMS, NoSQL, CSV, TXT, API, etc.) to HDFS and AWS S3 storage using Apache NiFi, Sqoop, Spark, etc.
• Setting up databases in AWS using RDS including MSSQL, MySQL, MongoDB & DynamoDB
• Built S3 buckets, managed their policies, and used S3 and Glacier for storage and backup on AWS
• Used Jira for defect/issue logging and tracking, and documented all work in Confluence
• Deploy and maintain Azure resources using Azure Resource Manager (ARM) and Terraform (Infrastructure as Code, IaC)
• Using Jupyter Notebook as a web-based IDE for development and testing
• Creating Quantity Dashboard using PowerBI Stack including PowerPivot, PowerQuery, PowerView, PowerMap
• Working with Big Data & Analytics team to get feature tables from the AWS data lake and present the daily status change through PowerBI
• Performance Tuning and Monitoring Oracle and SQL Server Databases using PL-SQL and T-SQL
• Utilizing HTML5 APIs such as Geolocation, LocalStorage, WebStorage, Drag-and-Drop, Offline, etc.
5. Full-Stack Software Engineer
Bechtel Global Corporation,
Frederick MD, USA. Dec 2012-Oct 2016
Responsibilities:
• Creating APIs using both .NET Web API 2 and the NodeJS Express framework to be consumed by AngularJS and ReactJS frontends (JavaScript)
• Developing Lump Sum Market Cost Estimating Tool in Oracle database
• Writing complex PL-SQL and T-SQL Queries, Views, Triggers, Functions, Packages, and Procedures to calculate Direct and Indirect cost at both Market and Detailed level
• Extract, Transform and Load (ETL) Timesheet information among various Oracle database schemas
• Creating SSRS Reports including Crystal Reports, Cross-tab Report and Parameterized Report with BI Tools
• Working closely with other IT functions, Project Controls, and Engineers to gather changing requirements for new software features that fulfill future business goals
• Expert in software development and support, including developing and maintaining web applications, Windows desktop applications and cross-platform (iOS and Android) mobile applications
• Developing Windows Service and file system watcher using C# to auto-trigger BOM (Bill-Of-Materials) report and automatic email notification to users
• Automating SSRS data-driven email subscription and file share subscription
• Experienced with Jira for defect/issue logging and tracking; documented all work in Confluence
6. Business System Analyst
Antra Inc. (formerly I Vision Solutions Inc.)
Sterling, VA, USA. July 2012-Dec 2012
Responsibilities:
• Provide Level-4 software support and software production development support using the BI stack (SSRS/SSIS) for our project controls application system
• Developing programming specifications from business requirements
• Working with other IT functions and team developers in North America, Australia and India in troubleshooting Stored Procedures, Functions and Views to quickly resolve production application support issues and SSRS report issues
• Developing, maintaining, and supporting ETL Tools (SSIS)
Immigration Status
Green Card – No need for Visa sponsorship
Education
Johns Hopkins University Master of Science in Computer Science
(Data Science and Cloud Computing track) 2019
Certification
MIT Sloan – Artificial Intelligence: Implications for Business Strategy
Major Courses
Machine Learning • Applied Machine Learning • Deep Neural Networks • Data Science • Large-Scale Database Systems • Foundations of Algorithms • Computer Architecture • Software Engineering • Big Data Processing Using Hadoop • Mobile Application Development for iOS
Technical Skills Summary
Programming Languages: Python, JavaScript, Java, C#.NET, ADO.NET, PL-SQL, T-SQL, LINQ, Lambda, Pig Latin, Hive SQL, Bash, PowerShell
Big Data Ecosystem: Apache Hadoop/HDFS/S3, Spark, Sqoop, Zeppelin, Linux/Unix
Web Technologies: ReactJS, Angular, ASP.NET, HTML, CSS, JavaScript, jQuery, AJAX, Web Forms
IDE: Microsoft Visual Studio/VS Code, Atom, PyCharm, Jupyter Notebook/Lab
Databases: RDBMS: Oracle, MS SQL Server, PostgreSQL, Amazon RDS; Big Data: HBase, Hive Database; NoSQL: MongoDB, Azure Cosmos DB, Amazon DynamoDB/Cassandra, Firebase
API Development: REST API: WebAPI/WebAPI2, NodeJS, Express, Flask, FastAPI
Version Management: GitHub, Azure Repository, Visual Source Safe (VSS), Team Foundation Server (TFS), Subversion (SVN)
Other Tools: SSIS/SSRS/SSAS, Entity Framework 4.0/4.1/4.5, IIS 6.0/7.0, LINQ, Cloud computing tool (Windows Azure, Azure Virtual Machine, EC2)
References
Available upon request