Senior professional with an understanding of patient and claims analytics within the pharmaceutical data ecosystem. The candidate helps clients use data to drive change by ensuring data quality and crafting data-driven strategies. Their expertise extends beyond analysis to a strong grasp of data quality and data strategy, both pivotal for driving impactful decisions in the pharmaceutical industry.

Professional Experience

KMK Consulting Inc. – Morristown, NJ (Jul 2019 – Present)
Senior Data Analyst
⦿ Maintain and update the Datamart for rare disease products by aggregating data from multiple sources, such as Specialty Pharmacy and Specialty Distributor, using SAS and Alteryx.
⦿ Develop and modify reports, including the daily Datamart report and KPI report, using Excel and SAS.
⦿ Manipulate CMS Open Payments data and generate reports on market trends and competitor analysis.
⦿ Create data visualizations of business metrics to help marketing and sales teams make clear decisions and target clients.
⦿ Derive business insights and analytical solutions by performing diversification analysis on Patient Claims, Payer Medical Claims and Physician data using Python/SQL/Alteryx.
⦿ Outline a patient journey framework to identify specific treatment events and strategic insights, along with disease progression statistics, to support improved quality and safety of patient care.
⦿ Produce ad hoc reports requested by clients to compare the market landscape.
⦿ Evaluate and follow up on project data flows by negotiating data assessment and data strategy with third parties.
⦿ Build a data quality automation system and perform sales trend analysis.

Manufacturing Firm – Bethlehem, PA (Dec 2018 – May 2019)
Data Analyst Consultant
⦿ Processed abnormal data and restructured data for predictive analytics using MySQL and Excel.
⦿ Segmented data by applying the coefficient of variation and seasonal factors in K-means clustering.
⦿ Built sales forecasting and demand planning models based mainly on Holt-Winters, SARIMAX and Prophet.
⦿ Created customized dashboards in Qlik Sense to present forecasting results and important KPIs.

Projects

Flights Delay Analysis and Prediction (Nov 2018 – Dec 2018)
⦿ Merged data files and cleaned noisy data for predictive analytics of delay reasons, airline companies etc.
⦿ Created dummy variables and applied Pearson correlation to filter out variables.
⦿ Split data using K-fold cross-validation, then applied logistic regression, K-nearest neighbors and random forest in both the sklearn and MLlib packages.
⦿ Used confusion matrices and ROC curves to identify the best model, random forest, with an accuracy of 0.88.

Market Analysis Based on New York City Taxi and Limousine Data (Oct 2018 – Nov 2018)
⦿ Processed data and explored trip distances, terminations, fares etc. with SQL and Pandas.
⦿ Clustered trips separately by pick-up and drop-off location using K-means clustering.
⦿ Classified customers’ preferences and presented suggestions to the company using Tableau.

Prediction of the Customers’ Buying Behavior of Fixed Time Deposit (Nov 2017 – Dec 2017)
⦿ Preprocessed data and analyzed customers’ information (job scopes, education level, housing loans etc.).
⦿ Used Pearson correlation and PCA for feature selection.
⦿ Applied decision tree, random forest, K-nearest neighbors and SVM models and compared their performance.
⦿ Tuned parameters and evaluated models to prevent overfitting; random forest proved to be the best classifier.

Technical Skills

Tools: Python, SQL, Alteryx, R, SAS, EViews, Spark, AMPL, MATLAB, Tableau, SPSS, Minitab, C, Qlik Sense, Power BI

Education

Master of Engineering in Industrial & Systems Engineering
Lehigh University, Bethlehem, PA

B.E. in Electrical Engineering & Automation
Beijing Technology & Business University, China
