Projects

Application of Natural Language Processing for Phishing Detection Using Machine and Deep Learning Models

The prevalence of phishing scams on the Internet served as the inspiration of the research article. The study utilized Machine Learning techniques such as Logistic Regression, Multinomial Naive Bayes, and Deep Learning models such as Long Short Term Memory (LSTM), Gated Recurrent Units, and their bidirectional counterparts on the procured dataset. The results were satisfactory but the LSTM and GRU had the highest training and validation score with an overall accuracy of 95%.

NLP Phishing Image
[Publication Site]

Sentiment Analysis on Ice Skating Reviews in Philippines

This is a notebook project conducted to determine if I can expect to have a good time ice skating in the aforementioned country which is likely based on the results. For this project, review data from different branches (i.e., Mall of Asia, Megamall, Seaside City Cebu) were scraped from Google Maps which were then cleaned. Subsequently, the cleaned data were used to perform exploratory data analysis and sentiment analysis with visualizations.

NLP Phishing Image
[GitHub Repo]

Design of Footage Filtering System for CCTV Archives using Object Detection

This project was made for the completion of our capstone for Design Project course in our program. The general objective of this project is to design a CCTV footage filtering system through human clothing and weapons using object detection. Furthermore, the system aims to assist in finding the relevant clips from the footage. PyQt6 was used for implementing the algorithm and YOLOv5 from Ultralytics for the detection algorithm. Moreover, it uses fast-colorthief for classifying the color of detected clothings.

NLP Phishing Image
[GitHub Repo]

IoT Data Pipeline

This project implements an Extract-Transform-Load (ETL) pipeline for IoT data using Kafka for streaming and MySQL for storage. It is designed to handle real-time ingestion of IoT device data, process it, and persist it into a relational database for further analytics. The project uses Docker to run Kafka and MySQL. Although the mode of data fetching is via API, the project is designed to easily change the method of extraction, possibly directly from a machine PLC. Thus, this project is a good template for persisting real-time IoT data.

IoT Data Source: https://thingspeak.mathworks.com/channels/1293177/feeds

NLP Phishing Image
[GitHub Repo]

CDC for Job Order Tracking System

A real-time monitoring system that tracks manufacturing job orders by capturing live database changes using Debezium CDC and streaming them through Apache Kafka. The system processes the event streams and visualizes updates instantly on a Streamlit dashboard, providing real-time visibility of production progress without manual refresh.

NLP Phishing Image
[GitHub Repo]

Prediction of House Prices in the Philippines

This project aims to predict house prices in the Philippines given the following factors: Number of Bedrooms, Number of Bathrooms, Floor Area, Land Area, and Urbanicity. For data wrangling and feature engineering, Numpy and Pandas were used. For Machine Learning modeling, Scikit-Learn was used wherein hyperparameter optimization across three models were performed. These models are RandomForestRegressor(), SVR(), and StackingRegressor(). It was observed that the errors were lower for houses with lower prices which was reflected by the target values because its distribution is positively skewed, that is - contains more houses with lower prices. A basic Streamlit application was developed to demonstrate the deployment of the model.

NLP Phishing Image
[App Demo] [GitHub Repo]

Time Series Analysis and Forecasting on Coca-cola Stocks

This project aims to explore time series analysis on stocks data. It also aims to explore time series forecasting using non-deep learning (i.e., ARIMA and SARIMAX) and deep learning (i.e., Long Short Term Memory (LSTM)) techniques. Various error metrics were used to asses the performance of each technique wherein LSTM had the least value across all error metrics. Thus, it was used to perform the final step which is forecasting.

NLP Phishing Image
[GitHub Repo]

Create, Read, Update, and Delete (CRUD) App - CSIBALE

This project was made for the completion of my Software Design course along with my colleagues. The client is a non-government organization (NGO) with a mission to assist and create opportunities for children with disabilities which are their beneficiaries. To conveniently keep records of their beneficiaries' information, this Create-Read-Update-Delete (CRUD) application was developed.

NLP Phishing Image
[GitHub Repo]

SpotInsights

I am interested on having a look on my music taste and I happen to stumble upon an application called "Receiptify" which exactly does that. Thus, I made an application inspired from it. This is a Flask application that provides insights and visualizations regarding the user's tracks and artists played on Spotify using their API.

NLP Phishing Image
[GitHub Repo]

Coniculator

This application was my final project during my second-year college for my Object Oriented Programming (OOP) course. It is a PyQt5 Application for calculating values (e.g., center, foci, vertices, length of latus rectum, eccentricity, etc.) needed given a template conic equations to plot the following conic shapes: Circle, Ellipse, Parabola, Hyperbola.

NLP Phishing Image
[GitHub Repo]

About Me

Tech-savvy professional with experience in building and maintaining data pipelines. Skilled in data transformation, ETL processes, and client collaboration to deliver reliable data solutions. Committed to continuous learning, professional growth, and bridging the gap between data engineering and machine learning

Education

2019 - 2024

College (Bachelor of Science in Computer Engineering)
Technological Institute of the Philippines - Quezon City

2017 - 2019

Senior High School (STEM Track)
Technological Institute of the Philippines - Quezon City

2013 - 2017

Junior High School
Ramon Magsaysay Cubao High School

Tech Stack

Category Technologies
Languages/DB Python, SQL, Bash
Frameworks/Libraries Pandas, SQLAlchemy, Numpy, FastAPI, Flask, Selenium, BeautifulSoup, Streamlit, Scikit-Learn, Matplotlib, Seaborn, PyTorch
Applications and Tools Git, GitHub, Anaconda, Databricks, Docker
Basic Web Development HTML, CSS, Javascript

Work Experience

September 2024 - Current

Data Engineer
Philippine Batteries Incorporated

  • Designs, builds, and maintains data pipelines and web applications to extract and deliver production-related data for business analytics and operational use. These solutions enabled the standardization of battery production data, simplifying access for engineers. They also streamlined KPI monitoring and related metrics to help ensure consistency and quality across manufactured units.
  • Designs and maintains automated data pipelines that reliably transfer data to the cloud on a scheduled basis.
  • Developed and integrated real-time alarm systems within the digital workspace to facilitate rapid detection and response to production anomalies.
  • Worked on development of a Computer Vision-enabled system to meet the stakeholder demands based on consumer feedback.
  • Oversees full project lifecycles, from requirements analysis to post-deployment support, partnering with cross-functional teams to develop real-time monitoring solutions.
  • Regularly communicates project updates to users and stakeholders to ensure transparency and alignment.

February 2023 - July 2023

Project Intern
Ramcar Technologies Inc.

  • Trained and evaluated Machine Learning (ML) algorithms for condition-based maintenance of machines using Scikit-Learn.
  • Performed data preprocessing and analysis on acquired data using Microsoft Excel and Python libraries such as Pandas, Numpy, and Matplotlib.
  • Regularly presented progress reports regarding the project to supervisors and clients.
  • Provided documentation for trained models which includes instructions on how to use them.

November 2019 - January 2020

Intern
SMS Global Technologies Inc.

  • Manually tested Subscriber Identity Module (SIM) cards
  • Performed spreadsheet works using Microsoft Excel to organize data

Contact

"Get in touch anytime—I'd love to hear from you!"

  • Email: jerickodg2000@gmail.com

Socials

Certs & Badges

Data Engineer

DataCamp | June 2024 - June 2026

Show credential

Data Engineer Associate

DataCamp | May 2024 - May 2026

Show credential

Google Data Analytics Specialization

Coursera | November 2023

Show credential

CCNA: Enterprise Networking, Security, and Automation

Cisco | August 2022

Show credential

CCNA: Switching, Routing, and Wireless Essentials

Cisco | January 2022

Show credential

CCNA: Introduction to Networks

Cisco | March 2021

Show credential