LiRA Road Prediction
Project Overview
Top Summary: A collaborative research and engineering initiative (Live Road Assessment) focused on developing a cost-effective, data-driven system for monitoring road infrastructure using IoT data from moving vehicles.
| Quick Stats | Details |
|---|---|
| Role | Software Engineer / Data Researcher |
| Duration | Oct 2022 - Apr 2023 |
| Stack/Tools | Python, C#, RabbitMQ, Google Cloud Platform (GCP), Bayesian Networks, CI/CD |
| Link | LiRA Project Website |
1. The Challenge
Traditional road maintenance relies on expensive, specialized laser-scanning vehicles that are deployed infrequently, leading to outdated infrastructure data and reactive rather than proactive repairs.
- Problem: The project needed a way to ingest, process, and analyze massive streams of “raw” data from ordinary passenger vehicles to detect road degradation in real-time.
- Goal: To build a scalable backend capable of big-data ingestion and to develop predictive models that could accurately translate vehicle vibrations and sensor data into actionable road quality metrics.
2. My Solution & Technical Approach
I bridged the gap between data science and distributed systems by developing both the predictive logic and the infrastructure required to run it at scale.
- Architecture: Built a series of scalable C# microservices utilizing RabbitMQ for message queuing. This decoupled the data ingestion from the processing layers, ensuring the system could handle “bursty” data uploads from vehicle fleets.
- Key Decisions: Implemented Bayesian Belief Networks for road condition prediction. This probabilistic approach allowed the system to account for uncertainty in sensor data (like vehicle type or speed variations), significantly improving the accuracy of live assessments in prototype deployments.
- Cloud Infrastructure: Leveraged Google Cloud Platform to host the services, implementing automated CI/CD pipelines to ensure rapid deployment and high availability of the data processing backbone.
3. Implementation & Overcoming Obstacles
- Hardest Challenge: Handling the sheer volume and “noise” of big-data ingestion while ensuring the feature engineering process remained performant enough for real-time applications.
- Resolution: I optimized the data pipeline by implementing advanced feature engineering in Python to pre-process raw signals before they reached the Bayesian models. By automating the deployment through CI/CD, I reduced the overhead of managing complex microservice dependencies on GCP.
4. Results & Conclusion
- Measurable Impact: Successfully developed a prototype deployment that demonstrated improved live assessment accuracy. The scalable architecture now allows the LiRA project to process data from diverse vehicle sources, moving the concept of “Data-Driven Road Asset Management” from theory to functional software.
- Lessons Learned: This project emphasized the importance of designing for scalability from day one. It required a deep understanding of how to translate complex mathematical models (Bayesian Networks) into production-ready code that survives the rigors of big-data cloud environments.