
Big Data drives big results
Written by Dr. Allan M. Zarembski
UP tie cranes position new ties in Post Falls, Idaho. Bruce Kelly photo.
Railway Age, February 2019: As railroads continue to expand their data collection technologies across all of their operational areas, they simultaneously continue to expand their ability to analyze this data and convert the data into actionable information—in other words, to generate information that can be directly used in their operation or maintenance activities.
This was clearly evident at the 2018 Big Data in Railroad Maintenance Planning Conference, held at the University of Delaware Dec. 13-14. With more than 250 registrants for this year’s conference, railroads, suppliers, data scientists and university researchers came together to talk about what is being done in this exciting new arena of Data Analytics or “Big Data.” Furthermore, it was clearly apparent that the focus was no longer on what is needed or should be done, but rather what is actually being developed and implemented.
Equally impressive was the presence of railroad and supplier attendees with titles such as Chief Data Scientist, Manager of Advanced Analytics, Data Analyst, etc. Clearly, the railway industry has taken note of the need for this type of advanced analytics approach, which can extract value from data. This is a very practical part of the newly emerging field of Data Science.
Data Science is an interdisciplinary field using evolving analysis tools and techniques to extract knowledge or insights from data in various forms, either structured or unstructured [1]. Data Science has all of the characteristics needed by railway engineering and maintenance personnel to address and handle the enormous amount of data generated by the various technology platforms currently in place [2]. This includes all of the current and emerging data collection systems being used by railways to help monitor infrastructure and equipment condition, optimize and plan maintenance and improve safety, as well as the predictive analytics tools being developed within the field of Data Science and now being implemented in the railway industry. This is illustrated in Figure 1, which shows current and future data acquisition systems integrated with enhanced data analytics and decision support tools with a goal of “lean” and “effective” maintenance, i.e. effective maintenance at a minimum cost [3].

Figure 1. Integration of Data Acquisition Systems and Data Analytics in Maintenance Actions [3].

Figure 2: Reduction in Track Caused Derailments as a Function of Increased Track Inspection [4].

Figure 3: Application of Latent Semantic Analysis (LSA) for Predictive Modeling of Railway Data [8].

Figure 4: Risk Model for Development of Recurrent Rail Defects [9].
On the track side, use of data analytics addressed all aspects of track maintenance and safety, ranging from rail wear prediction, broken rail safety, tie design and inspection and prediction of track geometry degradation and associated risk of derailments. One presentation discussed a model for calculating the probability of a track geometry caused derailment as a function of a Geometry Condition Indicator (GCI) and distance from the geometry condition [5]. Prediction of rail failure to include both fatigue and wear was a recurring focus, with one class one railroad developing a rail wear modeling tool that has since been incorporated into their capital planning process. Figure 6 illustrates another risk model, focusing on the risk of developing recurrent rail defects and rail service defects [9]. Several FRA sponsored activities [10] focusing on broken rail risk included:
• Development of Artificial Intelligence Aided Track Risk Analysis (AI-Track Risk) model focused on rail failures.
• Development of an integrated broken-rail derailment risk analysis and simulation framework that included development of a Bayesian analytical framework for predicting the probability of broken rails; prediction of derailment consequence using multivariate data analyses; and evaluation of segment-specific risk and assessment of the impacts of various track risk management strategies.

Figure 5: Overlay of Multiple Rail Wear Measurements before and after Alignment [11].
- Correcting Position Errors in overlay of multiple measurement runs to include rail profile [11, 12] (Figure 4) and track geometry [13].
- Correction of position errors such as Absolute Position Error (APE), and Relative Position Error (RPE), associated with measurement systems, wheel slip and adhesion (Figure 5) [14].
- Correction of position errors due to the use of different measurement systems or vehicles, where measurement systems or sensors are located in different locations within the vehicle (Channel-inside Position Offset (CPO), Figure 6) [14].
- Addressing “abnormal” data or data exceptions (Figure 5) [11, 14].
Likewise on the mechanical side, use of data analytics for both passenger and freight equipment was discussed. The use of data in Condition Based Maintenance (CBM) of rolling stock is illustrated in Figure 6 [15]. Using both onboard data and data from wayside train scanners, maintenance can be performed at a number of levels ranging from threshold alerted conditions (reactive maintenance) to rules based maintenance to predictive maintenance using trend analysis and forecasting models with inputs from multiple data sources.

Figure 6: Methodology for Correction of Position Errors Due to Multiple Causes [14].

Figure 7: Use of data in Condition Based Maintenance (CBM) of Rolling Stock [15].

Figure 8: Using Data Analytics for Real Time Train Delay Forecasting [16].

Figure 9: Use of data to Move from Reactive to Prescriptive Maintenance [17].
The 2019 Big Data in Railroad Maintenance Planning conference will be held Dec. 11-12, 2019, at the University of Delaware’s Newark, Del., campus. Contact Professor Allan M. Zarembski at [email protected].
REFERENCES
- Zarembski, A. M., “The Emerging Role of Data Science in Railroad Maintenance Management,” Railway Age, May 2018.
- Attoh-Okine, N., Big Data and Differential Privacy: Analysis Strategies for Railway Track, Wiley, May 2017.
- Tegelberg, Erland, “Effective Asset Management and Exciting New Big Data Sources,” Managing Consultant, Strukton Rail North America, 2018 Big Data in Railroad Maintenance Planning Conference.
- Messner, M., “BNSF Geometry Tag Prioritization,” Assistant Director of Roadway Planning,” BNSF, 2018 Big Data Conf.
- Smart K. and Einbinder D., “Utilizing Bayesian Inference and Machine Learning to Identify Risks to Railroads,” ENSCO, Inc., 2018 Big Data Conf.
- Stewart, L. and Pagliuco, S., “An Artificial Intelligence Approach to Aligning Historical Railroad Data,” GREX, 2018 Big Data Conf.
- Attoh-Okine, N., “The Future of Blockchain Technology in Railroad Track Engineering,” University of Delaware, 2018 Big Data Conf.
- Williams, T. and Betak, J., “Using Text and Data Analytics to Study Railroad Operations,” Collaborative Solutions, LLC, 2018 Big Data Conf.
- He, Q., “Data-Driven Rail Defect Deterioration Modeling for Responsive Maintenance,” University of Buffalo, 2018 Big Data Conf.
- Baillargeon, J, “Update on FRA’s Predictive Analytics Research,” Program Manager, FRA, 2018 Big Data Conf.
- Palese, J “Application of Data Analytics to Rail Wear Forecasting,” Senior Scientist, University of Delaware, 2018 Big Data Conf.
- Rice, J. S. and Amouie, M, “Norfolk Southern’s Rail Wear Prediction Using Artificial Intelligence and Machine Learning,” 2018 Big Data Conf.
- Rome, J., “Developing Geometry Data Alignment for Amtrak,” Navigation Innovations, Inc., 2018 Big Data Conf.
- Wang, Y., “Position Synchronization for Track Geometry Inspection Data via Big-Data Fusion and Incremental Learning,” Southwest Jiaotong University, China, 2018 Big Data Conf.
- Flix, N., “Acquisition, processing and Storage of Rolling Stock CBM Data,” Alstom, 2018 Big Data Conf.
- Karnik, A., “Evolution of Operational Analysis Using Discrete Data Streams and Big Data Approach: Case study: Prediction of Train Arrival Times,” Volanno, Inc. , 2018 Big Data Conf.
- Thompson, T., “Utilizing Artificial intelligence to Increase Rolling Stock Maintenance Efficiency,” Uptake, 2018 Big Data Conf.
- Bellias, M., “The Evolution of Maintenance,” https://www.ibm.com/blogs/internet-of-things/maintenance-evolution-prescriptive/.