Hide metadata

dc.date.accessioned2023-01-03T17:22:16Z
dc.date.available2023-01-03T17:22:16Z
dc.date.created2022-12-06T20:25:59Z
dc.date.issued2022
dc.identifier.urihttp://hdl.handle.net/10852/98445
dc.description.abstractThis paper aims to propose an efficient machine learning framework for maritime big data and use it to train a random forest model to estimate ships’ propulsion power based on ship operation data. The comprehensive data include dynamic operations, ship characteristics and environment. The details of data processing, model configuration, training and performance benchmarking will be introduced. Both scikit-learn and Spark MLlib were used in the process to find the best configuration of hyperparameters. With this combination, the search and training are much more efficient and can be executed on latest cloud-based solutions. The result shows random forest is a feasible and robust method for ship propulsion power prediction on large datasets. The best performing model achieved a R2 score of 0.9238.
dc.languageEN
dc.publisherIEEE conference proceedings
dc.titleData-Driven Prediction of Ship Propulsion Power Using Spark Parallel Random Forest on Comprehensive Ship Operation Data
dc.title.alternativeENEngelskEnglishData-Driven Prediction of Ship Propulsion Power Using Spark Parallel Random Forest on Comprehensive Ship Operation Data
dc.typeChapter
dc.creator.authorLiang, Qin
dc.creator.authorVanem, Erik
dc.creator.authorKnutsen, Knut Erik
dc.creator.authorZhang, Houxiang
cristin.unitcode185,15,13,25
cristin.unitnameStatistikk og Data Science
cristin.ispublishedtrue
cristin.fulltextpostprint
cristin.qualitycode1
dc.identifier.cristin2090070
dc.identifier.startpage303
dc.identifier.endpage308
dc.identifier.doihttps://doi.org/10.1109/ICCA54724.2022.9831854
dc.type.documentBokkapittel
dc.type.peerreviewedPeer reviewed
dc.type.versionAcceptedVersion
cristin.btitle2022 IEEE 17th International Conference on Control & Automation (ICCA)


Files in this item

Appears in the following Collection

Hide metadata