AI Commons Health & Wellbeing Hackathon Solutions
Overview
The AI Commons Project is a proof of concept of a new methodology of developing Artificial Intelligence solutions that allows anyone, anywhere to benefit from the possibilities that AI can provide. The project aims to increase/improve the accessibility, reproducibility, contextualization and enhancement of Artificial Intelligence solutions globally and especially in emerging markets.
The project aims to demonstrate how a global community of AI experts can learn and co-create mutually beneficial solutions with the opportunity for cross-county incremental enhancement.
Pneumonia Classification
Statement of Purpose
Introduction
Data Science Nigeria
Rising Odegua (Data Science Nigeria)
Nelson Ogbeide; Ojeabulu O. Gift; Comfort Igboko; Caleb Emelike; Precious Cadeton
Problem Definition
Pneumonia is an infection that affects one or both lungs and it is known to cause premature death around the world. World Health Organization(WHO) estimated that about 4 million death occur annually from air pollution diseases including pneumonia
Research has shown that children and people older than 65 are the most vulnerable to the disease and continents with a high number of adult and children ratio like Africa and Asia are mostly affected.
A doctor examines the chest x ray of a patient to determine if the patient has pneumonia
Yes
Malaria detection using machine learning
Cancer detection using machine learning
Solution
The solution is a pneumonia detection model. It helps health practitioners in hospitals, clinics and laboratories to quickly detect whether a patient has pneumonia, given the patient’s chest X-ray. The first and only release ofthe solution was in 2019.
See paper Here
Health practitioners
Patients
A data scientist/ machine learning engineer is needed to build the prediction model and a domain expert (such as a thoracic surgeon) is needed to provide technical details on how the diseases work and are being identified
Small amount of training data
More data should be used for training and evaluation
No. We hope to keep it updated whenever the solution is retrained.
Usage
A simple use case: When a patient shows symptoms of pneumonia, her chest xray is fed into the model to predict in seconds the likelihood of the patient having pneumonia. With this, doctors can help recommend drugs that could help treat the disease at early stage.
Health practitioners
The chest x ray image which is the input is uploaded into the model by a medical practitioner and the prediction of whether the patient has pneumonia or not is displayed on the screen.
Domain and Applications
Dataset
Composition
Collection Process
Preprocessing/Cleaning/Labelling
Uses
Maintenance
Dataset Publicly Available
Chest X-Ray Images (Pneumonia)
The dataset contains 5,863 images in 2 categories namely Normal(Chest x ray) and infected (Chest x ray). The dataset is hosted on kaggle and maintained by the owner Paul Mooney and it does not have a datasheet.
Link to dataset: Click Here
Model
Model Details
Model date: 2019. The model was built using keras, a neural network framework in python for deep learning applications. The type of neural network is a 7-level convolutional neural network.
Basic image resize was performed on the data
Evaluation
Testing the Solution
The dataset was split in the ratio 80:10:10 (train, validation, and test dataset). The performance metric used is mean average precision (mAP), it is a very important metric to use when building an object detection model.
Testing by Third Party
Result
Result Details
The model was able to predict 95% of the test data correctly and 5% wrongly. The model accuracy, precision, recall, f1_score are 94%, 93%, 100% and 96% respectively.
40 epochs in 217 steps
The measure of statistics utlized is the classification report available in scikit-learn (python library) which showed the precision, recall , accuracy and f1_score of the model
On training the model, there was an error drop from 100% to 7%.
The average runtime is about 1-2s to predicting each sample result.
Environment
Operating system is linux, programming language is python3.7, keras version 1.5
Yes. Django, a python framework was used
Steps to Reduce the Solution
Result Details
Safety
General
The possible sources of bias/ unfairness weren’t analysed.
Explainability
The solution output are not easily explainable
Concept Drift
Security
It is not suitable for any solution that is not related to pneumonia.
The solution does not collect user data.