数据集:
imodels/diabetes-readmission
Port of the diabetes-readmission dataset from UCI (link here ). See details there and use carefully.
Basic preprocessing done by the imodels team in this notebook .
The target is the binary outcome readmitted .
Load the data:
from datasets import load_dataset
dataset = load_dataset("imodels/diabetes-readmission")
df = pd.DataFrame(dataset['train'])
X = df.drop(columns=['readmitted'])
y = df['readmitted'].values
Fit a model:
import imodels import numpy as np m = imodels.FIGSClassifier(max_rules=5) m.fit(X, y) print(m)
Evaluate:
df_test = pd.DataFrame(dataset['test'])
X_test = df.drop(columns=['readmitted'])
y_test = df['readmitted'].values
print('accuracy', np.mean(m.predict(X_test) == y_test))