Function clean_feature
clean_feature.Rd
The clean_feature functions quickly converts all of the feature names (variables) of your code list into feature names that are compatible with Lucid functions.
Arguments
- code_list
user defined table with 1 mandatory column:
feature
- the feature the exact code will roll up to and name of the column in the output dataframe (letters, numbers, underscores only, no spaces, not case sensitive)optional columns
code
- parent or child codecode_system
- ICD-10-CM, RxNorm, etcqualifier_num
- numeric qualifierqualifier_text
- text qualifier
Value
a spark dataframe that contains all of the up to 5 columns allowed to be used in the code_list input, plus a new column called feature and the column the user calls feature in the input table is returned as feature_old
any spaces in any of the values of the feature column are replaced with an underscore (_) in the feature column
all letters of the feature column are forced to lowercase in the feature column
strip anything in the feature column that is not a number or letter
trim all leading & trailing spaces
Details
:
code list| feature | code | code_system |
|-----------------|----------|-------------|
| Lung Transplant | T86.81 | ICD-10-CM |
| Lung Transplant | Z48.24 | ICD-10-CM |
| Lung Transplant | Z94.2 | ICD-10-CM |
| Lung Transplant | T86.81 | ICD-10-CM |
| Lung Transplant | 1006036 | CPT |
| Lung Transplant | 88039007 | CPT |
= tnx.clean_feature(code_list)
example example.head()
| feature | feature_old | code | code_system |
|-----------------|-----------------|----------|-------------|
| lung_transplant | Lung Transplant | T86.81 | ICD-10-CM |
| lung_transplant | Lung Transplant | Z48.24 | ICD-10-CM |
| lung_transplant | Lung Transplant | Z94.2 | ICD-10-CM |
| lung_transplant | Lung Transplant | T86.81 | ICD-10-CM |
| lung_transplant | Lung Transplant | 1006036 | CPT |
| lung_transplant | Lung Transplant | 88039007 | CPT |