Scénario de test & Cas d'usage
Management of Large Language Models (LLM) and NLP.
Discover all actions of langModelCreation of a reference table with ground truth medical notes and a hypothesis table with simulated model outputs containing typical transcription errors (substitutions and deletions).
| 1 | |
| 2 | DATA mycas.med_ref; |
| 3 | LENGTH audio_id $15 content $200; |
| 4 | INPUT audio_id $ content &; |
| 5 | DATALINES; |
| 6 | REC001 Patient exhibits signs of acute bronchitis REC002 Prescribed 50mg of Atenolol daily REC003 No history of cardiovascular disease ; |
| 7 | |
| 8 | RUN; |
| 9 | |
| 10 | DATA mycas.med_hyp; |
| 11 | LENGTH pred_id $15 pred_text $200; |
| 12 | INPUT pred_id $ pred_text &; |
| 13 | DATALINES; |
| 14 | REC001 Patient exhibits signs of acute bronchitis REC002 Prescribed 15mg of Atenolol daily REC003 No history of cardio vascular disease ; |
| 15 | |
| 16 | RUN; |
| 17 |
| 1 | |
| 2 | PROC CAS; |
| 3 | TABLE.tableInfo / TABLE='med_ref'; |
| 4 | TABLE.tableInfo / TABLE='med_hyp'; |
| 5 | |
| 6 | RUN; |
| 7 |
| 1 | |
| 2 | PROC CAS; |
| 3 | langModel.calculateErrorRate / TABLE={name='med_hyp'} reference={name='med_ref'} tableId='pred_id' tableText='pred_text' referenceId='audio_id' referenceText='content'; |
| 4 | |
| 5 | RUN; |
| 6 |
The action should successfully map the columns despite different names. It must return a CAS result table showing a low error rate for REC001 (perfect match), a substitution error for REC002 (50mg vs 15mg), and potentially an insertion/substitution error for REC003 (cardiovascular vs cardio vascular).