...
首页> 外文期刊>International Journal of Population Data Science >Application of natural language processing methods to extract coded data from administrative data held in the Scottish Prescribing Information System
【24h】

Application of natural language processing methods to extract coded data from administrative data held in the Scottish Prescribing Information System

机译:应用自然语言处理方法从苏格兰处方信息系统中保存的管理数据中提取编码数据

获取原文
           

摘要

ABSTRACT ObjectivesThe Prescribing Information System (PIS) datamart, hosted by NHS National Services Scotland receives around 90 million electronic prescription messages per year from GP practices across Scotland. Prescription messages contain information including drug name, quantity and strength stored as coded, machine readable, data while prescription dose instructions are unstructured free text and difficult to interpret and analyse in volume. The aim, using Natural Language Processing (NLP), was to extract drug dose amount, unit and frequency metadata from freely typed text in dose instructions to support calculating the intended number of days’ treatment. This then allows comparison with actual prescription frequency, treatment adherence and the impact upon prescribing safety and effectiveness. ApproachAn NLP algorithm was developed using the Ciao implementation of Prolog to extract dose amount, unit and frequency metadata from dose instructions held in the PIS datamart for drugs used in the treatment of gastrointestinal, cardiovascular and respiratory disease. Accuracy estimates were obtained by randomly sampling 0.1% of the distinct dose instructions from source records, comparing these with metadata extracted by the algorithm and an iterative approach was used to modify the algorithm to increase accuracy and coverage. ResultsThe NLP algorithm was applied to 39,943,465 prescription instructions issued in 2014, consisting of 575,340 distinct dose instructions. For drugs used in the gastrointestinal, cardiovascular and respiratory systems (i.e. chapters 1, 2 and 3 of the British National Formulary (BNF)) the NLP algorithm successfully extracted drug dose amount, unit and frequency metadata from 95.1%, 98.5% and 97.4% of prescriptions respectively. However, instructions containing terms such as ‘as directed’ or ‘as required’ reduce the usability of the metadata by making it difficult to calculate the total dose intended for a specific time period as 7.9%, 0.9% and 27.9% of dose instructions contained terms meaning ‘as required’ while 3.2%, 3.7% and 4.0% contained terms meaning ‘as directed’, for drugs used in BNF chapters 1, 2 and 3 respectively. ConclusionThe NLP algorithm developed can extract dose, unit and frequency metadata from text found in prescriptions issued to treat a wide range of conditions and this information may be used to support calculating treatment durations, medicines adherence and cumulative drug exposure. The presence of terms such as ‘as required’ and ‘as directed’ has a negative impact on the usability of the metadata and further work is required to determine the level of impact this has on calculating treatment durations and cumulative drug exposure.
机译:摘要目标由NHS苏格兰国民服务局(National Services Scotland)托管的处方信息系统(PIS)数据集市每年从苏格兰各地的GP诊所接收约9000万电子处方信息。处方消息包含以下信息:药物名称,以编码形式存储的数量和强度,机器可读数据,数据,而处方剂量说明则是非结构化的自由文本,并且难以批量解释和分析。使用自然语言处理(NLP)的目的是从剂量说明中自由键入的文本中提取药物剂量,单位和频率元数据,以支持计算预期的治疗天数。然后可以与实际处方频率,治疗依从性以及对处方安全性和有效性的影响进行比较。方法使用Prolog的Ciao实施开发了一种NLP算法,以从PIS数据市场中保存的用于胃肠道,心血管和呼吸道疾病的药物的剂量指令中提取剂量,单位和频率元数据。通过从源记录中随机抽取0.1%的不同剂量指令,并将其与算法提取的元数据进行比较,来获得准确度估算值,并使用迭代方法修改算法以提高准确性和覆盖范围。结果NLP算法应用于2014年发布的39,943,465处方药指令,包括575,340不同的剂量指令。对于用于胃肠,心血管和呼吸系统(即英国国家配方(BNF)第1、2和3章)的药物,NLP算法成功地从95.1%,98.5%和97.4%提取了药物剂量,单位和频率元数据的处方。但是,包含诸如“按指示”或“按要求”之类的指令的指令会导致难以计算特定时间段内的总剂量,因为所含剂量指令的7.9%,0.9%和27.9%会降低元数据的可用性。对于BNF第1、2和3章中使用的药物,分别表示“按需”的术语和3.2%,3.7%和4.0%的术语“按指示的”。结论所开发的NLP算法可以从用于治疗多种疾病的处方药中提取剂量,单位和频率元数据,该信息可用于支持计算治疗时间,药物依从性和累积药物暴露。 “按要求”和“按指示”等术语的存在会对元数据的可用性产生负面影响,需要进一步的工作来确定其对计算治疗持续时间和累积药物暴露的影响程度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号