化學資訊學

化學資訊學是一門交叉學科，研究如何使用電腦與資訊科技來表示、儲存、檢索、分析和預測化學資料與分子性質。它結合了化學、電腦科學、數學與資訊學的原理，旨在從大量化學結構與實驗資料中提取有用資訊，以支援化學研究與藥物發現等應用。

歷史

術語化學資訊學（Cheminformatics）是1998年被F.K. 布朗^[1]^[2]定義的：

化學資訊學是這些資訊資源的混合，將資料轉化為資訊和資訊轉化為知識，以便在藥物導向辨識和最佳化領域更快地做出更好的決策。

英文中，Cheminformatics和Chemoinformatics兩種拼寫均被使用，而使用更多的則是較短的變體——Cheminformatics^[3]。最近成立的《化學資訊學學報（英語：Journal of Cheminformatics）》是前者的強力推動。

基礎

化學資訊學將化學，電腦科學和資訊學的科學工作領域結合在化學空間（英語：Chemical space）中的拓撲（英語：Topology_(chemistry)），化學圖論，資訊檢索和資料探勘領域^[4]^[5]^[6]^[7]。化學資訊學也可以應用於各種行業，如紙張和紙漿，染料和相關行業的資料分析。

應用

儲存和檢索

化學資訊學的主要應用是儲存，索引和搜尋與化合物有關的資訊。這種儲存資訊的有效搜尋包括電腦科學中作為資料探勘，資訊檢索，資訊抽取和機器學習的主題。相關研究課題包括：

非結構化資料（英語：Unstructured data）
- 資訊檢索
- 資訊抽取
結構化資料探勘（英語：Structure mining）和挖掘結構化資料
- 資料庫挖掘（英語：Structure mining）
- 圖形挖掘（英語：Structure mining）
- 分子挖掘
- 序列模式挖掘（英語：Sequential pattern mining）
- 樹挖掘（英語：Structure mining）
數字圖書館

檔案格式

化學結構的電腦表示使用專門的格式，例如基於XML的化學標記式語言或簡化分子線性輸入規範（SMILES）。這些表徵通常用於儲存在大型化學資料庫中。雖然一些格式適用於2或3維的視覺表示，但其他格式更適合於研究物理相互作用，建模和對接研究。

虛擬化合物庫（Virtual Libraries）

化學資料可以涉及真實或虛擬分子。可以以各種方式產生化合物的虛擬化合物庫以探索化學空間，並假設具有所需性質的全新的化合物。

最近使用片段最佳化生長（fragment optimized growth）演算法生成化合物類（藥物，天然產物，多樣性合成產品）的虛擬化合物庫^[8] 。這是通過使用化學資訊工具來訓練馬爾可夫鏈在真實化合物類別上的轉變概率，然後使用馬爾可夫鏈產生類似於訓練資料庫的全新化合物。

虛擬篩選（Virtual screening）

與高通量篩選相比，虛擬篩選涉及通過各種方法在化合物庫中進行計算篩選，例如蛋白質和蛋白質或多肽的對接，用於辨識可能具有所需特性（例如針對特定靶標的生物活性）的分子。在某些情況下，組合化學用於化合物庫的開發，以提高挖掘化學空間的效率。更常見的是，篩選多樣化的小分子或天然產物庫。

定量構效關係（QSAR）

這是定量結構-活性關係和定量結構性質關係值的計算，用於預測其結構中化合物的活性。在這種情況下，化學計量學也和它有很強的關係。化學專家系統也是相關的，因為它們代表化學知識的一部分，作為電腦類比表示。有一個相對較新的匹配分子對分析（英語：Matched molecular pair analysis）或預測驅動的MMPA的概念，它與QSAR模型相結合，以便辨識活動懸崖（Activity cliffs）^[9]。化學懸崖通常被定義為結構相似且對同一靶點都具有活性但藥效差異較大的化合物或化合物組。

參閱

生物資訊學
《化學資訊學學報（英語：Journal of Cheminformatics）》（Journal of Cheminformatics）
計算化學
資料分析
分子建模
製藥工程
科學視覺化
統計學

參考資料

^ F.K. Brown. Chapter 35. Chemoinformatics: What is it and How does it Impact Drug Discovery. Annual Reports in Med. Chem. Annual Reports in Medicinal Chemistry. 1998, 33: 375. ISBN 978-0-12-040533-6. doi:10.1016/S0065-7743(08)61100-8.
^ Brown, Frank. Editorial Opinion: Chemoinformatics – a ten year update. Current Opinion in Drug Discovery & Development. 2005, 8 (3): 296–302.
^ Cheminformatics or Chemoinformatics ?. [2017-07-01]. （原始內容存檔於2017-06-21）.
^ Gasteiger J.(Editor), Engel T.(Editor): Chemoinformatics : A Textbook. John Wiley & Sons, 2004, ISBN 3-527-30681-1
^ A.R. Leach, V.J. Gillet: An Introduction to Chemoinformatics. Springer, 2003, ISBN 1-4020-1347-7
^ Alexandre Varnek and Igor Baskin. Chemoinformatics as a Theoretical Chemistry Discipline. Molecular Informatics. 2011, 30 (1): 20–32. doi:10.1002/minf.201000100.
^ Barry A. Bunin (Author), Brian Siesel (Author), Guillermo Morales (Author), Jürgen Bajorath (Author): Chemoinformatics: Theory, Practice, & Products. Springer, 2006, ISBN 978-1402050008
^ Kutchukian, Peter; Lou, David; Shakhnovich, Eugene. FOG: Fragment Optimized Growth Algorithm for the de Novo Generation of Molecules occupying Druglike Chemical. Journal of Chemical Information and Modeling. 2009, 49 (7): 1630–1642. PMID 19527020. doi:10.1021/ci9000458.
^ Sushko, Yurii; Novotarskyi, Sergii; Körner, Robert; Vogt, Joachim; Abdelaziz, Ahmed; Tetko, Igor V. Prediction-driven matched molecular pairs to interpret QSARs and aid the molecular optimization process. Journal of Cheminformatics: 48. 2014-12-11 [2017-07-01]. doi:10.1186/s13321-014-0048-0. （原始內容存檔於2015-09-07）（英語）.

外部連結

（英文）印第安納化學資訊學教育入口網站
（英文）化學資訊學學報（Journal of Cheminformatics）（頁面存檔備份，存於網際網路檔案館）
（英文）OEChem化學資訊學編程工具包（頁面存檔備份，存於網際網路檔案館）

[Brown_1998-1] F.K. Brown. Chapter 35. Chemoinformatics: What is it and How does it Impact Drug Discovery. Annual Reports in Med. Chem. Annual Reports in Medicinal Chemistry. 1998, 33: 375. ISBN 978-0-12-040533-6. doi:10.1016/S0065-7743(08)61100-8.

[2] Brown, Frank. Editorial Opinion: Chemoinformatics – a ten year update. Current Opinion in Drug Discovery & Development. 2005, 8 (3): 296–302.

[3] Cheminformatics or Chemoinformatics ?. [2017-07-01]. （原始內容存檔於2017-06-21）.

[Gasteiger_2004-4] Gasteiger J.(Editor), Engel T.(Editor): Chemoinformatics : A Textbook. John Wiley & Sons, 2004, ISBN 3-527-30681-1

[5] A.R. Leach, V.J. Gillet: An Introduction to Chemoinformatics. Springer, 2003, ISBN 1-4020-1347-7

[Varnek_2011a-6] Alexandre Varnek and Igor Baskin. Chemoinformatics as a Theoretical Chemistry Discipline. Molecular Informatics. 2011, 30 (1): 20–32. doi:10.1002/minf.201000100.

[Bunin_2006-7] Barry A. Bunin (Author), Brian Siesel (Author), Guillermo Morales (Author), Jürgen Bajorath (Author): Chemoinformatics: Theory, Practice, & Products. Springer, 2006, ISBN 978-1402050008

[8] Kutchukian, Peter; Lou, David; Shakhnovich, Eugene. FOG: Fragment Optimized Growth Algorithm for the de Novo Generation of Molecules occupying Druglike Chemical. Journal of Chemical Information and Modeling. 2009, 49 (7): 1630–1642. PMID 19527020. doi:10.1021/ci9000458.

[Prediction-driven_matched_molecular_pairs_to_interpret_QSARs_and_aid_the_molecular_optimization_process-9] Sushko, Yurii; Novotarskyi, Sergii; Körner, Robert; Vogt, Joachim; Abdelaziz, Ahmed; Tetko, Igor V. Prediction-driven matched molecular pairs to interpret QSARs and aid the molecular optimization process. Journal of Cheminformatics: 48. 2014-12-11 [2017-07-01]. doi:10.1186/s13321-014-0048-0. （原始內容存檔於2015-09-07）（英語）.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

閱論編組學主題
基因組學	認知基因組學（英語：Cognitive genomics）計算基因組學（英語：Computational genomics）比較基因組學基因組計劃人類基因組計劃古多倍性（英語：Paleopolyploidy）總體基因體學人類微生物組計劃功能基因組學（英語：Functional genomics）泛基因組學個人基因組學社會基因組學（英語：Social genomics）結構基因組學
生物資訊學	生物晶片化學資訊學化學基因組學（英語：Chemogenomics）免疫組學連接組學人類連接組計劃表觀遺傳學人類外基因組計劃糖組學脂類組學代謝物組學多肽組學（英語：Peptidomics）藥物遺傳學（英語：Pharmacogenetics）藥物基因組學影像組學（英語：Radiomics）系統生物學毒理基因組學（英語：Toxicogenomics）轉錄物組學
結構生物學	蛋白質組學人類蛋白質組計劃結構藥物設計
研究工具	雙向電泳質譜法電噴霧離子法（英語：Electrospray ionization）（ESI）基質輔助雷射解吸/電離（MALDI）微流控同位素標記染色體構象擷取（3C）
機構	國立衛生研究院 (美國) 日本DNA資料庫（日本）歐洲分子生物學實驗室（歐盟）維康桑格研究所（英國）
分類