Commit bf2ed17

Merge pull request #5962 from bebatut/genomics-gai-llm-with-python
Large Langage Model for Genomics using Python
2 parents ccf27f1 + 094ab20 commit bf2ed17

17 files changed, +3439 -71 lines changed

bin/check-url-persistence.sh

+3
@@ -25,6 +25,9 @@ cat /tmp/20*.txt | sort -u | \
  grep -v '/transcriptomics/tutorials/ref-based/faqs/rnaseq_data.html' | \
  grep -v '/topics/data-management/' | \
  grep -v 'training-material/tags/' | grep -v 'data-library'| grep -v '/recordings/index.html' |\
+ grep -v '/topics/statistics/tutorials/gai-llm-with-python/slides.html' | \
+ grep -v '/topics/statistics/tutorials/gai-llm-with-python/slides-plain.html' | \
+ grep -v '/topics/statistics/tutorials/gai-llm-with-python/tutorial.html' | \
  sed 's|/$|/index.html|' | grep '.html$' | sort -u | sed 's|https://training.galaxyproject.org|_site|' > /tmp/gtn-files.txt

  count=0
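
For readers less familiar with the shell pipeline, the filtering stage in this hunk can be sketched in Python. This is only an illustration of the steps visible in the diff (exclude, append `index.html`, keep `.html` pages, deduplicate, map the host to `_site`). The function name `expected_site_files` and the `EXCLUDED` list are hypothetical, only the three newly added patterns are included, and any steps of the script outside this hunk are not modeled.

```python
import re

# Hypothetical helper mirroring the pipeline shown in the hunk above.
# Only the three patterns added by this commit are listed here.
EXCLUDED = [
    "/topics/statistics/tutorials/gai-llm-with-python/slides.html",
    "/topics/statistics/tutorials/gai-llm-with-python/slides-plain.html",
    "/topics/statistics/tutorials/gai-llm-with-python/tutorial.html",
]
HOST = "https://training.galaxyproject.org"


def expected_site_files(urls, excluded=EXCLUDED):
    kept = set()
    for url in urls:
        # grep -v PATTERN: drop lines matching any excluded pattern
        # (patterns are regexes, so "." matches any character, as in grep).
        if any(re.search(pattern, url) for pattern in excluded):
            continue
        # sed 's|/$|/index.html|': directory URLs point at their index page.
        url = re.sub(r"/$", "/index.html", url)
        # grep '.html$': keep only lines ending in any character plus "html".
        if re.search(r".html$", url):
            kept.add(url)
    # sort -u, then sed 's|https://training.galaxyproject.org|_site|'
    # (sed without the g flag replaces the first occurrence only).
    return [url.replace(HOST, "_site", 1) for url in sorted(kept)]
```

With this sketch, a previously recorded URL for the deleted `gai-llm-with-python` tutorial is skipped instead of being reported as a missing file, which appears to be the purpose of the three added `grep -v` lines.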

learning-pathways/ml-using-python.md

+15-3
@@ -46,11 +46,23 @@ pathway:
  name: deep-learning-without-gai-with-python
  topic: statistics
  -
- section: "Module 4: Generative Artificial Intelligence and Large Langage Model"
- description: Generative Artificial Intelligence and Large Langage Model using Python
+ section: "Module 4: Generative Artificial Intelligence and Large Langage Model for Genomics using Python"
+ description: This tutorial series provides a comprehensive guide to leveraging large language models for genomics, covering pretraining, fine-tuning, mutation impact prediction, sequence generation, and optimization.
  tutorials:
  -
- name: gai-llm-with-python
+ name: genomic-llm-pretraining
+ topic: statistics
+ -
+ name: genomic-llm-finetuning
+ topic: statistics
+ -
+ name: genomic-llm-zeroshot-prediction
+ topic: statistics
+ -
+ name: genomic-llm-sequence-generation
+ topic: statistics
+ -
+ name: genomic-llm-sequence-optimization
  topic: statistics
  -
  section: "Module 5: Regulations/standards for AI using DOME"
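
As a rough illustration of what the new Module 4 entries refer to, the sketch below maps each `name`/`topic` pair to a tutorial source path. It assumes the `topics/<topic>/tutorials/<name>/tutorial.md` layout seen for the deleted `gai-llm-with-python` tutorial in this commit; whether every new tutorial ships a `tutorial.md` is not confirmed by the visible part of the diff. The names `MODULE_4_TUTORIALS` and `tutorial_paths` are hypothetical.

```python
# Entries copied from the updated Module 4 section of the learning pathway.
MODULE_4_TUTORIALS = [
    {"name": "genomic-llm-pretraining", "topic": "statistics"},
    {"name": "genomic-llm-finetuning", "topic": "statistics"},
    {"name": "genomic-llm-zeroshot-prediction", "topic": "statistics"},
    {"name": "genomic-llm-sequence-generation", "topic": "statistics"},
    {"name": "genomic-llm-sequence-optimization", "topic": "statistics"},
]


def tutorial_paths(tutorials):
    """Return the assumed source path of each tutorial entry.

    Assumption: tutorials live at topics/<topic>/tutorials/<name>/tutorial.md,
    following the path of the tutorial deleted in this commit.
    """
    names = [entry["name"] for entry in tutorials]
    if len(names) != len(set(names)):
        raise ValueError("duplicate tutorial name in pathway section")
    return [
        f"topics/{entry['topic']}/tutorials/{entry['name']}/tutorial.md"
        for entry in tutorials
    ]
```

Calling `tutorial_paths(MODULE_4_TUTORIALS)` gives five paths under `topics/statistics/tutorials/`, one per tutorial that replaces the single `gai-llm-with-python` entry.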

topics/statistics/metadata.yaml

+16-5
@@ -7,12 +7,23 @@ summary: "Statistical Analyses for omics data and machine learning using Galaxy
  docker_image: "quay.io/galaxy/statistics-training"
  edam_ontology: ["topic_2269"]

- requirements:
- -
- type: "internal"
- topic_name: introduction
+ #requirements:
+ # -
+ # type: "internal"
+ # topic_name: introduction

  editorial_board:
  - marziacremona
  - cumbof
- - anuprulez
+ - anuprulez
+
+ subtopics:
+ # - id: machine-learning
+ # title: "Machine Learning"
+ # description: "Tutorials introducing fundamental concepts and techniques, guiding learners through data preprocessing, model training, evaluation, and application."
+ # - id: deep-learning
+ # title: "Deep Learning"
+ # description: "In-depth guidance on designing and implementing neural networks, covering topics such as architecture design, training techniques, and applications"
+ - id: gai-llm
+ title: "Generative Artificial Intelligence and Large Langage Model"
+ description: "Tutorials covering the creation, pretraining, and applications of Generative Artificial Intelligence and Large Language Models."

topics/statistics/tutorials/gai-llm-with-python/slides.html

-24
This file was deleted.

topics/statistics/tutorials/gai-llm-with-python/tutorial.md

-39
This file was deleted.
Loading
@@ -0,0 +1,9 @@
+ @misc{zhou2024dnabert2efficientfoundationmodel,
+ title={DNABERT-2: Efficient Foundation Model and Benchmark For Multi-Species Genome},
+ author={Zhihan Zhou and Yanrong Ji and Weijian Li and Pratik Dutta and Ramana Davuluri and Han Liu},
+ year={2024},
+ eprint={2306.15006},
+ archivePrefix={arXiv},
+ primaryClass={q-bio.GN},
+ url={https://arxiv.org/abs/2306.15006},
+ }
