技能 cellxgene-census

🧬

cellxgene-census

Name: cellxgene-census
Author: davila7

安全 ⚙️ 外部命令🌐 網路存取

查詢 6100 萬+ 單細胞基因組數據集

也可從以下取得: K-Dense-AI

研究人員需要高效存取群體規模的單細胞數據以進行基因組學研究。此技能提供 AI 驅動的 CZ CELLxGENE Census 存取功能,具備優化的查詢模式和下游分析整合指南。

支援: Claude Code(CC)

📊 69 充足

下載技能 ZIP

在 Claude 中上傳

前往設定 → 功能 → 技能 → 上傳技能

開啟並開始使用

測試它

正在使用「cellxgene-census」。尋找肺組織中的所有免疫細胞類型及其數量

預期結果:

B cell: 45,230 cells across 3 datasets
T cell: 67,890 cells across 5 datasets
Macrophage: 23,450 cells across 4 datasets
Dendritic cell: 12,100 cells across 2 datasets
NK cell: 18,760 cells across 3 datasets

正在使用「cellxgene-census」。展示如何查詢 COVID-19 患者 T 細胞中的標記基因

預期結果:

Use get_anndata with obs_value_filter for disease and cell type
Filter by feature_name using var_value_filter for gene selection
Include is_primary_data == True to avoid duplicate cells
Retrieve cell_type, tissue_general, and donor_id metadata columns

正在使用「cellxgene-census」。如何將 Census 數據與 scanpy 一起用於降維

預期結果:

Load data with get_anndata using appropriate filters
Apply scanpy normalization: sc.pp.normalize_total
Run log transformation: sc.pp.log1p
Compute PCA: sc.pp.pca
Generate UMAP: sc.tl.umap and sc.pl.umap

安全審計

安全

v5 • 1/17/2026

Pure documentation skill containing only markdown files with Python code examples. Static scanner flagged documentation patterns (code block syntax, text strings) as security issues due to misinterpretation. All findings are false positives. No executable code, network calls, file system access, or environment variable access exists.

已掃描檔案

1,235

分析行數

發現項

審計總數

風險因素

⚙️ 外部命令 (200)

🌐 網路存取 (1)

skill-report.json:6

審計者: claude 查看審計歷史 →

品質評分

架構

100

可維護性

內容

社群

100

安全

規範符合性

你能建構什麼

探索組織中的細胞類型

使用元數據篩選器和聚合函數查詢組織間的細胞類型分布

建立細胞類型分類器

使用 PyTorch 整合在 Census 數據上訓練機器學習模型以進行生物標記發現

跨數據集分析

使用 scanpy 工作流程整合多個數據集進行群體規模研究

試試這些提示

探索可用數據

Show me the unique cell types in the brain tissue from the Census. Use the cellxgene-census skill to query metadata with is_primary_data == True filter.

查詢基因表達

Query expression data for CD4, CD8A, and CD19 genes in T cells and B cells from lung tissue. Use cellxgene-census to retrieve AnnData objects.

訓練機器學習模型

Create a PyTorch dataloader using cellxgene-census experimental ml module to train a cell type classifier on liver cell data with 80-20 train-test split.

大規模分析

Show me how to use axis_query with out-of-core processing to iterate through brain cell expression data in chunks for memory-efficient analysis.

最佳實務

始終包含 is_primary_data == True 篩選器以避免跨數據集重複計算細胞
在生產工作流程中明確指定 census_version 以確保分析可重現
使用上下文管理器(with 語句)在開啟 Census 時自動清理資源

避免

在沒有篩選器的情況下載入整個 Census 會導致記憶體溢出
忽略數據集存在矩陣會導致基因數據遺失
使用自由文字篩選器而非本體術語會降低查詢一致性

常見問題

支援哪些物種?

Census 僅包含 Homo sapiens(人類)和 Mus musculus(小鼠)物種的數據。

我一次可以查詢多少個細胞?

10 萬個細胞以下的小型查詢可在記憶體中運作。較大的查詢需要使用 axis_query 進行核心外處理。

我可以與 scanpy 一起使用嗎?

可以。此技能提供將 Census 數據直接載入 AnnData 物件以用於 scanpy 工作流程的整合模式。

我的數據安全嗎?

此技能僅讀取公開的 Census 數據。不會傳輸任何使用者數據。該函式庫連接至 CZ CELLxGENE 數據儲存庫。

為什麼某些基因遺失?

基因可能在 Census 建構期間被篩選,或並非在所有數據集中都有測量。使用存在矩陣檢查覆蓋範圍。

這與直接下載數據集相比如何?

Census 提供具有統一元數據的標準化、版本化存取。查詢在伺服器端篩選數據,減少下載需求。

開發者詳情

作者

davila7

授權

MIT

儲存庫

https://github.com/davila7/claude-code-templates/tree/main/cli-tool/components/skills/scientific/cellxgene-census

引用

main

檔案結構

📁 references/

📄 census_schema.md

📄 common_patterns.md

📄 SKILL.md