Huntington disease phase separation: mHTT low-complexity domain undergoes LLPS forming gel-like condensates that trap transcription factors — condensate-dissolving compounds could restore gene expression programs
Adversarial Debate Score
67% survival rate under critique
Model Critiques
Supporting Research Papers
- Molecular Dynamics Simulations Reveal PolyQ-Length-Dependent Conformational Changes in Huntingtin Exon-1: Implications for Environmental Co-Solvent Modulation of Aggregation-Prone States
Huntington's disease (HD) is caused by CAG-repeat expansion in HTT, which lengthens the polyglutamine (polyQ) tract in huntingtin (HTT) and promotes misfolding and aggregation. While polyQ-length-depe...
- A thermodynamic metric quantitatively predicts disordered protein partitioning and multicomponent phase behavior
Intrinsically disordered regions (IDRs) of proteins mediate sequence-specific interactions underlying diverse cellular processes, including the formation of biomolecular condensates. Although IDRs str...
- Polymer-Residue Accessibility Shapes Sequence Dependence of Critical Temperatures for Phase Separation
Biological polymers, such as intrinsically disordered proteins, play a central role in cellular biology, including mediating phase separation and controlling activity of biological condensates. The ph...
- Lemniscate phase trajectories for high-fidelity GHZ state preparation in trapped-ion chains
In trapped-ion chains, multipartite GHZ states can be prepared natively with the help of a single bichromatic laser pulse. However, higher-order terms in the expansion in the Lamb-Dicke parameter η li...
- Disentangling High Harmonic Generation from Surface and Bulk States of a Topological Insulator
The discovery of topological phases has introduced a new dimension to materials science. Three-dimensional (3D) topological insulators (TIs) are a remarkable class of matter that is insulating in the ...
Formal Verification
Z3 checks whether the hypothesis is internally consistent, not whether it is empirically true.
This discovery has a Claude-generated validation package with a full experimental design.
Precise Hypothesis
The low-complexity domain (LCD) of mutant Huntingtin protein (mHTT), specifically the polyglutamine (polyQ) expansion region (>36 repeats) combined with the N17 amphipathic helix and proline-rich domain, undergoes liquid-liquid phase separation (LLPS) in neuronal cells to form gel-like condensates with material properties distinct from liquid droplets (higher viscosity η > 10³ Pa·s, reduced fluorescence recovery in FRAP < 20% within 60 minutes). These condensates sequester critical transcription factors (SP1, TFIID, CBP/p300, NF-Y) at concentrations ≥3-fold above cytoplasmic baseline, reducing their nuclear availability by ≥40% and suppressing downstream gene expression programs (BDNF, PGC-1α, dopamine receptor genes) by ≥50%. Small molecules capable of dissolving or fluidizing these condensates (reducing viscosity by ≥50%, restoring FRAP recovery to >60% within 30 minutes) will restore transcription factor availability and rescue gene expression to ≥70% of wild-type levels in HD model neurons.
- FRAP recovery: If mHTT condensates show >60% fluorescence recovery within 30 minutes (liquid-like behavior), the gel-like condensate classification is disproven.
- Transcription factor sequestration: If immunofluorescence/proximity ligation assays show <1.5-fold enrichment of SP1, CBP, or TFIID within condensates versus cytoplasm, the sequestration mechanism is disproven.
- Gene expression rescue: If condensate-dissolving compounds restore <30% of suppressed gene expression (BDNF, PGC-1α) despite confirmed condensate dissolution, the causal link between condensates and transcriptional suppression is disproven.
- PolyQ specificity: If wild-type HTT (≤35 repeats) forms equivalent gel-like condensates under identical conditions, the polyQ-expansion-specific mechanism is disproven.
- Transcription factor availability: If nuclear fractionation shows no significant reduction (<15% decrease) in free transcription factor pools in HD versus WT neurons, the sequestration hypothesis is disproven.
- Compound mechanism: If condensate-dissolving compounds restore gene expression through off-target mechanisms (e.g., direct transcription factor activation confirmed by ChIP-seq showing no change in condensate occupancy), the condensate-dissolution mechanism is disproven.
- In vivo irrelevance: If R6/2 or zQ175 mouse models show no detectable gel-like condensates by cryo-ET or expansion microscopy in striatal neurons, the in vivo relevance is disproven.
Experimental Protocol
PHASE 1 — In Vitro Biophysical Characterization (Weeks 1–6): Purify recombinant mHTT-N-terminal fragments (exon1, Q46, Q72, Q97) with GFP/mCherry tags. Perform turbidity assays (OD600 vs. concentration), DLS (hydrodynamic radius), and microrheology (passive bead tracking) to establish phase diagrams. Conduct FRAP on condensates formed at 2 μM protein, 37°C, pH 7.4 with 10% PEG-8000. Measure half-time of recovery and mobile fraction.
PHASE 2 — Transcription Factor Sequestration (Weeks 4–10): Co-incubate purified mHTT condensates with recombinant SP1-DBD, CBP-KIX, TFIID-TAF1 at equimolar ratios (1 μM each). Quantify partitioning by centrifugation (16,000×g, 10 min) and SDS-PAGE/western blot of pellet vs. supernatant fractions. Validate by fluorescence microscopy (confocal, TIRF).
PHASE 3 — Cellular Validation (Weeks 6–16): Use STHdh Q7/Q7 vs. Q111/Q111 striatal cell lines and iPSC-derived HD neurons (Q60, Q109). Perform: (a) live-cell imaging with mHTT-GFP and transcription factor-mCherry fusions; (b) FRAP in cells; (c) nuclear/cytoplasmic fractionation + western blot; (d) RNA-seq for transcriptome-wide expression changes.
PHASE 4 — Compound Screening and Rescue (Weeks 12–24): Screen 500–2,000 compounds (FDA-approved library + condensate-targeting compounds: 1,6-hexanediol analogs, small molecule chaperones, PPI inhibitors) using high-content imaging (condensate number, size, FRAP recovery as readouts). Top 20 hits validated by RNA-seq rescue assay.
PHASE 5 — In Vivo Validation (Weeks 20–36): Administer top 3 compounds to R6/2 mice (n=15/group) via IP injection or CNS delivery. Assess: striatal mHTT condensate properties by cryo-ET on brain slices, transcription factor nuclear availability by IF, gene expression by qRT-PCR/RNA-seq, behavioral outcomes (rotarod, open field).
- Protein sequences: Human HTT exon1 with Q46, Q72, Q97 repeats (UniProt P42858, engineered variants)
- Transcription factor structures: SP1 (PDB: 1SP1), CBP KIX domain (PDB: 1KDX), TFIID subunits (PDB: 6MZM)
- HD patient iPSC lines: NINDS Human Cell and Data Repository — HD iPSC lines Q60, Q109, Q180 (catalog #ND42229, ND42230)
- Mouse models: R6/2 (JAX #002810), zQ175 knock-in (JAX #027410)
- STHdh cell lines: Q7/Q7 and Q111/Q111 (Coriell Institute)
- RNA-seq reference: GTEx striatum expression data; HD human brain RNA-seq (GEO: GSE64810, GSE33000)
- Compound libraries: Selleckchem FDA-approved library (L1300, ~1,400 compounds); Sigma condensate-targeting set
- Cryo-ET reference maps: mHTT fibril structures (EMDB: EMD-0839, EMD-3990)
- AlphaFold2 predictions: mHTT LCD conformational ensemble (AF-P42858-F1)
- Proteomics: HD striatum proteome (ProteomeXchange PXD013277)
- ChIP-seq data: SP1 binding in HD vs. WT neurons (ENCODE ENCSR000EGM)
- MD simulation force fields: CHARMM36m for IDR/polyQ simulations
- Biophysical: mHTT Q72/Q97 condensates show FRAP mobile fraction <20% and t½ >10 min (gel-like); viscosity η >10³ Pa·s by microrheology; Q23 shows liquid-like behavior (mobile fraction >80%).
- Partition coefficient: Kp ≥3.0 for SP1, CBP, and TFIID in vitro; ≥2.5-fold enrichment by IF in cellular condensates.
- Nuclear TF depletion: ≥40% reduction in nuclear SP1, CBP, TFIID in Q111 vs. Q7 cells by western blot (p<0.01).
- Gene expression suppression: ≥50% reduction in BDNF, PGC-1α, DRD2 mRNA in Q111 vs. Q7 cells (RNA-seq, FDR<0.05, log2FC ≤−1).
- Compound screening: ≥5 compounds identified with Z-score <−2 for condensate dissolution and <20% cytotoxicity at 10 μM.
- Gene expression rescue: Top compound restores ≥70% of suppressed genes to WT levels in RNA-seq rescue experiment.
- In vivo: R6/2 mice treated with top compound show ≥30% reduction in striatal condensate density by cryo-ET; ≥40% improvement in rotarod performance vs. vehicle-treated R6/2 (p<0.05).
- iPSC validation: Findings replicated in ≥2 independent iPSC-derived MSN lines.
- mHTT condensates show liquid-like FRAP recovery (>60% mobile fraction within 30 min) — gel-like classification fails.
- Partition coefficient Kp <1.5 for all tested transcription factors — sequestration mechanism fails.
- Nuclear TF levels unchanged (<15% difference) between Q7 and Q111 cells — cellular relevance fails.
- RNA-seq shows <25% of expected gene expression changes (fewer than 50 differentially expressed genes at FDR<0.05) — transcriptional impact fails.
- Compound screen yields 0 hits with Z-score <−2 and <30% cytotoxicity — therapeutic tractability fails.
- Best compound restores <30% of suppressed gene expression — rescue mechanism fails.
- No detectable condensates in R6/2 striatum by cryo-ET — in vivo relevance fails.
- Condensate dissolution by compound does not correlate with TF release (r² <0.3) — mechanistic link fails.
4,800
GPU hours
252d
Time to result
$185,000
Min cost
$1,250,000
Full cost
ROI Projection
- Therapeutic: Lead compounds from screen are immediately patentable as condensate-dissolving agents for HD; composition-of-matter patents on novel condensate modulators; method-of-treatment patents for condensate dissolution in polyQ diseases. Estimated licensing value: $50–200M upfront + royalties.
- Screening platform: High-content FRAP-based condensate dissolution assay is licensable to pharma (Pfizer, Roche, AstraZeneca all have active condensate programs); platform value estimated $20–50M.
- Diagnostics: Patient-derived neuron condensate assay as companion diagnostic for HD clinical trials; market size ~$15M/year for HD diagnostics.
- Research tools: Recombinant mHTT condensate reagents, validated antibodies, and cell lines have commercial value ($500K–$2M/year reagent sales).
- Adjacent markets: Condensate-targeting approach applicable to ALS (FUS, TDP-43), frontotemporal dementia, and other polyQ diseases — total addressable market for condensate-targeted neurodegeneration therapies estimated at $15–25B by 2035.
- Academic partnerships: Validated platform enables 3–5 major pharma collaborations ($5–20M each) and NIH BRAIN Initiative or HEAL Initiative funding ($10–50M).
- IP landscape: Currently limited patents on condensate dissolution for HD; first-mover advantage significant; FTO analysis recommended before filing.
🔓 If proven, this unlocks
Proving this hypothesis is a prerequisite for the following downstream discoveries and applications:
- 1HD-CONDENSATE-DRUG-SCREEN-PHASE2-010
- 2POLYX-DISEASE-CONDENSATE-ATLAS-011
- 3TF-SEQUESTRATION-NEURODEGENERATION-012
- 4CONDENSATE-TARGETED-THERAPY-PLATFORM-013
- 5HTT-INTERACTOME-CONDENSATE-014
- 6CAG-REPEAT-PHASE-BEHAVIOR-SPECTRUM-015
- 7HD-BIOMARKER-CONDENSATE-PROPERTIES-016
Prerequisites
These must be validated before this hypothesis can be confirmed:
- HTT-LCD-LLPS-BIOPHYSICS-001
- POLYX-PHASE-DIAGRAM-002
- TF-CONDENSATE-PARTITIONING-003
- HD-IPSC-NEURONAL-DIFFERENTIATION-004
- CONDENSATE-RHEOLOGY-METHODS-005
Implementation Sketch
# Experimental Validation Pipeline — HD Condensate EVP # Pseudocode architecture for computational and data analysis components class HDCondensateValidationPipeline: def __init__(self): self.protein_sequences = load_HTT_sequences(Q_lengths=[23, 46, 72, 97]) self.transcription_factors = ['SP1', 'CBP', 'TFIID_TBP', 'NFY_A'] self.compound_library = load_compound_library(n=2000) self.cell_lines = ['STHdh_Q7', 'STHdh_Q111', 'iPSC_Q60', 'iPSC_Q109'] # MODULE 1: Phase Diagram Computation def compute_phase_diagram(self, sequence, conditions): """ Coarse-grained MD using CALVADOS2 force field Input: HTT LCD sequence, salt/temperature conditions Output: Csat, phase boundary, condensate properties """ model = CALVADOS2(sequence) trajectories = model.simulate( n_chains=100, box_size=50, # nm temperature=310, # K ionic_strength=0.15, # M duration=10e-6, # 10 microseconds timestep=10e-15 # 10 fs ) Csat = calculate_critical_concentration(trajectories) contact_map = compute_contact_map(trajectories, cutoff=0.8) # nm return PhaseData(Csat=Csat, contacts=contact_map, Rg=calc_Rg(trajectories)) # MODULE 2: FRAP Analysis def analyze_FRAP(self, fluorescence_timeseries, bleach_time, pixel_size): """ Fit FRAP recovery curves to extract mobile fraction and t_half """ # Normalize: F_norm = (F(t) - F_bleach) / (F_pre - F_bleach) F_norm = normalize_FRAP(fluorescence_timeseries, bleach_time) # Single exponential fit: F(t) = A*(1 - exp(-t/tau)) + C params = curve_fit( lambda t, A, tau, C: A*(1-np.exp(-t/tau))+C, time_axis, F_norm, p0=[0.8, 30, 0.1], bounds=([0,0,0],[1,3600,0.5]) ) mobile_fraction = params['A'] t_half = params['tau'] * np.log(2) # Classification if mobile_fraction > 0.8 and t_half < 30: state = 'LIQUID' elif mobile_fraction < 0.2 and t_half > 600: state = 'GEL' else: state = 'VISCOELASTIC' return FRAPResult(mobile_fraction=mobile_fraction, t_half=t_half, state=state) # MODULE 3: Transcription Factor Partitioning def calculate_partition_coefficient(self, pellet_intensity, supernatant_intensity, pellet_volume=0.01, total_volume=0.1): """ Kp = ([TF]_condensate) / ([TF]_dilute_phase) """ conc_condensate = pellet_intensity / pellet_volume conc_dilute = supernatant_intensity / (total_volume - pellet_volume) Kp = conc_condensate / conc_dilute return Kp # Target: Kp >= 3.0 # MODULE 4: RNA-seq Differential Expression def run_differential_expression(self, count_matrix, metadata): """ DESeq2 wrapper for HD vs. WT comparison """ dds = DESeqDataSet(count_matrix, design='~condition') dds = DESeq(dds) results = results(dds, contrast=['condition', 'Q111', 'Q7'], alpha=0.05, lfcThreshold=1.0) # Filter significant genes sig_genes = results[results['padj'] < 0.05] suppressed = sig_genes[sig_genes['log2FoldChange'] < -1] # GSEA on ranked gene list ranked_genes = results.sort_values('stat', ascending=False) gsea_results = run_GSEA(ranked_genes, gene_sets=['HALLMARK', 'KEGG', 'HD_SIGNATURES']) return DEResults(suppressed_genes=suppressed, gsea=gsea_results, n_DE=len(sig_genes)) # MODULE 5: Compound Screening Analysis def analyze_compound_screen(self, plate_data, controls): """ High-content imaging analysis for condensate dissolution """ # Calculate Z-scores per plate pos_ctrl_mean = np.mean(controls['positive']) pos_ctrl_std = np.std(controls['positive']) for compound in plate_data: compound['condensate_area_zscore'] = ( (compound['mean_condensate_area'] - pos_ctrl_mean) / pos_ctrl_std ) compound['frap_recovery_zscore'] = ( (compound['frap_mobile_fraction'] - pos_ctrl_mean) / pos_ctrl_std ) # Combined score compound['hit_score'] = ( compound['condensate_area_zscore'] + compound['frap_recovery_zscore'] ) / 2 # Select hits: Z-score < -2, viability > 70% hits = [c for c in plate_data if c['hit_score'] < -2 and c['viability'] > 0.70] return sorted(hits, key=lambda x: x['hit_score']) # MODULE 6: In Vivo Analysis def analyze_mouse_behavior(self, rotarod_data, groups): """ Statistical analysis of behavioral outcomes """ # Rotarod: latency to fall (seconds) anova_result = one_way_ANOVA(rotarod_data, groups) posthoc = tukey_hsd(rotarod_data, groups) # Effect size (Cohen's d) for compound vs. vehicle in R6/2 d = cohens_d( rotarod_data['R6/2_compound'], rotarod_data['R6/2_vehicle'] ) return BehaviorResult(anova=anova_result, posthoc=posthoc, effect_size=d) # MAIN EXECUTION PIPELINE def run_full_validation(self): # Phase 1: Biophysics phase_diagrams = {Q: self.compute_phase_diagram(seq, conditions) for Q, seq in self.protein_sequences.items()} # Phase 2: TF partitioning partition_data = self.measure_TF_partitioning( condensates=phase_diagrams, TFs=self.transcription_factors ) # Phase 3: Cellular validation cellular_data = {} for cell_line in self.cell_lines: frap = self.analyze_FRAP(measure_FRAP(cell_line)) tf_nuclear = measure_nuclear_TF(cell_line) rnaseq = self.run_differential_expression( get_counts(cell_line), get_metadata(cell_line) ) cellular_data[cell_line] = {'frap': frap, 'tf': tf_nuclear, 'rnaseq': rnaseq} # Phase 4: Compound screen screen_results = self.analyze_compound_screen( run_HCS(self.compound_library, 'STHdh_Q111'), controls=get_controls() ) top_compounds = screen_results[:20] # Phase 5: In vivo mouse_results = run_mouse_study(top_compounds[:3]) behavior = self.analyze_mouse_behavior(mouse_results['rotarod'], mouse_results['groups'])