language:
- en
license: mit
tags:
- cuda
- gpu
- number-theory
- computational-mathematics
- continued-fractions
- zaremba
- ramsey
- kronecker-coefficients
- class-numbers
- hausdorff-dimension
- ramanujan-machine
- erdos-straus
- prime-convergents
- flint-hills
- spectral-methods
- bigcompute
library_name: other
pipeline_tag: other
datasets:
- cahlen/zaremba-density
- cahlen/zaremba-conjecture-data
- cahlen/class-numbers-real-quadratic
- cahlen/kronecker-coefficients
- cahlen/hausdorff-dimension-spectrum
- cahlen/continued-fraction-spectra
- cahlen/ramanujan-machine-results
bigcompute.science CUDA Kernels
51 custom CUDA kernels for GPU-accelerated computational mathematics research. These kernels power the experiments at bigcompute.science.
All kernels are standalone β compile with nvcc, run from the command line. No PyTorch dependency.
Hardware
Developed and tested on:
- 8x NVIDIA B200 (183 GB VRAM each, sm_90)
- NVIDIA RTX 5090 (32 GB VRAM, sm_120)
Most kernels will run on any CUDA GPU (sm_50+). Compile with your target architecture:
nvcc -O3 -arch=sm_XX -o kernel kernel.cu -lm
Kernels by Experiment
Zaremba's Conjecture (25 kernels)
Density enumeration (zaremba-density/) β complete CF tree enumeration with bitset marking:
zaremba_density_gpu.cuβ production kernel, 65+ runs to 10^12zaremba_density_v2.cuβ alternative implementationzaremba_density_gpu_worksteal_v2.cuβ work-stealing variant for load balancing
Transfer operator (zaremba-transfer-operator/) β Chebyshev collocation spectral method:
transfer_operator.cuβ spectral gap computation for Ruelle operator
Effective bound (zaremba-effective-bound/) β Bourgain-Kontorovich proof framework:
spectral_gaps_fast.cuβ bulk spectral gap verificationspectral_gaps_primes.cuβ prime-indexed gapscertify_rho_cuda.cuβ arb ball arithmetic certificationcompute_Q0.cu/Q0_frolenkov_kan.cuβ effective constant extractioncount_representations.cuβ CF representation countingdolgopyat_exact.cu/dolgopyat_profile.cuβ Dolgopyat estimate profilingexponential_sum.cuβ exponential sum boundsextract_eigenfunction.cuβ transfer operator eigenfunction extractionflat_spectral_gap.cuβ uniform spectral gap verificationmatrix_enum.cu/matrix_enum_multipass.cuβ SL(2,Z) matrix enumerationminor_arc_primes.cu/minor_arc_profile.cuβ minor arc estimatesverify_all_gaps_fp64.cu/verify_gaps_interval.cu/verify_gaps_v2.cuβ gap verification suitecompute_c1_rigorous.cuβ rigorous constant computation
Cayley diameters (zaremba-cayley-diameter/) β BFS on Cayley graphs of SL(2,Z/pZ):
cayley_diameter.cu/cayley_gpu.cuβ full BFS diameter computation
Transitivity (zaremba-transitivity/) β algebraic verification:
check_transitivity.cuβ Dickson classification check
Ramsey R(5,5) (7 kernels)
ramsey-r55/ β search for 2-colorings of complete graphs with no monochromatic K5:
ramsey_gpu.cuβ base simulated annealing kernelramsey_incremental.cu/ramsey_incremental_v2.cuβ incremental K5 counterramsey_extend.cu/ramsey_extend_all.cuβ exhaustive extension checking (4.4T extensions of K42 to K43)ramsey_fullcount.cuβ complete clique enumerationramsey_search.cu/ramsey_global.cu/ramsey_verified.cuβ search variants
Class Numbers (4 kernels)
class-numbers/ β class numbers of real quadratic fields via BSGS:
class_numbers_v2.cuβ production kernel (10^9 to 10^12 range)class_number_rqf.cuβ real quadratic field specializationclass_number_fast.cuβ optimized inner loopsieve_gpu.cuβ GPU prime sieve
Kronecker Coefficients (3 kernels)
kronecker-coefficients/ β character tables and Kronecker triple computation:
kronecker_gpu.cuβ full character table (S20: 3.7s, S30: 7.4 min, S40: 9.5 hr)kronecker_fast.cuβ optimized triple-sumkronecker_compute.cuβ targeted triple computation
Ramanujan Machine (2 kernels)
ramanujan-machine/ β automated discovery of continued fraction formulas:
ramanujan_gpu.cuβ v1 kernel (equal-degree polynomials, exhausted)ramanujan_v2.cuβ v2 kernel (asymmetric-degree, where new discoveries live)
Prime Convergents (2 kernels)
prime-convergents/ β prime statistics of CF convergents:
prime_convergents.cuβ v1 (uint64, depth ~38)prime_convergents_v2.cuβ v2 (uint128, depth ~75, 128-bit Miller-Rabin)
Erdos-Straus Conjecture (1 kernel)
erdos-straus/ β solution counting for 4/p = 1/x + 1/y + 1/z:
erdos_straus.cuβ per-prime f(p) enumeration, tested to 10^9
Spectral Computations (4 kernels)
hausdorff-spectrum/ β Hausdorff dimension via transfer operator + Chebyshev collocation:
hausdorff_spectrum.cuβ all 2^20 - 1 subsets of {1,...,20}
lyapunov-spectrum/ β Lyapunov exponents of CF digit sets:
lyapunov_spectrum.cuβ full spectrum computation
minkowski-spectrum/ β Minkowski question-mark function:
minkowski_spectrum.cuβ singularity spectrum
flint-hills/ β Flint Hills series partial sums:
flint_hills.cuβ high-precision partial sum to 10B terms
Results
All computation results are open:
- Website: bigcompute.science
- Datasets: huggingface.co/cahlen
- Source code: github.com/cahlen/idontknow
- MCP server: mcp.bigcompute.science
License
MIT
Citation
@misc{humphreys2026bigcompute,
author = {Humphreys, Cahlen},
title = {bigcompute.science: GPU-Accelerated Computational Mathematics},
year = {2026},
url = {https://bigcompute.science}
}
Human-AI collaborative research (Cahlen Humphreys + Claude). All code and data open for verification.