Towards Deeper, Lighter and Interpretable Cross Network for CTR Prediction
Paper β’ 2311.04635 β’ Published
| Metric | This Run | Paper (full 40M) |
|---|---|---|
| AUC | 0.749974 | 0.7905 |
| LogLoss | 0.396100 | 0.3739 |
| Epoch | 2/8 | - |
| Params | 1,812,017 | - |
GDCN (CIKM 2024)
22 fields β Embed(16d) β [352d]
βββ Gated Cross Net (3 layers) β [352d]
βββ DNN (400β400β400) β [400d]
Concat β Linear(1) β Ο β P(click)
Gate: Ο(W_gΒ·x) filters each cross layer's interactions.
This model repository was generated by ML Intern, an agent for machine learning research and development on the Hugging Face Hub.
from transformers import AutoModelForCausalLM, AutoTokenizer
model_id = 'klozehsu/gdcn-criteo-ctr'
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)
For non-causal architectures, replace AutoModelForCausalLM with the appropriate AutoModel class.