File size: 815 Bytes
ef330b0
 
 
 
00d24d5
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
dbdc453
00d24d5
 
 
 
 
 
 
dbdc453
 
 
 
 
 
 
 
 
00d24d5
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
---
license: apache-2.0
datasets:
- liarzone/EVAttrs-95K
---

# Model Card for EagleVision-4B


## Model Details

* Baseline Detector: Oriented R-CNN
* LLM: microsoft/Phi-3-mini-128k-instruct
* Training Data: EVAttrs-95K

## Model Sources

<!-- Provide the basic links for the model. -->

- **Repository:** https://github.com/XiangTodayEatsWhat/EagleVision
- **Paper:** https://arxiv.org/abs/2503.23330


## Citation 



```
@misc{jiang2025eaglevisionobjectlevelattributemultimodal,
      title={EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing}, 
      author={Hongxiang Jiang and Jihao Yin and Qixiong Wang and Jiaqi Feng and Guo Chen},
      year={2025},
      eprint={2503.23330},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2503.23330}, 
}
```