new

Get trending papers in your email inbox!

Subscribe

Daily Papers

byAK and the research community

Dec 12

Unveiling the soft X-ray source population towards the inner Galactic disk with XMM-Newton

Across the Galactic disk lies a diverse population of X-ray sources, with the fainter end remaining poorly understood due to past survey sensitivity limits. We aim to classify and characterize faint X-ray sources detected in the eROSITA All-Sky Survey (eRASS1) towards the inner Galactic disk (350^circ < l < 360^circ, -1^circ < b < 1^circ) using deeper XMM-Newton observations (typical exposure of sim 20,ks). We analyzed 189 eRASS1 sources, combining X-ray spectral fitting (0.2--10,keV) with Gaia astrometric and photometric data for robust classification. Our results show that the eRASS1 catalog towards the Galactic disk is overwhelmingly dominated by coronal sources (sim 74%), primarily active stars and binaries, with sim 8% being wind-powered massive stars and sim 18% being accreting compact objects. We propose an empirical hardness-ratio cut (HR > -0.2) to efficiently isolate these non-coronal sources. By stacking the classified population and comparing with the Galactic Ridge X-ray Emission (GRXE), we estimate that sim 6% of the GRXE flux in the 0.5--2.0,keV band is resolved into point sources above the eRASS1 flux limit (sim 5times 10^{-14},erg,cm^{-2},s^{-1}). This resolved soft-band emission is dominated by active stars, while hard-band flux originates primarily from X-ray binaries. We conclude that the eRASS1 catalog retains a non-negligible population of compact objects that can be effectively distinguished using X-ray color selection.

  • 8 authors
·
Oct 27

The Chandra Source Catalog

The Chandra Source Catalog (CSC) is a general purpose virtual X-ray astrophysics facility that provides access to a carefully selected set of generally useful quantities for individual X-ray sources, and is designed to satisfy the needs of a broad-based group of scientists, including those who may be less familiar with astronomical data analysis in the X-ray regime. The first release of the CSC includes information about 94,676 distinct X-ray sources detected in a subset of public ACIS imaging observations from roughly the first eight years of the Chandra mission. This release of the catalog includes point and compact sources with observed spatial extents <~ 30''. The catalog (1) provides access to the best estimates of the X-ray source properties for detected sources, with good scientific fidelity, and directly supports scientific analysis using the individual source data; (2) facilitates analysis of a wide range of statistical properties for classes of X-ray sources; and (3) provides efficient access to calibrated observational data and ancillary data products for individual X-ray sources, so that users can perform detailed further analysis using existing tools. The catalog includes real X-ray sources detected with flux estimates that are at least 3 times their estimated 1 sigma uncertainties in at least one energy band, while maintaining the number of spurious sources at a level of <~ 1 false source per field for a 100 ks observation. For each detected source, the CSC provides commonly tabulated quantities, including source position, extent, multi-band fluxes, hardness ratios, and variability statistics, derived from the observations in which the source is detected. In addition to these traditional catalog elements, for each X-ray source the CSC includes an extensive set of file-based data products that can be manipulated interactively.

  • 39 authors
·
May 25, 2010

Assemblage: Automatic Binary Dataset Construction for Machine Learning

Binary code is pervasive, and binary analysis is a key task in reverse engineering, malware classification, and vulnerability discovery. Unfortunately, while there exist large corpuses of malicious binaries, obtaining high-quality corpuses of benign binaries for modern systems has proven challenging (e.g., due to licensing issues). Consequently, machine learning based pipelines for binary analysis utilize either costly commercial corpuses (e.g., VirusTotal) or open-source binaries (e.g., coreutils) available in limited quantities. To address these issues, we present Assemblage: an extensible cloud-based distributed system that crawls, configures, and builds Windows PE binaries to obtain high-quality binary corpuses suitable for training state-of-the-art models in binary analysis. We have run Assemblage on AWS over the past year, producing 890k Windows PE and 428k Linux ELF binaries across 29 configurations. Assemblage is designed to be both reproducible and extensible, enabling users to publish "recipes" for their datasets, and facilitating the extraction of a wide array of features. We evaluated Assemblage by using its data to train modern learning-based pipelines for compiler provenance and binary function similarity. Our results illustrate the practical need for robust corpuses of high-quality Windows PE binaries in training modern learning-based binary analyses. Assemblage can be downloaded from https://assemblage-dataset.net

  • 8 authors
·
May 7, 2024