SentenceTransformer based on allenai/specter2_aug2023refresh_base
This is a sentence-transformers model finetuned from allenai/specter2_aug2023refresh_base. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
Model Details
Model Description
- Model Type: Sentence Transformer
- Base model: allenai/specter2_aug2023refresh_base
- Maximum Sequence Length: 512 tokens
- Output Dimensionality: 768 dimensions
- Similarity Function: Cosine Similarity
Model Sources
- Documentation: Sentence Transformers Documentation
- Repository: Sentence Transformers on GitHub
- Hugging Face: Sentence Transformers on Hugging Face
Full Model Architecture
SentenceTransformer(
(0): Transformer({'max_seq_length': 512, 'do_lower_case': False}) with Transformer model: BertModel
(1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
(2): Normalize()
)
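The Pooling and Normalize modules listed above can be illustrated with a minimal NumPy sketch. It is not the library's implementation: the function name `mean_pool_and_normalize` is hypothetical, and the toy input uses a hidden size of 2 instead of 768 to keep the numbers readable. It only shows the idea that token vectors are averaged over non-padding positions and the result is scaled to unit length.

```python
import numpy as np

def mean_pool_and_normalize(token_embeddings, attention_mask):
    """Average token vectors over non-padding positions, then L2-normalize.

    token_embeddings: float array of shape (batch, seq_len, hidden)
    attention_mask:   0/1 array of shape (batch, seq_len); 0 marks padding
    """
    mask = attention_mask[..., None].astype(token_embeddings.dtype)
    summed = (token_embeddings * mask).sum(axis=1)   # sum of real tokens
    counts = np.clip(mask.sum(axis=1), 1e-9, None)   # number of real tokens
    pooled = summed / counts                         # mean pooling
    norms = np.linalg.norm(pooled, axis=1, keepdims=True)
    return pooled / np.clip(norms, 1e-12, None)      # unit-length vectors

# Toy input: one sequence of three tokens, the last of which is padding.
tokens = np.array([[[1.0, 0.0], [3.0, 4.0], [9.0, 9.0]]])
mask = np.array([[1, 1, 0]])
sentence_vec = mean_pool_and_normalize(tokens, mask)
print(sentence_vec)
```

Because the output vectors have unit length, the cosine similarity of two embeddings equals their dot product.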
Usage
Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
pip install -U sentence-transformers
Then you can load this model and run inference.
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("m7n/discipline-tuned_specter_2_024")
# Run inference
sentences = [
'Twenty one surviving infants of pregnancies complicated by rupture of the membranes during the second trimester that lasted at least one week have been followed up for a median of months. Five infants ( %) had recurrent respiratory problems (episodes of wheezing and coughing occurring at least once a week) which related significantly to the use of neonatal ventilation and to very preterm delivery. Five of the infants who were born preterm and with birth weights of less than g had recurrent respiratory symptoms ( %). This compares favourably with an incidence of symptoms of % among surviving low birthweight infants born at this hospital after pregnancies not complicated by premature rupture of the membranes. Neither recurrent respiratory symptoms nor admission to hospital for chest related disorders were associated with the timing of onset or duration of rupture of the membranes. We conclude that, among survivors of premature rupture of the membranes, chronic respiratory morbidity would best be prevented by avoiding very preterm delivery, regardless of the duration of the rupture.',
'Doppler ultrasound measurements of pulmonary blood flow in babies with severe respiratory distress syndrome treated in a randomised controlled trial of surfactant replacement showed that the immediate improvement of oxygenation was not associated with a significant increase in pulmonary blood flow. Reduction in ventilator settings and increases in the extent of chest wall movements measured by a cardiorespiratory monitor suggested that the improvement after surfactant had been given was a result of alveolar stabilisation and increased pulmonary compliance. Further simultaneous studies of pulmonary blood flow and pulmonary compliance are needed to confirm these findings.',
'The effect of different variants of compiling integrated samples for biochemical oxygen demand (BOD) kinetics was studied in long-term experiments (up to days) with water samples taken from the central deep-water region of Lake Onego. It was a series of experiments carried out simultaneously at and in different seasons of . Five sampling variants were employed with different horizon combinations: near surface, near bottom, from different depths in the water column, from the photic and profundal layers. Two experiments were performed with winter water, three with summer water, four with autumn water, and seven experiments with spring water. The most representative sample for studying BOD in long-term experiments is an sample composed of water from different horizons of the photic layer ( m). For each variant of integrated sample composition, BOD development in the experiments was modeled by a corresponding kinetic equation whose parameters represented the oxidation characteristics of components of the organic matter present in the water and transformed in the long-term BOD experiment. The resultant kinetic parameters of BOD were analyzed in relation to the factors determining the final oxidation of the organic matter components. The patterns in which the type of BOD development is formed depend on the integrated water sample collection/compilation conditions and are characterized by the average values of the organic matter contained in the water, estimated either analytically or from empirical equations, as well as by the temperature of exposure of water samples in the experiment. Synthesis of the resultant information showed that the values of BOD kinetic parameters were generally lower in spring water taken from the central part of Lake Onego as compared with other seasons, since the oxidation potential of organic matter components in spring water is higher.',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# (3, 768)
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# torch.Size([3, 3])
Evaluation
Metrics
Triplet
- Datasets: discipline-tuned_specter_2_022 and discipline-tuned_specter_2_024
- Evaluated with TripletEvaluator
Metric | discipline-tuned_specter_2_022 | discipline-tuned_specter_2_024 |
---|---|---|
cosine_accuracy | 0.9714 | 0.9710 |
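As a rough illustration of what a cosine accuracy figure means for triplets, the sketch below counts the fraction of triplets in which the anchor is more similar to its positive than to its negative. This is a simplified stand-in, not the TripletEvaluator code; the function names and the toy vectors are hypothetical.

```python
import numpy as np

def cosine_similarity_rows(a, b):
    """Row-wise cosine similarity between two (n, dim) arrays."""
    a = a / np.linalg.norm(a, axis=1, keepdims=True)
    b = b / np.linalg.norm(b, axis=1, keepdims=True)
    return (a * b).sum(axis=1)

def triplet_cosine_accuracy(anchors, positives, negatives):
    """Fraction of triplets where sim(anchor, positive) > sim(anchor, negative)."""
    pos_sim = cosine_similarity_rows(anchors, positives)
    neg_sim = cosine_similarity_rows(anchors, negatives)
    return float((pos_sim > neg_sim).mean())

# Two toy triplets: the first is ranked correctly, the second is not.
anchors = np.array([[1.0, 0.0], [0.0, 1.0]])
positives = np.array([[1.0, 0.1], [1.0, 0.0]])
negatives = np.array([[0.0, 1.0], [0.0, 1.0]])
accuracy = triplet_cosine_accuracy(anchors, positives, negatives)
print(accuracy)
```

In practice the three arrays would come from `model.encode(...)` applied to the anchor, positive, and negative texts of an evaluation set.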
Training Details
Training Dataset
Unnamed Dataset
- Size: 43,494 training samples
- Columns: anchor, positive, and negative
- Approximate statistics based on the first 1000 samples:
Column | Type | Min tokens | Mean tokens | Max tokens |
---|---|---|---|---|
anchor | string | 80 | 232.53 | 512 |
positive | string | 81 | 230.16 | 512 |
negative | string | 86 | 229.66 | 512 |
- Samples (each group of three consecutive entries below is one triplet, in the order anchor, positive, negative):
Lupus nephritis (LN) is one of the major risk factors for morbidity and overall mortality in systemic lupus erythematosus (SLE). Its pathogenesis is multifactorial, and a number of risk factors, including serological markers, have been identified in recent years, correlating with clinical course and disease severity. Furthermore, a distinctive autoantibody profile has recently been reported in African- American SLE women with LN. The aim of this study was to characterize the autoantibody profile in African-American SLE patients, with LN and without. Only anti-dsDNA achieved statistical significance between the two groups (P < ). Fourteen ( %) patients with LN and ( %) without it exhibited positive anti-Ro/SS-A, anti-Sm, and anti-nRNP, but without anti-La/SS B (P > ). We conclude that African-American SLE patients with LN do not exhibit a specific or distinctive autoantibody profile. However, our data confirm the value of anti-dsDNA in SLE patients with LN.
TRIM00 is a member of the tripartite motif family proteins and is one of the autoantigens which react with anti-SS-A antibody (Ab) present in sera of patients with systemic lupus erythematosus (SLE) and Sjogren's syndrome. Previous studies have shown that TRIM00 dysfunction promotes aberrant B-cell differentiation and Ab production in SLE, and anti-TRIM00 Ab may be related to the TRIM00 dysfunction in human SLE pathogenesis. Here, we examined the relationship between anti-TRIM00 Ab and clinical and immunological characteristics in SLE patients.Twenty-seven patients with SLE ( women and four men) before immunosuppressive therapies, who fulfilled the revised American College of Rheumatology criteria for SLE, and four healthy controls ( women and one man) were enrolled in the study. SLE patients were divided into two groups according to the seropositivity for anti-TRIM00 Ab. Serum anti-TRIM00 Ab levels were measured using enzyme-linked immunosorbent assays. The serum levels of cytokines a...
We construct a stochastic model of real estate pricing. The method of the pricing construction is based on a sequential comparison of the supply prices. We proof that under standard assumptions imposed upon the comparison coefficients there exists an unique non-degenerated limit in distribution and this limit has the lognormal law of distribution. The accordance of empirical distributions of prices to thetheoretically obtained log-normal distribution we verify by numerous statistical data of real estate prices from Saint-Petersburg (Russia). For establishing this accordance we essentially apply the efficient and sensitive test of fit of Kolmogorov-Smirnov. Basing on "The Russian Federal Estimation Standard N0", we conclude that the most probable price, i.e. mode of distribution, is correctly and uniquely defined under the log-normal approximation. Since the mean value of log-normal distribution exceeds the mode - most probable value, it follows that the prices valued by the mathematica...
A laboratory prototype of an enzyme biosensor based on pHsensitive field-effect transistors has been developed to determine the total content of indole alkaloids in Rauwolfia serpentina Benth. Ex Kurz tissue culture. The biosensor was characterized by high sensitivity to th A laboratory prototype of an enzyme biosensor based on pHsensitive field effect transistors has been developed to determine the total content of indole alkaloids in Rauwolfia serpentina Benth. Ex Kurz tissue culture. The biosensor was characterized by high sensitivity to the total content of indole alkaloids (minimum limit of determination g/ml of the total content of indole alkaloids contained in the juice obtained from tissue culture of Rauwolfia serpentina). The linear range of biosensor determination of the analyte was from to g / ml of the total content of indole alkaloids. Analysis of indole alkaloids using a biosensor is simple and fast and does not require expensive equipment and special sample preparation f...
A procedure of separate biosensor analysis of the multicomponent sample with aflatoxins and pesticides has been developed and optimized. Biosensor determination of aflatoxins and pesticides was performed using enzyme inhibition analysis. For creation of bioselective element we used enzyme acetylcholinesterase which is co-immobilized with bovine serum albumin on the surface of potentiometric transducer by glutaraldehyde covalent crosslinking. As transducers were pH-sensitive field effect transistors. The concentration of acetylcholine chloride as a substrate for subsequent inhibition analysis was fit; optimal time of inhibition by toxins solution was determinate together with concentration of reactivator (pyridine- -aldoxymmethyliodyd) and time of enzyme reactivation after inhibition. A synergism between trichlorfon and aflatoxin B0 in inhibition of immobilized on a surface pH-sensitive field-effect transistors acetylcholinesterase was investigated. The proposed procedure allows selecti...
Objective: To observe the effect of modified Zhenwu decoction on blood glucose and blood lipid of experimental diabetic rats.Methods: Diabetic model rats randomly were divided into normal control group,diabetic modeling group,modified Zhenwu decoction group.Establish intraperitoneal injection of Streptozotocin diabetic animal models by,after eight weeks blood glucose and blood lipids were detrmined.Results: After the treatment by modified Zhenwu decoction,blood glucose,blood lipid and other indicators improved significantly.Conclusion: Modified Zhenwu decotion can improve the level of renal lower blood glucose and lipid in diabetic rats.
In two successive years ( and ), a set of commercial sugar beet cultivars was established in Randomized Complete Block experiments at two sites in central Greece. Cultivar combination was different between years, but not between sites. Leaf sampling took place once during the growing season and leaf area, LA [cm0], leaf midvein length, L [cm] and maximum leaf width, W [cm] were determined using an image analysis system. Leaf parameters were mainly affected by cultivars. Leaf dimensions and their squares (L0, W0) did not provide an accurate model for LA predictions. Using LW as an independent variable, a quadratic model (y = x0 - x + , r = , p< , n = ) provided the most accurate estimation of LA. With compromises in accuracy, the linear relationship between LW and LA (y = x + , r = , p< , n = ) could be used as a prediction model thanks to its simplicity.
The general increase in temperature, together with sudden episodes of extreme temperatures, are increasingly impacting plant species in the present climate change scenario. Limoniastrum monopetalum is a halophyte from the Mediterranean Basin, exposed to broad daily and seasonal changes in temperature and extreme high temperatures. We studied the photosynthetic responses (chlorophyll fluorescence dynamics and gas exchange) of L. monopetalum leaves exposed to temperatures from .0C to .0C under darkness in controlled laboratory conditions. L. monopetalum presented its optimum temperature for photosynthesis around +00C. The photosynthetic apparatus of L. monopetalum exhibited permanent damages at > .0C. L. monopetalum tolerated, without permanent damages, temperatures as low as .0C in darkness. L. monopetalum appears as a plant species very well adapted to the seasonality of the Mediterranean climate, which may work as a pre-adaptation to stand more extreme temperatures in the actual conte...
The article depicts direct and hidden (implicit and explicit) information giving in advertisement discourse, meaning advertising slogans. Having investigated this topic thoroughly, the author found out that cognitive types of presupposition and communicative implicatures played a great role in advertising slogans. There are definitions of phenomena "implicit" and "explicit" with examples. The cognitive types of presupposition (semantic and pragmatic) and their typology is discussed in the article. There is a possibility to figure out what strategy of communicative influence on human's cognition is. Some laws of neurolinguistic programming is also discussed.
- Loss: TripletLoss with these parameters: { "distance_metric": "TripletDistanceMetric.COSINE", "triplet_margin": 0.4 }
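To make the loss parameters concrete, here is a minimal NumPy sketch of a triplet loss with cosine distance (1 minus cosine similarity) and a margin of 0.4. It assumes the common formulation max(d(a, p) - d(a, n) + margin, 0) averaged over the batch; the function names and toy vectors are hypothetical, and the library version operates on model outputs during training rather than on fixed arrays.

```python
import numpy as np

def cosine_distance(a, b):
    """Row-wise cosine distance (1 - cosine similarity) for (n, dim) arrays."""
    a = a / np.linalg.norm(a, axis=1, keepdims=True)
    b = b / np.linalg.norm(b, axis=1, keepdims=True)
    return 1.0 - (a * b).sum(axis=1)

def triplet_loss(anchors, positives, negatives, margin=0.4):
    """Mean of max(d(a, p) - d(a, n) + margin, 0) over all triplets."""
    d_pos = cosine_distance(anchors, positives)
    d_neg = cosine_distance(anchors, negatives)
    return float(np.maximum(d_pos - d_neg + margin, 0.0).mean())

# First triplet is already well separated (contributes 0);
# second triplet is inverted (contributes 1 - 0 + 0.4 = 1.4).
anchors = np.array([[1.0, 0.0], [1.0, 0.0]])
positives = np.array([[1.0, 0.0], [0.0, 1.0]])
negatives = np.array([[0.0, 1.0], [1.0, 0.0]])
loss = triplet_loss(anchors, positives, negatives)
print(loss)
```

A triplet stops contributing to the loss once the negative is at least 0.4 farther from the anchor than the positive, in cosine distance.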
Evaluation Dataset
Unnamed Dataset
- Size: 2,174 evaluation samples
- Columns: anchor, positive, and negative
- Approximate statistics based on the first 1000 samples:
Column | Type | Min tokens | Mean tokens | Max tokens |
---|---|---|---|---|
anchor | string | 83 | 235.71 | 512 |
positive | string | 82 | 234.64 | 512 |
negative | string | 86 | 225.92 | 512 |
- Samples (each group of three consecutive entries below is one triplet, in the order anchor, positive, negative):
In Organic Law / of 0rd October of the general arrangement of the educational system (LOGSE), the educational system includes the general regime education and the special regime education. Dance is included in the special regime as part of the artistic disciplines together with music, drama, the plastic arts and design. The aim of this article is to analyse the treatment given to Dance in the general regime. Thus, we will try to emphasize the inconsistency that exists between the areas of primary education, which will be obligatory and will have a global and integrated character, and the training of future teachers.
This work aims to analyze the treatment of health education in school textbooks during the period , and to compare it with the one that is conducted at present. It will attempt to verify how many current concepts and ideas were already present in those decades. In addition, the differences in the way of carrying out health education then and now will be outlined, especially those referred to pedagogic strategies and didactic materials. All this will be done from a double perspective: . The concept of health, hygiene and pedagogy of health education. . The program contents of health education in the didactic materials.
The vane-in-cup (VIC) geometry has been widely used for the rheological characterization of yield-stress fluids because it minimizes slip effects at the liquid/solid interface of the rotating geometry and reduces sample damage during the loading process. However, severe kinematic limitations arising from the spatial complexity of mixed shear and extensional flow have been identified for quantitative rheometrical measurements in complex fluids. Recently, vanes with fractal cross sections have been suggested as alternatives for accurate rheometry of elastoviscoplastic fluids. In this work, the steady fractal vane-in-cup (fVIC) flow of a Newtonian fluid and a nonthixotropic Carbopol®️ microgel as well as the unsteady flow of a thixotropic -Carrageenan gel are analyzed using rheo-particle image velocimetry (Rheo-PIV). We describe the velocity distributions in all cases and show that the fVIC produces an almost axisymmetric flow field and rotation rate-independent "effective radius" when us...
An ultrahigh vacuum three-axis cryogenic sample manipulator suitable for angle-resolved photoelectron spectroscopy experiments was developed. The sample manipulator is constructed by combining three modules with translation, polar rotation, and azimuthal-tilt rotation capabilities. Polar rotation and the azimuthal-tilt rotation are performed using a differentially pumped rotary stage and a sample goniometer, respectively. Continuous rotation around the polar axis is possible. The sample goniometer is capable of azimuthal rotation of up to and tilt rotation from to , measured from the plane normal to the polar axis. Nonmagnetic materials are used near the sample holder of the goniometer. The sample holder can be cooled using a continuous-flow cryostat. To serve as a radiation shield, the lower portion of the goniometer surrounding the sample holder is cooled separately by another cell filled with liquid nitrogen. With liquid nitrogen or liquid helium for the cryostat, the sample holder ...
In the soft x-ray region below keV, various electron yield (EY) techniques have been employed in x-ray absorption fine structure (XAFS) measurements of bulk materials. The fluorescent x-ray yield (FY) is also utilized for samples of low concentration. Although FY becomes much smaller for lighter elements, it has several advantages compared with EY to measure XAFS spectra; for example, a higher signal-to-background ratio and applicability to insulating materials. However, it has been thought to be unsuitable for concentrated samples due to a self-absorption effect. In this report, the sampling depth and self-absorption effect for bulk concentrated samples are discussed concerning XAFS measurements in a few keV energy region. Some typical FY XAFS spectra of concentrated materials, including insulators, are presented.
To investigate the distribution characteristics of TCM syndromes and the related herbal prescriptions for malignant tumors (MT). A clinical database of the TCM syndromes and the herbal prescriptions in treatment of MT patients were established. The data were then analyzed using cluster and frequency analysis. According to the cluster analysis, the TCM syndromes in MT patients mainly included two patterns: deficiency of both Qi and Yin and internal accumulation of toxic heat. The commonly-prescribed herbs were Huangqi (Astraglus), Nuzhenzi (Fructus Ligustri Lucidi), Lingzhi (Ganoderma Lucidum), Huaishan (Dioscorea Opposita), Xiakucao (Prunella Vulgaris), and Baihuasheshecao (Herba Hedyotidis). Deficiency of Qi and Yin is the primary syndrome of MT, and internal accumulation of toxic heat is the secondary syndrome. The herbs for Qi supplementation and Yin nourishment are mainly used, with the assistance of herbs for heat-clearance and detoxification.
Abstract Abstract Worldwide opposition to different aspects of globalisation indicates the emergence of a global social movement that typically targets the international bodies that regulate global trade and global finance, as well as the regulations themselves. The significance of the movement calls for a synthetic analysis that moves beyond the currently used fragmentary descriptions. A more profound conceptual framework will enable researchers to better understand the full dynamic of the movement within its global context In this article we explore the possibilities of applying David Korten's ideal-typical notion of fourth generation development to the anti-globalisation movement. We ask whether anti-globalisation organisation exhibits so-called Fourth Generation characteristics and activities. Our goal is to determine the extent to which the movement as a whole, and the individual organisations which constitute it, conform to the fourth generation development conceptual framework. ...
Abstract Globalisation is a complex, multi-faceted, phenomenon with widely contested meanings. While it has roots in the history of colonialism, capitalist development and imperialism, there are strong indications that what we are witnessing, since the 0000s, is a qualitative break with the past. Old boundaries, categories and meanings are being challenged in profound ways. New forms of exploitation and subjugation emerge in such a way that stark brutal force coexists with and may be increasingly supplanted by more subtle, pervasive forces of hegemonic rule. The latter, however, has opened up new terrains of struggle for people, movements, and governments opposed to one-dimensional 'corporate globalisation', seeking instead the globalisation of social and environmental justice. A continent like Africa much of which has sunk deeper into a 'fourth world' status of extreme under-development, social instability and neo-colonial dependence faces stark choices. Does it seek to partially or f...
So much has been written about the nation vis-a-vis other fields in the humanities, literature in particular. My interest in dance lies in its peculiar location within and vis-a-vis the discourse of the nation. An ephemeral form, dance has elicited various, and even contradictory, valuations; most of the time it is considered a mere form of entertainment. It is undeniable, though, that dance has articulated and informed our ideas of the nation and nationhood. In this paper, I explore how three contemporary dance companies based in Quezon City (The University of the Philippines Dance Company, Airdance, and Dance Forum) have rendered their imaginings of the Philippine nation. I focus on Philippine contemporary dance because as a cultural practice, I believe that it has choreographed the many trajectories and issues embodied in the Philippines's imagining of itself. A number of choreographies by the three companies mobilize motifs, forms, structures, and styles that constitute and signify...
- Loss: TripletLoss with these parameters: { "distance_metric": "TripletDistanceMetric.COSINE", "triplet_margin": 0.4 }
Training Hyperparameters
Non-Default Hyperparameters
- eval_strategy: steps
- per_device_train_batch_size: 4
- per_device_eval_batch_size: 32
- learning_rate: 7e-06
- weight_decay: 0.01
- num_train_epochs: 1
- warmup_ratio: 0.5
- fp16: True
- batch_sampler: no_duplicates
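The interaction of learning_rate and warmup_ratio can be sketched in plain Python. This is an approximation under stated assumptions, not the trainer's code: it assumes the linear scheduler named in the full hyperparameter list, a single device, no gradient accumulation, and a step count derived from the 43,494 training samples at batch size 4 (the no_duplicates sampler may change the exact number of batches). The function name `linear_warmup_decay_lr` is hypothetical.

```python
import math

def linear_warmup_decay_lr(step, total_steps, base_lr=7e-6, warmup_ratio=0.5):
    """Learning rate at a given optimizer step.

    Rises linearly from 0 to base_lr over the warmup steps,
    then falls linearly back to 0 at total_steps.
    """
    warmup_steps = math.ceil(total_steps * warmup_ratio)
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)

# Assumed step count: one epoch over 43,494 samples with 4 samples per step.
total_steps = math.ceil(43_494 / 4)

for step in (0, 850, total_steps // 2, total_steps):
    print(step, linear_warmup_decay_lr(step, total_steps))
```

With a warmup ratio of 0.5, the peak learning rate of 7e-06 is reached only halfway through the epoch, so early steps train with a much smaller rate.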
All Hyperparameters
- overwrite_output_dir: False
- do_predict: False
- eval_strategy: steps
- prediction_loss_only: True
- per_device_train_batch_size: 4
- per_device_eval_batch_size: 32
- per_gpu_train_batch_size: None
- per_gpu_eval_batch_size: None
- gradient_accumulation_steps: 1
- eval_accumulation_steps: None
- torch_empty_cache_steps: None
- learning_rate: 7e-06
- weight_decay: 0.01
- adam_beta1: 0.9
- adam_beta2: 0.999
- adam_epsilon: 1e-08
- max_grad_norm: 1.0
- num_train_epochs: 1
- max_steps: -1
- lr_scheduler_type: linear
- lr_scheduler_kwargs: {}
- warmup_ratio: 0.5
- warmup_steps: 0
- log_level: passive
- log_level_replica: warning
- log_on_each_node: True
- logging_nan_inf_filter: True
- save_safetensors: True
- save_on_each_node: False
- save_only_model: False
- restore_callback_states_from_checkpoint: False
- no_cuda: False
- use_cpu: False
- use_mps_device: False
- seed: 42
- data_seed: None
- jit_mode_eval: False
- use_ipex: False
- bf16: False
- fp16: True
- fp16_opt_level: O1
- half_precision_backend: auto
- bf16_full_eval: False
- fp16_full_eval: False
- tf32: None
- local_rank: 0
- ddp_backend: None
- tpu_num_cores: None
- tpu_metrics_debug: False
- debug: []
- dataloader_drop_last: False
- dataloader_num_workers: 0
- dataloader_prefetch_factor: None
- past_index: -1
- disable_tqdm: False
- remove_unused_columns: True
- label_names: None
- load_best_model_at_end: False
- ignore_data_skip: False
- fsdp: []
- fsdp_min_num_params: 0
- fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
- fsdp_transformer_layer_cls_to_wrap: None
- accelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
- deepspeed: None
- label_smoothing_factor: 0.0
- optim: adamw_torch
- optim_args: None
- adafactor: False
- group_by_length: False
- length_column_name: length
- ddp_find_unused_parameters: None
- ddp_bucket_cap_mb: None
- ddp_broadcast_buffers: False
- dataloader_pin_memory: True
- dataloader_persistent_workers: False
- skip_memory_metrics: True
- use_legacy_prediction_loop: False
- push_to_hub: False
- resume_from_checkpoint: None
- hub_model_id: None
- hub_strategy: every_save
- hub_private_repo: None
- hub_always_push: False
- gradient_checkpointing: False
- gradient_checkpointing_kwargs: None
- include_inputs_for_metrics: False
- include_for_metrics: []
- eval_do_concat_batches: True
- fp16_backend: auto
- push_to_hub_model_id: None
- push_to_hub_organization: None
- mp_parameters:
- auto_find_batch_size: False
- full_determinism: False
- torchdynamo: None
- ray_scope: last
- ddp_timeout: 1800
- torch_compile: False
- torch_compile_backend: None
- torch_compile_mode: None
- dispatch_batches: None
- split_batches: None
- include_tokens_per_second: False
- include_num_input_tokens_seen: False
- neftune_noise_alpha: None
- optim_target_modules: None
- batch_eval_metrics: False
- eval_on_start: False
- use_liger_kernel: False
- eval_use_gather_object: False
- average_tokens_across_devices: False
- prompts: None
- batch_sampler: no_duplicates
- multi_dataset_batch_sampler: proportional
Training Logs
Epoch | Step | Training Loss | Validation Loss | discipline-tuned_specter_2_022_cosine_accuracy | discipline-tuned_specter_2_024_cosine_accuracy |
---|---|---|---|---|---|
0.0023 | 25 | 0.2976 | 0.2980 | 0.9518 | - |
0.0046 | 50 | 0.3008 | 0.2969 | 0.9518 | - |
0.0069 | 75 | 0.3088 | 0.2953 | 0.9524 | - |
0.0092 | 100 | 0.3047 | 0.2929 | 0.9530 | - |
0.0115 | 125 | 0.2879 | 0.2897 | 0.9530 | - |
0.0138 | 150 | 0.2705 | 0.2855 | 0.9532 | - |
0.0161 | 175 | 0.2771 | 0.2804 | 0.9536 | - |
0.0184 | 200 | 0.2737 | 0.2744 | 0.9548 | - |
0.0207 | 225 | 0.2737 | 0.2676 | 0.9553 | - |
0.0230 | 250 | 0.2569 | 0.2600 | 0.9557 | - |
0.0253 | 275 | 0.2518 | 0.2512 | 0.9579 | - |
0.0276 | 300 | 0.2445 | 0.2416 | 0.9580 | - |
0.0299 | 325 | 0.2214 | 0.2310 | 0.9591 | - |
0.0322 | 350 | 0.2359 | 0.2204 | 0.9606 | - |
0.0345 | 375 | 0.2072 | 0.2090 | 0.9615 | - |
0.0368 | 400 | 0.1907 | 0.1976 | 0.9618 | - |
0.0391 | 425 | 0.1881 | 0.1850 | 0.9624 | - |
0.0414 | 450 | 0.1842 | 0.1733 | 0.9637 | - |
0.0437 | 475 | 0.1618 | 0.1628 | 0.9646 | - |
0.0460 | 500 | 0.1638 | 0.1533 | 0.9645 | - |
0.0483 | 525 | 0.1569 | 0.1440 | 0.9648 | - |
0.0506 | 550 | 0.1473 | 0.1354 | 0.9657 | - |
0.0529 | 575 | 0.1333 | 0.1281 | 0.9671 | - |
0.0552 | 600 | 0.1481 | 0.1223 | 0.9671 | - |
0.0575 | 625 | 0.1263 | 0.1167 | 0.9675 | - |
0.0598 | 650 | 0.114 | 0.1120 | 0.9684 | - |
0.0621 | 675 | 0.1097 | 0.1081 | 0.9693 | - |
0.0644 | 700 | 0.1152 | 0.1044 | 0.9698 | - |
0.0667 | 725 | 0.1009 | 0.0999 | 0.9705 | - |
0.0690 | 750 | 0.0895 | 0.0961 | 0.9709 | - |
0.0713 | 775 | 0.0855 | 0.0934 | 0.9711 | - |
0.0736 | 800 | 0.0853 | 0.0912 | 0.9715 | - |
0.0759 | 825 | 0.0942 | 0.0885 | 0.9714 | - |
0.0782 | 850 | 0.1035 | - | - | 0.9710 |
Framework Versions
- Python: 3.10.12
- Sentence Transformers: 3.3.1
- Transformers: 4.49.0.dev0
- PyTorch: 2.5.1+cu121
- Accelerate: 1.2.1
- Datasets: 3.2.0
- Tokenizers: 0.21.0
Citation
BibTeX
Sentence Transformers
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
TripletLoss
@misc{hermans2017defense,
title={In Defense of the Triplet Loss for Person Re-Identification},
author={Alexander Hermans and Lucas Beyer and Bastian Leibe},
year={2017},
eprint={1703.07737},
archivePrefix={arXiv},
primaryClass={cs.CV}
}