corpus

Emoji Identification and Prediction in Hebrew Political Corpus

Aim/Purpose: Any system that aims to address the task of modeling social media communication need to deal with the usage of emojis. Efficient prediction of the most likely emoji given the text of a message may help to improve different NLP tasks. Background: We explore two tasks: emoji identification and emoji prediction. While emoji prediction is a classification task of predicting the emojis that appear in a given text message, emoji identification is the complementary preceding task of determining if a given text message includes emojies. Methodology: We adopt a supervised Machine Learning (ML) approach. We compare two text representation approaches, i.e., n-grams and character n-grams and analyze the contribution of additional metadata features to the classification. Contribution: The task of emoji identification is novel. We extend the definition of the emoji prediction task by allowing to use not only the textual content but also meta-data analysis. Findings: Metadata improve the classification accuracy in the task of emoji identification. In the task of emoji prediction it is better to apply feature selection. Recommendations for Practitioners: In many of the cases the classifier decision seems fitter to the comment content than the emoji that was chosen by the commentator. The classifier may be useful for emoji suggestion. Recommendation for Researchers: Explore character-based representations rather than word-based representations in the case of morphologically rich languages. Impact on Society: Improve the modeling of social media communication. Future Research: We plan to address the multi-label setting of the emoji prediction task and to investigate the deep learning approach for both of our classification tasks




corpus

Corpus Processing of Multi-Word Discourse Markers for Advanced Learners

Aim/Purpose. The most crucial aspects of teaching a foreign language to more advanced learners are building an awareness of discourse modes, how to regulate discourse, and the pragmatic properties of discourse components. However, in different languages, the connections and structure of discourse are ensured by different linguistic means which makes matters complicated for the learner. Background. By uncovering regularities in a foreign language and comparing them with patterns in one’s own tongue, the corpus research method offers the student unique opportunities to acquire linguistic knowledge about discourse markers. This paper reports on an investigation of the functions of multi-word discourse markers. Methodology. In our research, we combine the alignment model of the phrase-based statistical machine translation and manual treatment of the data in order to examine English multi-word discourse markers and their equivalents in Lithuanian and Hebrew translations by researching their changes in translation. After establishing the full list of multi-word discourse markers in our generated parallel corpus, we research how the multi-word discourse markers are treated in translation. Contribution. Creating a parallel research corpus to identify multi-word expressions used as discourse markers, analyzing how they are translated into Lithuanian and Hebrew, and attempting to determine why the translators made the choices add value to corpus-driven research and how to manage discourse. Findings. Our research proves that there is a possible context-based influence guiding the translation to choose a particle or other lexical item integration in Lithuanian or Hebrew translated discourse markers to express the rhetorical domain which could be related to the so-called phenomenon of “over-specification.” Recommendations for Practitioners. The comparative examination of discourse markers provides language instructors and translators with more specific information about the roles of discourse markers. Recommendations for Researchers. Understanding the multifunctionality of discourse markers provides new avenues for discourse marker application in translation research. Impact on Society. The current study may be a useful method to strengthen students’ language awareness and analytic skills and is particularly important for students specializing in English philology or translation. Beyond the empirical research, an extensive parallel data resource has been created to be openly used. Future Research. It should be noted that the observed phenomenon of “over-specification” could be analyzed further in future research.




corpus

On the Ground in Corpus Christi Whats Next for Offshore Energy Safety

Exporting just over half of U.S. crude oil exports in 2020, Corpus Christi, Texas, is on its way to becoming the Gulf of Mexico’s oil hub. As tankers equipped with millions of barrels of oil, cruise in and out of the city, safety is a top priority for the Gulf Research Program (GRP).




corpus

Improper modernism : Djuna Barnes's bewildering corpus [Electronic book] / Daniela Caselli.

Abingdon : Routledge, 2016.




corpus

A Quantum Algorithm To Locate Unknown Hashes For Known N-Grams Within A Large Malware Corpus. (arXiv:2005.02911v2 [quant-ph] UPDATED)

Quantum computing has evolved quickly in recent years and is showing significant benefits in a variety of fields. Malware analysis is one of those fields that could also take advantage of quantum computing. The combination of software used to locate the most frequent hashes and $n$-grams between benign and malicious software (KiloGram) and a quantum search algorithm could be beneficial, by loading the table of hashes and $n$-grams into a quantum computer, and thereby speeding up the process of mapping $n$-grams to their hashes. The first phase will be to use KiloGram to find the top-$k$ hashes and $n$-grams for a large malware corpus. From here, the resulting hash table is then loaded into a quantum machine. A quantum search algorithm is then used search among every permutation of the entangled key and value pairs to find the desired hash value. This prevents one from having to re-compute hashes for a set of $n$-grams, which can take on average $O(MN)$ time, whereas the quantum algorithm could take $O(sqrt{N})$ in the number of table lookups to find the desired hash values.




corpus

Cross-Lingual Semantic Role Labeling with High-Quality Translated Training Corpus. (arXiv:2004.06295v2 [cs.CL] UPDATED)

Many efforts of research are devoted to semantic role labeling (SRL) which is crucial for natural language understanding. Supervised approaches have achieved impressing performances when large-scale corpora are available for resource-rich languages such as English. While for the low-resource languages with no annotated SRL dataset, it is still challenging to obtain competitive performances. Cross-lingual SRL is one promising way to address the problem, which has achieved great advances with the help of model transferring and annotation projection. In this paper, we propose a novel alternative based on corpus translation, constructing high-quality training datasets for the target languages from the source gold-standard SRL annotations. Experimental results on Universal Proposition Bank show that the translation-based method is highly effective, and the automatic pseudo datasets can improve the target-language SRL performances significantly.




corpus

Port of Corpus Christi Auth. v. Sherwin Alumina Company

(United States Fifth Circuit) - Affirmed. The bankruptcy court's rejection of a Texas Port Authority's claims of sovereign immunity and fraud in their gambit to invalidate a bankruptcy sale that extinguished an easement they held was affirmed because there was no Eleventh Amendment violation or basis to claim fraud.




corpus

Port of Corpus Christi Auth. v. Sherwin Alumina Company

(United States Fifth Circuit) - Affirmed. The bankruptcy court's rejection of a Texas Port Authority's claims of sovereign immunity and fraud in their gambit to invalidate a bankruptcy sale that extinguished an easement they held was affirmed because there was no Eleventh Amendment violation or basis to claim fraud.




corpus

habeas corpus




corpus

Dissertatio medica, exhibens cogitationes physiologicas de vita, et vivificatione materiae humanum corpus constituentis / Joanni Theodoro vander Kemp.

Edinburgi : Excudebant Balfour et Smellie, 1782.




corpus

RBI corpus for MFs: Rs 4,000 crore borrowed by banks

The window was announced after the markets were roiled by news, last Thursday, of Franklin Templeton MF winding up six of its debt funds amid mounting redemptions. It will remain open till May 11.




corpus

Increased Notching of the Corpus Callosum in Fetal Alcohol Spectrum Disorder: A Callosal Misunderstanding? [PEDIATRICS]

BACKGROUND AND PURPOSE:

In the medicolegal literature, notching of the corpus callosum has been reported to be associated with fetal alcohol spectrum disorders. Our purpose was to analyze the prevalence of notching of the corpus callosum in a fetal alcohol spectrum disorders group and a healthy population to determine whether notching occurs with increased frequency in the fetal alcohol spectrum disorders population.

MATERIALS AND METHODS:

We performed a multicenter search for cases of fetal alcohol spectrum disorders and included all patients who had a sagittal T1-weighted brain MR imaging. Patients with concomitant intracranial pathology were excluded. The corpus callosum was examined for notches using previously published methods. A 2 test was used to compare the fetal alcohol spectrum disorders and healthy groups.

RESULTS:

Thirty-three of 59 patients with fetal alcohol spectrum disorders (0–44 years of age) identified across all centers had corpus callosum notching. Of these, 8 had an anterior corpus callosum notch (prevalence, 13.6%), 23 had a posterior corpus callosum notch (prevalence, 39%), and 2 patients demonstrated undulated morphology (prevalence, 3.4%). In the healthy population, the anterior notch prevalence was 139/875 (15.8%), posterior notch prevalence was 378/875 (43.2%), and undulating prevalence was 37/875 (4.2%). There was no significant difference among the anterior (P = .635), posterior (P = .526), and undulating (P = .755) notch prevalence in the fetal alcohol spectrum disorders and healthy groups.

CONCLUSIONS:

There was no significant difference in notching of the corpus callosum between patients with fetal alcohol spectrum disorders and the healthy population. Although reported to be a marker of fetal alcohol spectrum disorders, notching of the corpus callosum should not be viewed as a specific finding associated with fetal alcohol spectrum disorders.




corpus

Ship Operator Pleads Guilty to Crimes Related to Pollution from Cargo Ship Traveling to Corpus Christi, Texas

A ship management company headquartered in Greece that operated a 29,414 - ton cargo ship that made calls in multiple ports in Texas pleaded guilty and was sentenced late yesterday in federal court in Corpus Christi for deliberately concealing pollution discharges from the ship directly into the sea and for failing to notify the U. S. Coast Guard of numerous safety hazards on board the vessel.



  • OPA Press Releases

corpus

Justice Department Files Lawsuit Against Corpus Christi, Texas, Police Department for Sex Discrimination

The Justice Department today filed a lawsuit against the city of Corpus Christi, Texas, alleging that the city’s police department engaged in a pattern or practice of employment discrimination against women in violation of Title VII of the Civil Rights Act of 1964.



  • OPA Press Releases

corpus

Justice Department Settles Sex Discrimination Lawsuit Against City of Corpus Christi, Texas, Police Department

The Department of Justice announced today that it has entered into a settlement to resolve the department’s allegations that the city of Corpus Christi, Texas, violated Title VII of the Civil Rights Act of 1964 by discriminating against women when hiring entry-level police officers.



  • OPA Press Releases

corpus

U.S. District Court Orders Community Notice to Corpus Christi, Texas, Residents Who May Be Victims of Environmental Crimes by Citgo Refinery

Persons living around the CITGO refinery in Corpus Christi, Texas, who suffered immediate negative health effects from emissions from two large tanks at the facility that were operated between January 1994 and May 2003 in violation of the federal Clean Air Act, may be crime victims in United States v. CITGO Petroleum Corporation et al.



  • OPA Press Releases

corpus

Justice Department Settles Sex Discrimination Lawsuit Against Corpus Christi, Texas, Police Department

The Department of Justice announced today that it has reached a final settlement with the city of Corpus Christi, Texas, to resolve the department’s claim that the city violated Title VII of the Civil Rights Act of 1964 by engaging in a pattern or practice of discrimination against female applicants for entry-level police officer positions.



  • OPA Press Releases

corpus

ICAI - 100 crores corpus earmarked for scholarship to CA students

ICAI - 100 crores corpus earmarked for scholarship to CA students ...




corpus

Govt to set up dairy development fund with corpus of Rs 8,000 cr

Assistance of up to Rs 75 lakhs to be provided to every e-NAM (National Agricultural Market)




corpus

Central issues in jurisprudence : justice, law and rights / N.E. Simmonds, Fellow of Corpus Christi College, Professor of Jurisprudence in the University of Cambridge

Simmonds, N. E. (Nigel E.), author




corpus

Crafting and executing strategy : the quest for competitive advantage : concepts / Arthur A. Thompson, The University of Alabama, Margaret A. Peteraf, Dartmouth College, John E. Gamble, Texas A&M University-Corpus Christi, A.J. Strickland III, The Uni

Thompson, Arthur A., 1940- author




corpus

Corpus-based translation and interpreting studies in Chinese contexts: present and future / Kaibao Hu, Kyung Hye Kim, editors

Online Resource




corpus

Embodied conceptualization or neural realization: a corpus-driven study of Mandarin synaesthetic adjectives / Qingqing Zhao

Online Resource




corpus

From minimal contrast to meaning construct: corpus-based, near synonym driven approaches to Chinese lexical semantics / Qi Su, Weidong Zhan, editors

Online Resource




corpus

Building and using the Siarad Corpus: bilingual conversations in Welsh and English / Margaret Deuchar, Peredur Davies, Kevin Donnelly

Hayden Library - P115.5.G7 D48 2018




corpus

The Yehud stamp impressions [electronic resource] : a corpus of inscribed impressions from the Persian and Hellenistic periods in Judah / Oded Lipschits and David S. Vanderhooft

Lipschitz, Oded




corpus

The corpus of Al-Isfizārī in the sciences of weights and mechanical devices: new Arabic texts in theoretical and practical mechanics from the early XIIth century: English translation, partial analysis and historical context / by Mohammed Abattouy and Sali

Rotch Library - QC87.I8413 2015




corpus

Audubon Florida Records, 1900-1970, Box 3 Folder 29 : Corpus Christi Area, TX